InferX — Serverless GPU Inference Platform for Production Workloads

Funcpod

Tenant Namespace Podname Model
public Trial public/Trial/L3.3-70B-Loki-V2.0/94/139 L3.3-70B-Loki-V2.0

State

State Time
Init 2026-03-01 23:31:10
PullingImage 2026-03-01 23:31:10
Creating 2026-03-01 23:31:10
Restoring 2026-03-01 23:31:13
Standby 2026-03-01 23:31:13
Resuming 2026-03-01 23:34:42
Ready 2026-03-01 23:34:52

Log

INFO 03-01 23:34:52 [logger.py:42] Received request cmpl-71a8ad67ba87e1f5f7766690267748bd-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=800, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.4:123 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:34:52 [async_llm.py:261] Added request cmpl-71a8ad67ba87e1f5f7766690267748bd-0.
INFO 03-01 23:34:53 [logger.py:42] Received request cmpl-2775d2d5148148dd8ef27777ec29d7d8-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:34:53 [async_llm.py:261] Added request cmpl-2775d2d5148148dd8ef27777ec29d7d8-0.
INFO 03-01 23:34:54 [logger.py:42] Received request cmpl-7b5d028b09854725912a8014d02c5c6d-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:34:54 [async_llm.py:261] Added request cmpl-7b5d028b09854725912a8014d02c5c6d-0.
INFO 03-01 23:34:55 [logger.py:42] Received request cmpl-85fafb65fd3f4511b1d235e15d085ad5-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:34:55 [async_llm.py:261] Added request cmpl-85fafb65fd3f4511b1d235e15d085ad5-0.
INFO 03-01 23:34:56 [logger.py:42] Received request cmpl-0a113c214e9b41039acc20ba4f3e64d9-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:34:56 [async_llm.py:261] Added request cmpl-0a113c214e9b41039acc20ba4f3e64d9-0.
INFO 03-01 23:34:57 [loggers.py:116] Engine 000: Avg prompt throughput: 15.5 tokens/s, Avg generation throughput: 19.5 tokens/s, Running: 1 reqs, Waiting: 0 reqs, GPU KV cache usage: 1.3%, Prefix cache hit rate: 43.0%
INFO 03-01 23:34:58 [logger.py:42] Received request cmpl-c06e3835461f405e9788b81d58e28932-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:34:58 [async_llm.py:261] Added request cmpl-c06e3835461f405e9788b81d58e28932-0.
INFO 03-01 23:34:59 [logger.py:42] Received request cmpl-3a79b59f53004f8ba18ee6304151c9fa-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:34:59 [async_llm.py:261] Added request cmpl-3a79b59f53004f8ba18ee6304151c9fa-0.
INFO 03-01 23:35:00 [logger.py:42] Received request cmpl-1f4eadaa3af24b8d955c47dfb1fb28f2-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:35:00 [async_llm.py:261] Added request cmpl-1f4eadaa3af24b8d955c47dfb1fb28f2-0.
INFO 03-01 23:35:01 [logger.py:42] Received request cmpl-44bb0a33706145d5acd9edc7cb240e49-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:35:01 [async_llm.py:261] Added request cmpl-44bb0a33706145d5acd9edc7cb240e49-0.
INFO 03-01 23:35:02 [logger.py:42] Received request cmpl-80dac4ae7a244bb2b96d89a13c6c79ea-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:35:02 [async_llm.py:261] Added request cmpl-80dac4ae7a244bb2b96d89a13c6c79ea-0.
INFO 03-01 23:35:03 [logger.py:42] Received request cmpl-bab21b3f622d490093d04003e374b157-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:35:03 [async_llm.py:261] Added request cmpl-bab21b3f622d490093d04003e374b157-0.
INFO 03-01 23:35:05 [logger.py:42] Received request cmpl-3dadbdaa05bf4bc2b87066196767100e-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:35:05 [async_llm.py:261] Added request cmpl-3dadbdaa05bf4bc2b87066196767100e-0.
INFO 03-01 23:35:06 [logger.py:42] Received request cmpl-bd3f9e24137247549c0035b8e9455a1c-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:35:06 [async_llm.py:261] Added request cmpl-bd3f9e24137247549c0035b8e9455a1c-0.
INFO 03-01 23:35:07 [logger.py:42] Received request cmpl-1c41e2408f7641b29f0300b2a65bf180-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:35:07 [async_llm.py:261] Added request cmpl-1c41e2408f7641b29f0300b2a65bf180-0.
INFO 03-01 23:35:07 [loggers.py:116] Engine 000: Avg prompt throughput: 27.9 tokens/s, Avg generation throughput: 21.5 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 48.2%
INFO 03-01 23:35:08 [logger.py:42] Received request cmpl-9fa291ddb76f4c26acd4c293048f976f-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:35:08 [async_llm.py:261] Added request cmpl-9fa291ddb76f4c26acd4c293048f976f-0.
INFO 03-01 23:35:09 [logger.py:42] Received request cmpl-95c65341ddf54c869d63df434ec0f4e6-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:35:09 [async_llm.py:261] Added request cmpl-95c65341ddf54c869d63df434ec0f4e6-0.
INFO 03-01 23:35:10 [logger.py:42] Received request cmpl-8514ec4c1653423481989e54b62723b5-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:35:10 [async_llm.py:261] Added request cmpl-8514ec4c1653423481989e54b62723b5-0.
INFO 03-01 23:35:12 [logger.py:42] Received request cmpl-e8d0f376152a4f37ae736e528c65e4fc-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:35:12 [async_llm.py:261] Added request cmpl-e8d0f376152a4f37ae736e528c65e4fc-0.
INFO 03-01 23:35:13 [logger.py:42] Received request cmpl-3649f3f9f6174f25b09db8fd75015d06-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:35:13 [async_llm.py:261] Added request cmpl-3649f3f9f6174f25b09db8fd75015d06-0.
INFO 03-01 23:35:14 [logger.py:42] Received request cmpl-ec24201418e1440db3e56339cff46a01-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:35:14 [async_llm.py:261] Added request cmpl-ec24201418e1440db3e56339cff46a01-0.
INFO 03-01 23:35:15 [logger.py:42] Received request cmpl-113dd35c6937492aa820cd466ac06458-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:35:15 [async_llm.py:261] Added request cmpl-113dd35c6937492aa820cd466ac06458-0.
INFO 03-01 23:35:16 [logger.py:42] Received request cmpl-bc8b629a12824a0d95c765b8d966b246-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:35:16 [async_llm.py:261] Added request cmpl-bc8b629a12824a0d95c765b8d966b246-0.
INFO 03-01 23:35:17 [logger.py:42] Received request cmpl-c9478e4329b0449886fb330f779a1b32-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:35:17 [async_llm.py:261] Added request cmpl-c9478e4329b0449886fb330f779a1b32-0.
INFO 03-01 23:35:17 [loggers.py:116] Engine 000: Avg prompt throughput: 27.9 tokens/s, Avg generation throughput: 4.1 tokens/s, Running: 1 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.3%, Prefix cache hit rate: 49.5%
INFO 03-01 23:35:19 [logger.py:42] Received request cmpl-4aa9e2863b5e46669eeae499c1e01b90-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:35:19 [async_llm.py:261] Added request cmpl-4aa9e2863b5e46669eeae499c1e01b90-0.
INFO 03-01 23:35:20 [logger.py:42] Received request cmpl-82f01d325a92486791e7a967ac2cdab3-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:35:20 [async_llm.py:261] Added request cmpl-82f01d325a92486791e7a967ac2cdab3-0.
INFO 03-01 23:35:21 [logger.py:42] Received request cmpl-aba5defa79c44fbbbba7bff09632eebe-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:35:21 [async_llm.py:261] Added request cmpl-aba5defa79c44fbbbba7bff09632eebe-0.
INFO 03-01 23:35:22 [logger.py:42] Received request cmpl-cce669600d314a11b6384058ea967198-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:35:22 [async_llm.py:261] Added request cmpl-cce669600d314a11b6384058ea967198-0.
INFO 03-01 23:35:23 [logger.py:42] Received request cmpl-1939120b3e264894bb6df28e69974e3b-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:35:23 [async_llm.py:261] Added request cmpl-1939120b3e264894bb6df28e69974e3b-0.
INFO 03-01 23:35:24 [logger.py:42] Received request cmpl-49d61cd304944ac8b513541b42db599c-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:35:24 [async_llm.py:261] Added request cmpl-49d61cd304944ac8b513541b42db599c-0.
INFO 03-01 23:35:26 [logger.py:42] Received request cmpl-a671cd163402477dac225bf686d2b275-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:35:26 [async_llm.py:261] Added request cmpl-a671cd163402477dac225bf686d2b275-0.
INFO 03-01 23:35:27 [logger.py:42] Received request cmpl-6c41d3dea23c449abac5a110dea6b73a-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:35:27 [async_llm.py:261] Added request cmpl-6c41d3dea23c449abac5a110dea6b73a-0.
INFO 03-01 23:35:27 [loggers.py:116] Engine 000: Avg prompt throughput: 24.8 tokens/s, Avg generation throughput: 4.4 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 50.0%
INFO 03-01 23:35:28 [logger.py:42] Received request cmpl-438dda8b8fdf48b5929611debba9f58d-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:35:28 [async_llm.py:261] Added request cmpl-438dda8b8fdf48b5929611debba9f58d-0.
INFO 03-01 23:35:29 [logger.py:42] Received request cmpl-a0e8d114cdfd40baa19ce46508463546-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:35:29 [async_llm.py:261] Added request cmpl-a0e8d114cdfd40baa19ce46508463546-0.
INFO 03-01 23:35:30 [logger.py:42] Received request cmpl-8ce0f163077c4b32aac9bdf7e468addc-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:35:30 [async_llm.py:261] Added request cmpl-8ce0f163077c4b32aac9bdf7e468addc-0.
INFO 03-01 23:35:31 [logger.py:42] Received request cmpl-104f0dc7d2364078b22fad7eda549cda-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:35:31 [async_llm.py:261] Added request cmpl-104f0dc7d2364078b22fad7eda549cda-0.
INFO 03-01 23:35:33 [logger.py:42] Received request cmpl-3853e2ed54764d659588249b3768f5d6-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:35:33 [async_llm.py:261] Added request cmpl-3853e2ed54764d659588249b3768f5d6-0.
INFO 03-01 23:35:34 [logger.py:42] Received request cmpl-7c7f5e5472b841388ace2dbf8680860c-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:35:34 [async_llm.py:261] Added request cmpl-7c7f5e5472b841388ace2dbf8680860c-0.
INFO 03-01 23:35:35 [logger.py:42] Received request cmpl-0343311c9f3948a49f13abeeb99c4b25-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:35:35 [async_llm.py:261] Added request cmpl-0343311c9f3948a49f13abeeb99c4b25-0.
INFO 03-01 23:35:36 [logger.py:42] Received request cmpl-f125640493f64ab78a93a8a9d6e8d6ea-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:35:36 [async_llm.py:261] Added request cmpl-f125640493f64ab78a93a8a9d6e8d6ea-0.
INFO 03-01 23:35:37 [logger.py:42] Received request cmpl-12afcd53cd35468d968188023cadff2b-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:35:37 [async_llm.py:261] Added request cmpl-12afcd53cd35468d968188023cadff2b-0.
INFO 03-01 23:35:37 [loggers.py:116] Engine 000: Avg prompt throughput: 27.9 tokens/s, Avg generation throughput: 4.5 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 50.4%
INFO 03-01 23:35:38 [logger.py:42] Received request cmpl-b389f82f18a844f58e8c6022175235f2-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:35:38 [async_llm.py:261] Added request cmpl-b389f82f18a844f58e8c6022175235f2-0.
INFO 03-01 23:35:40 [logger.py:42] Received request cmpl-454747fd392c4d01bf24aa39e897f8a4-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:35:40 [async_llm.py:261] Added request cmpl-454747fd392c4d01bf24aa39e897f8a4-0.
INFO 03-01 23:35:41 [logger.py:42] Received request cmpl-9855535b54ba4652afc693c928b9ae2d-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:35:41 [async_llm.py:261] Added request cmpl-9855535b54ba4652afc693c928b9ae2d-0.
INFO 03-01 23:35:42 [logger.py:42] Received request cmpl-b81ddff040ec402fb2b1e7586f3c71ea-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:35:42 [async_llm.py:261] Added request cmpl-b81ddff040ec402fb2b1e7586f3c71ea-0.
INFO 03-01 23:35:43 [logger.py:42] Received request cmpl-195d02aca8c54ec5992ed1deaefe4781-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:35:43 [async_llm.py:261] Added request cmpl-195d02aca8c54ec5992ed1deaefe4781-0.
INFO 03-01 23:35:44 [logger.py:42] Received request cmpl-16df19260a9f4705868231a4210b861d-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:35:44 [async_llm.py:261] Added request cmpl-16df19260a9f4705868231a4210b861d-0.
INFO 03-01 23:35:45 [logger.py:42] Received request cmpl-7c36c1b01f5e47ac834a09f87e567f49-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:35:45 [async_llm.py:261] Added request cmpl-7c36c1b01f5e47ac834a09f87e567f49-0.
INFO 03-01 23:35:46 [logger.py:42] Received request cmpl-05b5a92a69a84eda823124df994a436a-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:35:46 [async_llm.py:261] Added request cmpl-05b5a92a69a84eda823124df994a436a-0.
INFO 03-01 23:35:47 [loggers.py:116] Engine 000: Avg prompt throughput: 24.8 tokens/s, Avg generation throughput: 4.0 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 50.6%
INFO 03-01 23:35:48 [logger.py:42] Received request cmpl-548b4cbd59b24a95b3e4246534bdf9d2-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:35:48 [async_llm.py:261] Added request cmpl-548b4cbd59b24a95b3e4246534bdf9d2-0.
INFO 03-01 23:35:49 [logger.py:42] Received request cmpl-63f18c07a5d947709ac87b67a682208a-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:35:49 [async_llm.py:261] Added request cmpl-63f18c07a5d947709ac87b67a682208a-0.
INFO 03-01 23:35:50 [logger.py:42] Received request cmpl-c6a31068b3804624ad0f511e0d88c0b3-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:35:50 [async_llm.py:261] Added request cmpl-c6a31068b3804624ad0f511e0d88c0b3-0.
INFO 03-01 23:35:51 [logger.py:42] Received request cmpl-b676329e190048858a57cd0849a1cea6-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:35:51 [async_llm.py:261] Added request cmpl-b676329e190048858a57cd0849a1cea6-0.
INFO 03-01 23:35:52 [logger.py:42] Received request cmpl-6993223e2a744db4a085f3b11ff0ec45-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:35:52 [async_llm.py:261] Added request cmpl-6993223e2a744db4a085f3b11ff0ec45-0.
INFO 03-01 23:35:53 [logger.py:42] Received request cmpl-f860e538cd724bab8bbdb515950c669a-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:35:53 [async_llm.py:261] Added request cmpl-f860e538cd724bab8bbdb515950c669a-0.
INFO 03-01 23:35:55 [logger.py:42] Received request cmpl-7cbba80273d0406ea7904ac6d232afb7-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:35:55 [async_llm.py:261] Added request cmpl-7cbba80273d0406ea7904ac6d232afb7-0.
INFO 03-01 23:35:56 [logger.py:42] Received request cmpl-aba88f2c1df4483c937d0b24a23c3658-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:35:56 [async_llm.py:261] Added request cmpl-aba88f2c1df4483c937d0b24a23c3658-0.
INFO 03-01 23:35:57 [logger.py:42] Received request cmpl-51f642facb0b42868c5dedef6c0a513f-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:35:57 [async_llm.py:261] Added request cmpl-51f642facb0b42868c5dedef6c0a513f-0.
INFO 03-01 23:35:57 [loggers.py:116] Engine 000: Avg prompt throughput: 27.9 tokens/s, Avg generation throughput: 4.5 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 50.7%
INFO 03-01 23:35:58 [logger.py:42] Received request cmpl-d7ab2931d9d2426eb89fdafa7b187d61-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:35:58 [async_llm.py:261] Added request cmpl-d7ab2931d9d2426eb89fdafa7b187d61-0.
INFO 03-01 23:35:59 [logger.py:42] Received request cmpl-285f8207b07943aba691d77f7abb7bf6-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:35:59 [async_llm.py:261] Added request cmpl-285f8207b07943aba691d77f7abb7bf6-0.
INFO 03-01 23:36:00 [logger.py:42] Received request cmpl-24f02c6f01e44bc48baac04c5bd0a398-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:36:00 [async_llm.py:261] Added request cmpl-24f02c6f01e44bc48baac04c5bd0a398-0.
INFO 03-01 23:36:01 [logger.py:42] Received request cmpl-0efcfeddb0c840499db3f378b69c83fb-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:36:01 [async_llm.py:261] Added request cmpl-0efcfeddb0c840499db3f378b69c83fb-0.
INFO 03-01 23:36:03 [logger.py:42] Received request cmpl-2f97099d42ed41feb6830911f6125211-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:36:03 [async_llm.py:261] Added request cmpl-2f97099d42ed41feb6830911f6125211-0.
INFO 03-01 23:36:04 [logger.py:42] Received request cmpl-1a73f4d2e9e846f498825473e61b2989-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:36:04 [async_llm.py:261] Added request cmpl-1a73f4d2e9e846f498825473e61b2989-0.
INFO 03-01 23:36:05 [logger.py:42] Received request cmpl-aa76133071914b2eb150f154c1f169aa-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:36:05 [async_llm.py:261] Added request cmpl-aa76133071914b2eb150f154c1f169aa-0.
INFO 03-01 23:36:06 [logger.py:42] Received request cmpl-de0ed0f7f27b494dae137ab7a95adb32-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:36:06 [async_llm.py:261] Added request cmpl-de0ed0f7f27b494dae137ab7a95adb32-0.
INFO 03-01 23:36:07 [logger.py:42] Received request cmpl-93a47559a8f3431c844e9f9b154c49dc-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:36:07 [async_llm.py:261] Added request cmpl-93a47559a8f3431c844e9f9b154c49dc-0.
INFO 03-01 23:36:07 [loggers.py:116] Engine 000: Avg prompt throughput: 27.9 tokens/s, Avg generation throughput: 4.5 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 50.8%
INFO 03-01 23:36:08 [logger.py:42] Received request cmpl-8b1ae20ee76746cfa188d7237413b9ba-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:36:08 [async_llm.py:261] Added request cmpl-8b1ae20ee76746cfa188d7237413b9ba-0.
INFO 03-01 23:36:10 [logger.py:42] Received request cmpl-3409b7b520834d3a970fe03c3b89242d-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:36:10 [async_llm.py:261] Added request cmpl-3409b7b520834d3a970fe03c3b89242d-0.
INFO 03-01 23:36:11 [logger.py:42] Received request cmpl-ce1e3574d45146a2b1d8083f65c4ad43-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:36:11 [async_llm.py:261] Added request cmpl-ce1e3574d45146a2b1d8083f65c4ad43-0.
INFO 03-01 23:36:12 [logger.py:42] Received request cmpl-e895e58513f5496fbbdc8c13f693492c-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:36:12 [async_llm.py:261] Added request cmpl-e895e58513f5496fbbdc8c13f693492c-0.
INFO 03-01 23:36:13 [logger.py:42] Received request cmpl-bdf8a729ac6040ed9294072b11f7431c-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:36:13 [async_llm.py:261] Added request cmpl-bdf8a729ac6040ed9294072b11f7431c-0.
INFO 03-01 23:36:14 [logger.py:42] Received request cmpl-dbd0022337524f76af976c269edc24fa-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:36:14 [async_llm.py:261] Added request cmpl-dbd0022337524f76af976c269edc24fa-0.
INFO 03-01 23:36:15 [logger.py:42] Received request cmpl-c1c504e586c24efabb3c389b0880f6ac-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:36:15 [async_llm.py:261] Added request cmpl-c1c504e586c24efabb3c389b0880f6ac-0.
INFO 03-01 23:36:17 [logger.py:42] Received request cmpl-896ad8a8954d4e4e8098e005b51b5084-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:36:17 [async_llm.py:261] Added request cmpl-896ad8a8954d4e4e8098e005b51b5084-0.
INFO 03-01 23:36:17 [loggers.py:116] Engine 000: Avg prompt throughput: 24.8 tokens/s, Avg generation throughput: 4.0 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 50.9%
INFO 03-01 23:36:18 [logger.py:42] Received request cmpl-39dae8f059b748779e93b4d57ef5e489-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:36:18 [async_llm.py:261] Added request cmpl-39dae8f059b748779e93b4d57ef5e489-0.
INFO 03-01 23:36:19 [logger.py:42] Received request cmpl-9afcbd92eb4141b5b8b1381eb35c0830-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:36:19 [async_llm.py:261] Added request cmpl-9afcbd92eb4141b5b8b1381eb35c0830-0.
INFO 03-01 23:36:20 [logger.py:42] Received request cmpl-b1710ca3e16e434ca281087944502bc2-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:36:20 [async_llm.py:261] Added request cmpl-b1710ca3e16e434ca281087944502bc2-0.
INFO 03-01 23:36:21 [logger.py:42] Received request cmpl-57c56dfabe6549c89ca9848ae0da69e7-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:36:21 [async_llm.py:261] Added request cmpl-57c56dfabe6549c89ca9848ae0da69e7-0.
INFO 03-01 23:36:22 [logger.py:42] Received request cmpl-91f33d63fee94c89aab1d42c67800977-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:36:22 [async_llm.py:261] Added request cmpl-91f33d63fee94c89aab1d42c67800977-0.
INFO 03-01 23:36:23 [logger.py:42] Received request cmpl-130586da50de411983968f48777f1a81-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:36:23 [async_llm.py:261] Added request cmpl-130586da50de411983968f48777f1a81-0.
INFO 03-01 23:36:25 [logger.py:42] Received request cmpl-ff48bceef24b4bee8e105ecb52b37458-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:36:25 [async_llm.py:261] Added request cmpl-ff48bceef24b4bee8e105ecb52b37458-0.
INFO 03-01 23:36:26 [logger.py:42] Received request cmpl-59d86d7955964b569bacb95afcaca031-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:36:26 [async_llm.py:261] Added request cmpl-59d86d7955964b569bacb95afcaca031-0.
INFO 03-01 23:36:27 [logger.py:42] Received request cmpl-85d1a9f6fd754653968417c29959a5ed-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:36:27 [async_llm.py:261] Added request cmpl-85d1a9f6fd754653968417c29959a5ed-0.
INFO 03-01 23:36:27 [loggers.py:116] Engine 000: Avg prompt throughput: 27.9 tokens/s, Avg generation throughput: 4.5 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.0%
INFO 03-01 23:36:28 [logger.py:42] Received request cmpl-d122a70d1d7b4798a76ee01ead10850e-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:36:28 [async_llm.py:261] Added request cmpl-d122a70d1d7b4798a76ee01ead10850e-0.
INFO 03-01 23:36:29 [logger.py:42] Received request cmpl-f021a27480854982a6757f93c7f9a6ce-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:36:29 [async_llm.py:261] Added request cmpl-f021a27480854982a6757f93c7f9a6ce-0.
INFO 03-01 23:36:30 [logger.py:42] Received request cmpl-1b7f7e5f347141968da0d080970c5eca-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:36:30 [async_llm.py:261] Added request cmpl-1b7f7e5f347141968da0d080970c5eca-0.
INFO 03-01 23:36:32 [logger.py:42] Received request cmpl-a731791aa37e41eb8d8b9a09f228b7ff-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:36:32 [async_llm.py:261] Added request cmpl-a731791aa37e41eb8d8b9a09f228b7ff-0.
INFO 03-01 23:36:33 [logger.py:42] Received request cmpl-a597b253881a43ae933539706b77a024-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:36:33 [async_llm.py:261] Added request cmpl-a597b253881a43ae933539706b77a024-0.
INFO 03-01 23:36:34 [logger.py:42] Received request cmpl-a03c9ab69ec841a196c52e973b9381e2-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:36:34 [async_llm.py:261] Added request cmpl-a03c9ab69ec841a196c52e973b9381e2-0.
INFO 03-01 23:36:35 [logger.py:42] Received request cmpl-87d2bf024d4b4d6d8945159482ce5287-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:36:35 [async_llm.py:261] Added request cmpl-87d2bf024d4b4d6d8945159482ce5287-0.
INFO 03-01 23:36:36 [logger.py:42] Received request cmpl-561cb5e4600642518a20ffb3efe17996-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:36:36 [async_llm.py:261] Added request cmpl-561cb5e4600642518a20ffb3efe17996-0.
INFO 03-01 23:36:37 [logger.py:42] Received request cmpl-d0011368bf514935bdd8bdc70e2d7bd3-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:36:37 [async_llm.py:261] Added request cmpl-d0011368bf514935bdd8bdc70e2d7bd3-0.
INFO 03-01 23:36:37 [loggers.py:116] Engine 000: Avg prompt throughput: 27.9 tokens/s, Avg generation throughput: 4.5 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.1%
INFO 03-01 23:36:38 [logger.py:42] Received request cmpl-bf87abddca8c4964b91ad3d2e158feac-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:36:38 [async_llm.py:261] Added request cmpl-bf87abddca8c4964b91ad3d2e158feac-0.
INFO 03-01 23:36:40 [logger.py:42] Received request cmpl-d182d5e90d6c4e02b42690aced7b0b4d-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:36:40 [async_llm.py:261] Added request cmpl-d182d5e90d6c4e02b42690aced7b0b4d-0.
INFO 03-01 23:36:41 [logger.py:42] Received request cmpl-a7fb2a94c2ac4e3898f159ec8edfc136-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:36:41 [async_llm.py:261] Added request cmpl-a7fb2a94c2ac4e3898f159ec8edfc136-0.
INFO 03-01 23:36:42 [logger.py:42] Received request cmpl-3ed4c38e18d74b09a20b9073e56b19d3-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:36:42 [async_llm.py:261] Added request cmpl-3ed4c38e18d74b09a20b9073e56b19d3-0.
INFO 03-01 23:36:43 [logger.py:42] Received request cmpl-9fb7afaf57024013a0d042d77ca42831-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:36:43 [async_llm.py:261] Added request cmpl-9fb7afaf57024013a0d042d77ca42831-0.
INFO 03-01 23:36:44 [logger.py:42] Received request cmpl-92b342fbaee445649f36fbba3a418b3a-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:36:44 [async_llm.py:261] Added request cmpl-92b342fbaee445649f36fbba3a418b3a-0.
INFO 03-01 23:36:45 [logger.py:42] Received request cmpl-d1778118739941c09bb3ad7ba5cd910d-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:36:45 [async_llm.py:261] Added request cmpl-d1778118739941c09bb3ad7ba5cd910d-0.
INFO 03-01 23:36:47 [logger.py:42] Received request cmpl-ca23e3d0e4c54ef1bdbf92f093cfe471-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:36:47 [async_llm.py:261] Added request cmpl-ca23e3d0e4c54ef1bdbf92f093cfe471-0.
INFO 03-01 23:36:47 [loggers.py:116] Engine 000: Avg prompt throughput: 24.8 tokens/s, Avg generation throughput: 4.0 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.1%
INFO 03-01 23:36:48 [logger.py:42] Received request cmpl-8b719c1deb4245729a548d4eaed56523-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:36:48 [async_llm.py:261] Added request cmpl-8b719c1deb4245729a548d4eaed56523-0.
INFO 03-01 23:36:49 [logger.py:42] Received request cmpl-a96b174cfb644bb8a7a9310eaa85db09-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:36:49 [async_llm.py:261] Added request cmpl-a96b174cfb644bb8a7a9310eaa85db09-0.
INFO 03-01 23:36:50 [logger.py:42] Received request cmpl-91512a1967544537ad5f3f7e981e7e95-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:36:50 [async_llm.py:261] Added request cmpl-91512a1967544537ad5f3f7e981e7e95-0.
INFO 03-01 23:36:51 [logger.py:42] Received request cmpl-8b9167aa236844b5bbd229abaafff423-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:36:51 [async_llm.py:261] Added request cmpl-8b9167aa236844b5bbd229abaafff423-0.
INFO 03-01 23:36:52 [logger.py:42] Received request cmpl-5069d024835c4faf916aeedd733ae89b-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:36:52 [async_llm.py:261] Added request cmpl-5069d024835c4faf916aeedd733ae89b-0.
INFO 03-01 23:36:54 [logger.py:42] Received request cmpl-36464ede2bb34eb289e159e61b9064b0-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:36:54 [async_llm.py:261] Added request cmpl-36464ede2bb34eb289e159e61b9064b0-0.
INFO 03-01 23:36:55 [logger.py:42] Received request cmpl-0d70d638c4634ac3871fb2eb5613b863-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:36:55 [async_llm.py:261] Added request cmpl-0d70d638c4634ac3871fb2eb5613b863-0.
INFO 03-01 23:36:56 [logger.py:42] Received request cmpl-0bf524d5e9de4e67bbeca98dc58fd281-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:36:56 [async_llm.py:261] Added request cmpl-0bf524d5e9de4e67bbeca98dc58fd281-0.
INFO 03-01 23:36:57 [logger.py:42] Received request cmpl-23d58621aea049428fe1593d20295d07-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:36:57 [async_llm.py:261] Added request cmpl-23d58621aea049428fe1593d20295d07-0.
INFO 03-01 23:36:57 [loggers.py:116] Engine 000: Avg prompt throughput: 27.9 tokens/s, Avg generation throughput: 4.5 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.1%
INFO 03-01 23:36:58 [logger.py:42] Received request cmpl-531f86ac79714f41b06f39bb59e9da6c-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:36:58 [async_llm.py:261] Added request cmpl-531f86ac79714f41b06f39bb59e9da6c-0.
INFO 03-01 23:36:59 [logger.py:42] Received request cmpl-00d4d50fb81b460e9469ec45cae872d3-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:36:59 [async_llm.py:261] Added request cmpl-00d4d50fb81b460e9469ec45cae872d3-0.
INFO 03-01 23:37:00 [logger.py:42] Received request cmpl-501100b39c5f4520bd99e581f3cfcb9a-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:37:00 [async_llm.py:261] Added request cmpl-501100b39c5f4520bd99e581f3cfcb9a-0.
INFO 03-01 23:37:02 [logger.py:42] Received request cmpl-beda40c4c3194f188632ec76ce4865f8-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:37:02 [async_llm.py:261] Added request cmpl-beda40c4c3194f188632ec76ce4865f8-0.
INFO 03-01 23:37:03 [logger.py:42] Received request cmpl-c99532da41034bf693164515cc02c8a7-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:37:03 [async_llm.py:261] Added request cmpl-c99532da41034bf693164515cc02c8a7-0.
INFO 03-01 23:37:04 [logger.py:42] Received request cmpl-37319165a85e404a962c065467c0fa35-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:37:04 [async_llm.py:261] Added request cmpl-37319165a85e404a962c065467c0fa35-0.
INFO 03-01 23:37:05 [logger.py:42] Received request cmpl-8f258fb697cf480ea311451172424140-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:37:05 [async_llm.py:261] Added request cmpl-8f258fb697cf480ea311451172424140-0.
INFO 03-01 23:37:06 [logger.py:42] Received request cmpl-6fef817b20894a4aa283badc0225873f-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:37:06 [async_llm.py:261] Added request cmpl-6fef817b20894a4aa283badc0225873f-0.
INFO 03-01 23:37:07 [logger.py:42] Received request cmpl-8f87314a0c174dc3a2b8c788c2ae6c01-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:37:07 [async_llm.py:261] Added request cmpl-8f87314a0c174dc3a2b8c788c2ae6c01-0.
INFO 03-01 23:37:07 [loggers.py:116] Engine 000: Avg prompt throughput: 27.9 tokens/s, Avg generation throughput: 4.3 tokens/s, Running: 1 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.4%, Prefix cache hit rate: 51.2%
INFO 03-01 23:37:09 [logger.py:42] Received request cmpl-d373dfc44c0d43a4a0a25fdddd2b1753-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:37:09 [async_llm.py:261] Added request cmpl-d373dfc44c0d43a4a0a25fdddd2b1753-0.
INFO 03-01 23:37:10 [logger.py:42] Received request cmpl-d69925c382824c8ab3f0ae1975a06325-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:37:10 [async_llm.py:261] Added request cmpl-d69925c382824c8ab3f0ae1975a06325-0.
INFO 03-01 23:37:11 [logger.py:42] Received request cmpl-0457b7fd49aa455791fd089fe4757168-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:37:11 [async_llm.py:261] Added request cmpl-0457b7fd49aa455791fd089fe4757168-0.
INFO 03-01 23:37:12 [logger.py:42] Received request cmpl-d4ef89c0dfd74e238f5a9e03dce8c336-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:37:12 [async_llm.py:261] Added request cmpl-d4ef89c0dfd74e238f5a9e03dce8c336-0.
INFO 03-01 23:37:13 [logger.py:42] Received request cmpl-613977711d874b99a882b419160c376f-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:37:13 [async_llm.py:261] Added request cmpl-613977711d874b99a882b419160c376f-0.
INFO 03-01 23:37:14 [logger.py:42] Received request cmpl-8ae4b33154c04895ab640b93a5ee5988-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:37:14 [async_llm.py:261] Added request cmpl-8ae4b33154c04895ab640b93a5ee5988-0.
INFO 03-01 23:37:15 [logger.py:42] Received request cmpl-475103b8e3b2439f959027f1276f91c0-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:37:15 [async_llm.py:261] Added request cmpl-475103b8e3b2439f959027f1276f91c0-0.
INFO 03-01 23:37:17 [logger.py:42] Received request cmpl-23dbef0bb26943be9fc96e63bdcad03c-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:37:17 [async_llm.py:261] Added request cmpl-23dbef0bb26943be9fc96e63bdcad03c-0.
INFO 03-01 23:37:17 [loggers.py:116] Engine 000: Avg prompt throughput: 24.8 tokens/s, Avg generation throughput: 4.2 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.2%
INFO 03-01 23:37:18 [logger.py:42] Received request cmpl-9c51df5e27ed4f27a58a85d1e1d093a9-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:37:18 [async_llm.py:261] Added request cmpl-9c51df5e27ed4f27a58a85d1e1d093a9-0.
INFO 03-01 23:37:19 [logger.py:42] Received request cmpl-7b83502e38ba4c1f9f500781d96c5403-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:37:19 [async_llm.py:261] Added request cmpl-7b83502e38ba4c1f9f500781d96c5403-0.
INFO 03-01 23:37:20 [logger.py:42] Received request cmpl-f51eee3e3cc54425bec6cdd2de5b532f-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:37:20 [async_llm.py:261] Added request cmpl-f51eee3e3cc54425bec6cdd2de5b532f-0.
INFO 03-01 23:37:21 [logger.py:42] Received request cmpl-74b511be485d4626b9224fcadff863cc-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:37:21 [async_llm.py:261] Added request cmpl-74b511be485d4626b9224fcadff863cc-0.
INFO 03-01 23:37:22 [logger.py:42] Received request cmpl-4ed1fb7788634711aa86ccf26e023e3d-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:37:22 [async_llm.py:261] Added request cmpl-4ed1fb7788634711aa86ccf26e023e3d-0.
INFO 03-01 23:37:24 [logger.py:42] Received request cmpl-5e8b2288dd4640e1845161c3ac1c9589-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:37:24 [async_llm.py:261] Added request cmpl-5e8b2288dd4640e1845161c3ac1c9589-0.
INFO 03-01 23:37:25 [logger.py:42] Received request cmpl-4801a62f777d4466b42d20a2dec1a640-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:37:25 [async_llm.py:261] Added request cmpl-4801a62f777d4466b42d20a2dec1a640-0.
INFO 03-01 23:37:26 [logger.py:42] Received request cmpl-70e8382ef9444db48835b6845e34939b-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:37:26 [async_llm.py:261] Added request cmpl-70e8382ef9444db48835b6845e34939b-0.
INFO 03-01 23:37:27 [logger.py:42] Received request cmpl-a08a636804294cda95984067caf45708-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:37:27 [async_llm.py:261] Added request cmpl-a08a636804294cda95984067caf45708-0.
INFO 03-01 23:37:27 [loggers.py:116] Engine 000: Avg prompt throughput: 27.9 tokens/s, Avg generation throughput: 4.5 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.2%
INFO 03-01 23:37:28 [logger.py:42] Received request cmpl-9bccc45aa3d4461a94db2d94a78e4c4b-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:37:28 [async_llm.py:261] Added request cmpl-9bccc45aa3d4461a94db2d94a78e4c4b-0.
INFO 03-01 23:37:29 [logger.py:42] Received request cmpl-09dccdf2a7e44637b9f22ae5689bf580-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:37:29 [async_llm.py:261] Added request cmpl-09dccdf2a7e44637b9f22ae5689bf580-0.
INFO 03-01 23:37:30 [logger.py:42] Received request cmpl-02fb5ca0f07447a1a7a1a00b59e56669-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:37:30 [async_llm.py:261] Added request cmpl-02fb5ca0f07447a1a7a1a00b59e56669-0.
INFO 03-01 23:37:32 [logger.py:42] Received request cmpl-28716d1f9b7d4daca9bbeceaff48202a-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:37:32 [async_llm.py:261] Added request cmpl-28716d1f9b7d4daca9bbeceaff48202a-0.
INFO 03-01 23:37:33 [logger.py:42] Received request cmpl-fd3c4b4c34744d019681e6a232bad20c-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:37:33 [async_llm.py:261] Added request cmpl-fd3c4b4c34744d019681e6a232bad20c-0.
INFO 03-01 23:37:34 [logger.py:42] Received request cmpl-b16d4323959147a4b844d7ee2ad56f04-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:37:34 [async_llm.py:261] Added request cmpl-b16d4323959147a4b844d7ee2ad56f04-0.
INFO 03-01 23:37:35 [logger.py:42] Received request cmpl-8f6b7f85ee094deb8a89a42d58233ae9-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:37:35 [async_llm.py:261] Added request cmpl-8f6b7f85ee094deb8a89a42d58233ae9-0.
INFO 03-01 23:37:36 [logger.py:42] Received request cmpl-84b6bc5736cd47b4b353a659f0f6af9f-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:37:36 [async_llm.py:261] Added request cmpl-84b6bc5736cd47b4b353a659f0f6af9f-0.
INFO 03-01 23:37:37 [logger.py:42] Received request cmpl-2459fbcb6b954b6d9855bc581d1d361e-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:37:37 [async_llm.py:261] Added request cmpl-2459fbcb6b954b6d9855bc581d1d361e-0.
INFO 03-01 23:37:37 [loggers.py:116] Engine 000: Avg prompt throughput: 27.9 tokens/s, Avg generation throughput: 4.1 tokens/s, Running: 1 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.3%, Prefix cache hit rate: 51.3%
INFO 03-01 23:37:39 [logger.py:42] Received request cmpl-66f4927aa41c4acf93a6d81172fde0c1-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:37:39 [async_llm.py:261] Added request cmpl-66f4927aa41c4acf93a6d81172fde0c1-0.
INFO 03-01 23:37:40 [logger.py:42] Received request cmpl-9906c51309094d5e8ff549a3f2689c27-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:37:40 [async_llm.py:261] Added request cmpl-9906c51309094d5e8ff549a3f2689c27-0.
INFO 03-01 23:37:41 [logger.py:42] Received request cmpl-3659afd349644f8fb220a95a2608e88f-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:37:41 [async_llm.py:261] Added request cmpl-3659afd349644f8fb220a95a2608e88f-0.
INFO 03-01 23:37:42 [logger.py:42] Received request cmpl-0984d92eb75441d499534214c78585ac-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:37:42 [async_llm.py:261] Added request cmpl-0984d92eb75441d499534214c78585ac-0.
INFO 03-01 23:37:43 [logger.py:42] Received request cmpl-ce979121daac4c1bbbb8c731192abd56-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:37:43 [async_llm.py:261] Added request cmpl-ce979121daac4c1bbbb8c731192abd56-0.
INFO 03-01 23:37:44 [logger.py:42] Received request cmpl-58e2f6f6c3c2446d9efd92552b33f50c-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:37:44 [async_llm.py:261] Added request cmpl-58e2f6f6c3c2446d9efd92552b33f50c-0.
INFO 03-01 23:37:46 [logger.py:42] Received request cmpl-18157147121f4c64b594f8ca0ed0ab8d-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:37:46 [async_llm.py:261] Added request cmpl-18157147121f4c64b594f8ca0ed0ab8d-0.
INFO 03-01 23:37:47 [logger.py:42] Received request cmpl-ddb1d8436d794330bf57a0baa4cf608a-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:37:47 [async_llm.py:261] Added request cmpl-ddb1d8436d794330bf57a0baa4cf608a-0.
INFO 03-01 23:37:47 [loggers.py:116] Engine 000: Avg prompt throughput: 24.8 tokens/s, Avg generation throughput: 4.4 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.3%
INFO 03-01 23:37:48 [logger.py:42] Received request cmpl-9383ceaa9da54731996fb19859b29eb4-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:37:48 [async_llm.py:261] Added request cmpl-9383ceaa9da54731996fb19859b29eb4-0.
INFO 03-01 23:37:49 [logger.py:42] Received request cmpl-d9f7957950874bdbb7b855ec17a24f01-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:37:49 [async_llm.py:261] Added request cmpl-d9f7957950874bdbb7b855ec17a24f01-0.
INFO 03-01 23:37:50 [logger.py:42] Received request cmpl-de552910d3f246e08fccf28f4dc35205-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:37:50 [async_llm.py:261] Added request cmpl-de552910d3f246e08fccf28f4dc35205-0.
INFO 03-01 23:37:51 [logger.py:42] Received request cmpl-93c93a5061404dddbebcebb66d96f9f5-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:37:51 [async_llm.py:261] Added request cmpl-93c93a5061404dddbebcebb66d96f9f5-0.
INFO 03-01 23:37:52 [logger.py:42] Received request cmpl-a934b115dff44cb38260ad7ed42919c7-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:37:52 [async_llm.py:261] Added request cmpl-a934b115dff44cb38260ad7ed42919c7-0.
INFO 03-01 23:37:54 [logger.py:42] Received request cmpl-c3e8546c8a134f5c86e57fd6592e84ef-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:37:54 [async_llm.py:261] Added request cmpl-c3e8546c8a134f5c86e57fd6592e84ef-0.
INFO 03-01 23:37:55 [logger.py:42] Received request cmpl-dda23980afda4ea485f3d1af7486f985-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:37:55 [async_llm.py:261] Added request cmpl-dda23980afda4ea485f3d1af7486f985-0.
INFO 03-01 23:37:56 [logger.py:42] Received request cmpl-1058033b1e4343988c63f4707e77bba0-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:37:56 [async_llm.py:261] Added request cmpl-1058033b1e4343988c63f4707e77bba0-0.
INFO 03-01 23:37:57 [logger.py:42] Received request cmpl-fff39c52c2fd4e9fbd43fb08b2a63544-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:37:57 [async_llm.py:261] Added request cmpl-fff39c52c2fd4e9fbd43fb08b2a63544-0.
INFO 03-01 23:37:57 [loggers.py:116] Engine 000: Avg prompt throughput: 27.9 tokens/s, Avg generation throughput: 4.5 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.3%
INFO 03-01 23:37:58 [logger.py:42] Received request cmpl-b09cca132251456cb69cd0478093c3a4-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:37:58 [async_llm.py:261] Added request cmpl-b09cca132251456cb69cd0478093c3a4-0.
INFO 03-01 23:37:59 [logger.py:42] Received request cmpl-e6288d678362454c9ba4937314dafdf4-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:37:59 [async_llm.py:261] Added request cmpl-e6288d678362454c9ba4937314dafdf4-0.
INFO 03-01 23:38:01 [logger.py:42] Received request cmpl-039a4f3f201743f19a37d6f40c2f3b12-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:38:01 [async_llm.py:261] Added request cmpl-039a4f3f201743f19a37d6f40c2f3b12-0.
INFO 03-01 23:38:02 [logger.py:42] Received request cmpl-a05ec5cd39ee4d67a984bd874737b725-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:38:02 [async_llm.py:261] Added request cmpl-a05ec5cd39ee4d67a984bd874737b725-0.
INFO 03-01 23:38:03 [logger.py:42] Received request cmpl-370f8ddad2ed45b090dc09e061cc6ac2-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:38:03 [async_llm.py:261] Added request cmpl-370f8ddad2ed45b090dc09e061cc6ac2-0.
INFO 03-01 23:38:04 [logger.py:42] Received request cmpl-6a3bb22f771a4a02b13c5260a3d985d7-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:38:04 [async_llm.py:261] Added request cmpl-6a3bb22f771a4a02b13c5260a3d985d7-0.
INFO 03-01 23:38:05 [logger.py:42] Received request cmpl-a47e30c602e342aca3f8b419b87e7e1c-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:38:05 [async_llm.py:261] Added request cmpl-a47e30c602e342aca3f8b419b87e7e1c-0.
INFO 03-01 23:38:06 [logger.py:42] Received request cmpl-888cf68f146846378cbf8ceca8df84bd-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:38:06 [async_llm.py:261] Added request cmpl-888cf68f146846378cbf8ceca8df84bd-0.
INFO 03-01 23:38:07 [logger.py:42] Received request cmpl-8c3037ca4274485fac8de53475e8c300-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:38:07 [async_llm.py:261] Added request cmpl-8c3037ca4274485fac8de53475e8c300-0.
INFO 03-01 23:38:07 [loggers.py:116] Engine 000: Avg prompt throughput: 24.8 tokens/s, Avg generation throughput: 4.0 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.3%
INFO 03-01 23:38:09 [logger.py:42] Received request cmpl-9a71c70027f74408a48b0e73bb6a4a07-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:38:09 [async_llm.py:261] Added request cmpl-9a71c70027f74408a48b0e73bb6a4a07-0.
INFO 03-01 23:38:10 [logger.py:42] Received request cmpl-1227ff3df94e4bcaad841a196a55ef64-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:38:10 [async_llm.py:261] Added request cmpl-1227ff3df94e4bcaad841a196a55ef64-0.
INFO 03-01 23:38:11 [logger.py:42] Received request cmpl-9de548930d9d4cfbac3887f62a7ec8b1-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:38:11 [async_llm.py:261] Added request cmpl-9de548930d9d4cfbac3887f62a7ec8b1-0.
INFO 03-01 23:38:12 [logger.py:42] Received request cmpl-213c3c5788004fe7bf5df9200fd842de-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:38:12 [async_llm.py:261] Added request cmpl-213c3c5788004fe7bf5df9200fd842de-0.
INFO 03-01 23:38:13 [logger.py:42] Received request cmpl-c5f736a102a84f93b8dc3b20d795dc59-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:38:13 [async_llm.py:261] Added request cmpl-c5f736a102a84f93b8dc3b20d795dc59-0.
INFO 03-01 23:38:14 [logger.py:42] Received request cmpl-c17d9a7c01084e988fcc2d54e149a601-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:38:14 [async_llm.py:261] Added request cmpl-c17d9a7c01084e988fcc2d54e149a601-0.
INFO 03-01 23:38:16 [logger.py:42] Received request cmpl-c713269b4cd6462ea5b69073a409b7d6-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:38:16 [async_llm.py:261] Added request cmpl-c713269b4cd6462ea5b69073a409b7d6-0.
INFO 03-01 23:38:17 [logger.py:42] Received request cmpl-23bbc28e8b8645ed905c85a17494b4f9-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:38:17 [async_llm.py:261] Added request cmpl-23bbc28e8b8645ed905c85a17494b4f9-0.
INFO 03-01 23:38:17 [loggers.py:116] Engine 000: Avg prompt throughput: 27.9 tokens/s, Avg generation throughput: 4.5 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.3%
INFO 03-01 23:38:18 [logger.py:42] Received request cmpl-3096a62116914d60bd01610fdeb03e3b-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:38:18 [async_llm.py:261] Added request cmpl-3096a62116914d60bd01610fdeb03e3b-0.
INFO 03-01 23:38:19 [logger.py:42] Received request cmpl-0f4ae27248764813ba14bace991d0d31-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:38:19 [async_llm.py:261] Added request cmpl-0f4ae27248764813ba14bace991d0d31-0.
INFO 03-01 23:38:20 [logger.py:42] Received request cmpl-bdb4626f23904d74b479be77df13d487-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:38:20 [async_llm.py:261] Added request cmpl-bdb4626f23904d74b479be77df13d487-0.
INFO 03-01 23:38:21 [logger.py:42] Received request cmpl-c82835f1d3cd472b96bd3a7c0a6afac9-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:38:21 [async_llm.py:261] Added request cmpl-c82835f1d3cd472b96bd3a7c0a6afac9-0.
INFO 03-01 23:38:22 [logger.py:42] Received request cmpl-e7602ec3b66b455e881e6d73e8429b4e-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:38:22 [async_llm.py:261] Added request cmpl-e7602ec3b66b455e881e6d73e8429b4e-0.
INFO 03-01 23:38:24 [logger.py:42] Received request cmpl-372b23299578444181cee2bea858be51-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:38:24 [async_llm.py:261] Added request cmpl-372b23299578444181cee2bea858be51-0.
INFO 03-01 23:38:25 [logger.py:42] Received request cmpl-45a9b7e85cd44caa855e145c9c2198ed-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:38:25 [async_llm.py:261] Added request cmpl-45a9b7e85cd44caa855e145c9c2198ed-0.
INFO 03-01 23:38:26 [logger.py:42] Received request cmpl-8ce85a82b2d9499f92914ce93193ecf7-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:38:26 [async_llm.py:261] Added request cmpl-8ce85a82b2d9499f92914ce93193ecf7-0.
INFO 03-01 23:38:27 [logger.py:42] Received request cmpl-c0a849d486f9428985cbe2da180b3ae1-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:38:27 [async_llm.py:261] Added request cmpl-c0a849d486f9428985cbe2da180b3ae1-0.
INFO 03-01 23:38:27 [loggers.py:116] Engine 000: Avg prompt throughput: 27.9 tokens/s, Avg generation throughput: 4.5 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.3%
INFO 03-01 23:38:28 [logger.py:42] Received request cmpl-94964b39ea884d27a31935c84a8bf203-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:38:28 [async_llm.py:261] Added request cmpl-94964b39ea884d27a31935c84a8bf203-0.
INFO 03-01 23:38:29 [logger.py:42] Received request cmpl-23a565447fc946028087377a4ad09234-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:38:29 [async_llm.py:261] Added request cmpl-23a565447fc946028087377a4ad09234-0.
INFO 03-01 23:38:31 [logger.py:42] Received request cmpl-1d2aeedf27194196aee526a3d6004912-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:38:31 [async_llm.py:261] Added request cmpl-1d2aeedf27194196aee526a3d6004912-0.
INFO 03-01 23:38:32 [logger.py:42] Received request cmpl-b265b752253f4fba9fc426fe01348ca3-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:38:32 [async_llm.py:261] Added request cmpl-b265b752253f4fba9fc426fe01348ca3-0.
INFO 03-01 23:38:33 [logger.py:42] Received request cmpl-01045340304c48589f24e90c70dfb3be-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:38:33 [async_llm.py:261] Added request cmpl-01045340304c48589f24e90c70dfb3be-0.
INFO 03-01 23:38:34 [logger.py:42] Received request cmpl-9d61efba40c8451183b844a63b96612d-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:38:34 [async_llm.py:261] Added request cmpl-9d61efba40c8451183b844a63b96612d-0.
INFO 03-01 23:38:35 [logger.py:42] Received request cmpl-b9afa17b0ac840bdbe4a062298d6f89a-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:38:35 [async_llm.py:261] Added request cmpl-b9afa17b0ac840bdbe4a062298d6f89a-0.
INFO 03-01 23:38:36 [logger.py:42] Received request cmpl-df945f811c0b4be1bcf10b14737c9306-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:38:36 [async_llm.py:261] Added request cmpl-df945f811c0b4be1bcf10b14737c9306-0.
INFO 03-01 23:38:37 [loggers.py:116] Engine 000: Avg prompt throughput: 24.8 tokens/s, Avg generation throughput: 4.0 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.3%
INFO 03-01 23:38:38 [logger.py:42] Received request cmpl-f958c9bde36c4866877bed9372e55542-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:38:38 [async_llm.py:261] Added request cmpl-f958c9bde36c4866877bed9372e55542-0.
INFO 03-01 23:38:39 [logger.py:42] Received request cmpl-4b2bdffef0b245fb980b37a83c15b661-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:38:39 [async_llm.py:261] Added request cmpl-4b2bdffef0b245fb980b37a83c15b661-0.
INFO 03-01 23:38:40 [logger.py:42] Received request cmpl-69bff791cc594b37b84cb5565e3a3d1b-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:38:40 [async_llm.py:261] Added request cmpl-69bff791cc594b37b84cb5565e3a3d1b-0.
INFO 03-01 23:38:41 [logger.py:42] Received request cmpl-d48a8c587f534d4ba3c0ad93f85d2f89-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:38:41 [async_llm.py:261] Added request cmpl-d48a8c587f534d4ba3c0ad93f85d2f89-0.
INFO 03-01 23:38:42 [logger.py:42] Received request cmpl-147af6756f664400baa18df24394ca5e-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:38:42 [async_llm.py:261] Added request cmpl-147af6756f664400baa18df24394ca5e-0.
INFO 03-01 23:38:43 [logger.py:42] Received request cmpl-b39a07016d3643b4bf18c04c70db892f-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:38:43 [async_llm.py:261] Added request cmpl-b39a07016d3643b4bf18c04c70db892f-0.
INFO 03-01 23:38:44 [logger.py:42] Received request cmpl-c894733c3df240febd2ebc3c3477a4f7-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:38:44 [async_llm.py:261] Added request cmpl-c894733c3df240febd2ebc3c3477a4f7-0.
INFO 03-01 23:38:46 [logger.py:42] Received request cmpl-a13ea5ed0dc94a01922a8c3163f90376-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:38:46 [async_llm.py:261] Added request cmpl-a13ea5ed0dc94a01922a8c3163f90376-0.
INFO 03-01 23:38:47 [logger.py:42] Received request cmpl-aceb420058e54844b2a9714468618a3a-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:38:47 [async_llm.py:261] Added request cmpl-aceb420058e54844b2a9714468618a3a-0.
INFO 03-01 23:38:47 [loggers.py:116] Engine 000: Avg prompt throughput: 27.9 tokens/s, Avg generation throughput: 4.5 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.4%
INFO 03-01 23:38:48 [logger.py:42] Received request cmpl-94b12942db184c8c8cf24184378fa93f-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:38:48 [async_llm.py:261] Added request cmpl-94b12942db184c8c8cf24184378fa93f-0.
INFO 03-01 23:38:49 [logger.py:42] Received request cmpl-2481f4a826be4594bf576417b7d54843-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:38:49 [async_llm.py:261] Added request cmpl-2481f4a826be4594bf576417b7d54843-0.
INFO 03-01 23:38:50 [logger.py:42] Received request cmpl-fb88964df1964d65be03788042bc834e-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:38:50 [async_llm.py:261] Added request cmpl-fb88964df1964d65be03788042bc834e-0.
INFO 03-01 23:38:51 [logger.py:42] Received request cmpl-385ebea78b224d7db993c49778bf3b23-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:38:51 [async_llm.py:261] Added request cmpl-385ebea78b224d7db993c49778bf3b23-0.
INFO 03-01 23:38:53 [logger.py:42] Received request cmpl-150fbef316f049ad8b1b0ec353ca705a-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:38:53 [async_llm.py:261] Added request cmpl-150fbef316f049ad8b1b0ec353ca705a-0.
INFO 03-01 23:38:54 [logger.py:42] Received request cmpl-705fc49827944e6395e6aeb084c7d366-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:38:54 [async_llm.py:261] Added request cmpl-705fc49827944e6395e6aeb084c7d366-0.
INFO 03-01 23:38:55 [logger.py:42] Received request cmpl-47c4869f4629442c8d9fbbe2cfa95c2d-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:38:55 [async_llm.py:261] Added request cmpl-47c4869f4629442c8d9fbbe2cfa95c2d-0.
INFO 03-01 23:38:56 [logger.py:42] Received request cmpl-a1551dec4b12401ab1864b1eeab4067f-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:38:56 [async_llm.py:261] Added request cmpl-a1551dec4b12401ab1864b1eeab4067f-0.
INFO 03-01 23:38:57 [logger.py:42] Received request cmpl-7b014374612f4b4ea229f6fb7b989f88-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:38:57 [async_llm.py:261] Added request cmpl-7b014374612f4b4ea229f6fb7b989f88-0.
INFO 03-01 23:38:57 [loggers.py:116] Engine 000: Avg prompt throughput: 27.9 tokens/s, Avg generation throughput: 4.5 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.4%
INFO 03-01 23:38:58 [logger.py:42] Received request cmpl-88a6655533ad4c3cbdc8f31860b4ee07-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:38:58 [async_llm.py:261] Added request cmpl-88a6655533ad4c3cbdc8f31860b4ee07-0.
INFO 03-01 23:38:59 [logger.py:42] Received request cmpl-6033439f0d694488ad8b462429de8548-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:38:59 [async_llm.py:261] Added request cmpl-6033439f0d694488ad8b462429de8548-0.
INFO 03-01 23:39:01 [logger.py:42] Received request cmpl-06b9ec24314c4792bbbe7b98af1b95df-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:39:01 [async_llm.py:261] Added request cmpl-06b9ec24314c4792bbbe7b98af1b95df-0.
INFO 03-01 23:39:02 [logger.py:42] Received request cmpl-06777afde62249428e4545d18f609d87-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:39:02 [async_llm.py:261] Added request cmpl-06777afde62249428e4545d18f609d87-0.
INFO 03-01 23:39:03 [logger.py:42] Received request cmpl-502e50fd9ae74b739a10a6df0198a71a-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:39:03 [async_llm.py:261] Added request cmpl-502e50fd9ae74b739a10a6df0198a71a-0.
INFO 03-01 23:39:04 [logger.py:42] Received request cmpl-a5aa916694054675be0f3046648188ce-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:39:04 [async_llm.py:261] Added request cmpl-a5aa916694054675be0f3046648188ce-0.
INFO 03-01 23:39:05 [logger.py:42] Received request cmpl-e212dc400ef14ea09caeb0b822fefbb6-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:39:05 [async_llm.py:261] Added request cmpl-e212dc400ef14ea09caeb0b822fefbb6-0.
INFO 03-01 23:39:06 [logger.py:42] Received request cmpl-8e537ad92cfe4d7fac99899619109dd6-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:39:06 [async_llm.py:261] Added request cmpl-8e537ad92cfe4d7fac99899619109dd6-0.
INFO 03-01 23:39:07 [loggers.py:116] Engine 000: Avg prompt throughput: 24.8 tokens/s, Avg generation throughput: 4.0 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.4%
INFO 03-01 23:39:08 [logger.py:42] Received request cmpl-3a64a6dd14dc45c9a71e85824b958cc7-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:39:08 [async_llm.py:261] Added request cmpl-3a64a6dd14dc45c9a71e85824b958cc7-0.
INFO 03-01 23:39:09 [logger.py:42] Received request cmpl-5982e5fb4bd642b18f71421f7a982924-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:39:09 [async_llm.py:261] Added request cmpl-5982e5fb4bd642b18f71421f7a982924-0.
INFO 03-01 23:39:10 [logger.py:42] Received request cmpl-a7f647d49a4d4620baa68e426c7c0a06-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:39:10 [async_llm.py:261] Added request cmpl-a7f647d49a4d4620baa68e426c7c0a06-0.
INFO 03-01 23:39:11 [logger.py:42] Received request cmpl-b8e7f7e394934bef9e19b694f626b4dd-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:39:11 [async_llm.py:261] Added request cmpl-b8e7f7e394934bef9e19b694f626b4dd-0.
INFO 03-01 23:39:12 [logger.py:42] Received request cmpl-d66e73cd51424795a4c6631b1b0770d3-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:39:12 [async_llm.py:261] Added request cmpl-d66e73cd51424795a4c6631b1b0770d3-0.
INFO 03-01 23:39:13 [logger.py:42] Received request cmpl-5db0799f6df844e4b0afb2899d996ac8-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:39:13 [async_llm.py:261] Added request cmpl-5db0799f6df844e4b0afb2899d996ac8-0.
INFO 03-01 23:39:14 [logger.py:42] Received request cmpl-4b6a15f3667e4914b21955c2b64282a7-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:39:14 [async_llm.py:261] Added request cmpl-4b6a15f3667e4914b21955c2b64282a7-0.
INFO 03-01 23:39:16 [logger.py:42] Received request cmpl-23c69c511f124d58bc6742a26ff736f3-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:39:16 [async_llm.py:261] Added request cmpl-23c69c511f124d58bc6742a26ff736f3-0.
INFO 03-01 23:39:17 [logger.py:42] Received request cmpl-1e798c4a9a0f4213b874c13a31a5423b-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:39:17 [async_llm.py:261] Added request cmpl-1e798c4a9a0f4213b874c13a31a5423b-0.
INFO 03-01 23:39:17 [loggers.py:116] Engine 000: Avg prompt throughput: 27.9 tokens/s, Avg generation throughput: 4.5 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.4%
INFO 03-01 23:39:18 [logger.py:42] Received request cmpl-062d2126b537406dbb60f74bd5016b60-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:39:18 [async_llm.py:261] Added request cmpl-062d2126b537406dbb60f74bd5016b60-0.
INFO 03-01 23:39:19 [logger.py:42] Received request cmpl-481ceb90427f48b982496150d7690d8b-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:39:19 [async_llm.py:261] Added request cmpl-481ceb90427f48b982496150d7690d8b-0.
INFO 03-01 23:39:20 [logger.py:42] Received request cmpl-8e98e349e6134cfc9ca0acb0bdfbaac4-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:39:20 [async_llm.py:261] Added request cmpl-8e98e349e6134cfc9ca0acb0bdfbaac4-0.
INFO 03-01 23:39:21 [logger.py:42] Received request cmpl-d4d4bea05ee5424fb011918d201f586a-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:39:21 [async_llm.py:261] Added request cmpl-d4d4bea05ee5424fb011918d201f586a-0.
INFO 03-01 23:39:23 [logger.py:42] Received request cmpl-607e5ddb14e44576a9c7fc08d64930d5-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:39:23 [async_llm.py:261] Added request cmpl-607e5ddb14e44576a9c7fc08d64930d5-0.
INFO 03-01 23:39:24 [logger.py:42] Received request cmpl-bf39627734cb445b910c537d291bdfc6-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:39:24 [async_llm.py:261] Added request cmpl-bf39627734cb445b910c537d291bdfc6-0.
INFO 03-01 23:39:25 [logger.py:42] Received request cmpl-ed5975e23a90422a8263d53e86b84c27-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:39:25 [async_llm.py:261] Added request cmpl-ed5975e23a90422a8263d53e86b84c27-0.
INFO 03-01 23:39:26 [logger.py:42] Received request cmpl-ea6dcd29b49d459393a660110afe0e87-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:39:26 [async_llm.py:261] Added request cmpl-ea6dcd29b49d459393a660110afe0e87-0.
INFO 03-01 23:39:27 [logger.py:42] Received request cmpl-ff704d3cbdc643dd8264362ae912e4ff-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:39:27 [async_llm.py:261] Added request cmpl-ff704d3cbdc643dd8264362ae912e4ff-0.
INFO 03-01 23:39:27 [loggers.py:116] Engine 000: Avg prompt throughput: 27.9 tokens/s, Avg generation throughput: 4.5 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.4%
INFO 03-01 23:39:28 [logger.py:42] Received request cmpl-89bfd1dcb3304dff996bf7d5e1a13055-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:39:28 [async_llm.py:261] Added request cmpl-89bfd1dcb3304dff996bf7d5e1a13055-0.
INFO 03-01 23:39:29 [logger.py:42] Received request cmpl-a75d8c5f5ab14d0ca464f897667d9260-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:39:29 [async_llm.py:261] Added request cmpl-a75d8c5f5ab14d0ca464f897667d9260-0.
INFO 03-01 23:39:31 [logger.py:42] Received request cmpl-24e48de5bf9d481d8ed6fb0ccb43dfbe-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:39:31 [async_llm.py:261] Added request cmpl-24e48de5bf9d481d8ed6fb0ccb43dfbe-0.
INFO 03-01 23:39:32 [logger.py:42] Received request cmpl-98e94b59b27341a3b7313e5a43beca99-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:39:32 [async_llm.py:261] Added request cmpl-98e94b59b27341a3b7313e5a43beca99-0.
INFO 03-01 23:39:33 [logger.py:42] Received request cmpl-82c0fc16ae6a448e95ee53321fb4d401-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:39:33 [async_llm.py:261] Added request cmpl-82c0fc16ae6a448e95ee53321fb4d401-0.
INFO 03-01 23:39:34 [logger.py:42] Received request cmpl-f5d64a3d0d4f446eb6793d24179b0d5d-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:39:34 [async_llm.py:261] Added request cmpl-f5d64a3d0d4f446eb6793d24179b0d5d-0.
INFO 03-01 23:39:35 [logger.py:42] Received request cmpl-ec3acee5e47c4c4ea7a331cb980e97fc-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:39:35 [async_llm.py:261] Added request cmpl-ec3acee5e47c4c4ea7a331cb980e97fc-0.
INFO 03-01 23:39:36 [logger.py:42] Received request cmpl-69857f3ac4b141c69a7ff0150a004aee-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:39:36 [async_llm.py:261] Added request cmpl-69857f3ac4b141c69a7ff0150a004aee-0.
INFO 03-01 23:39:37 [loggers.py:116] Engine 000: Avg prompt throughput: 24.8 tokens/s, Avg generation throughput: 4.0 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.4%
INFO 03-01 23:39:38 [logger.py:42] Received request cmpl-84237ee981e0402fb71fedd341a5e357-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:39:38 [async_llm.py:261] Added request cmpl-84237ee981e0402fb71fedd341a5e357-0.
INFO 03-01 23:39:39 [logger.py:42] Received request cmpl-4f7d864800fe443e98cd4d951b133ae8-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:39:39 [async_llm.py:261] Added request cmpl-4f7d864800fe443e98cd4d951b133ae8-0.
INFO 03-01 23:39:40 [logger.py:42] Received request cmpl-4d22e1eaaac54bf4a8191a5ef14ce5ce-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:39:40 [async_llm.py:261] Added request cmpl-4d22e1eaaac54bf4a8191a5ef14ce5ce-0.
INFO 03-01 23:39:41 [logger.py:42] Received request cmpl-b7c73e363b61485d86a6cf878ed12a50-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:39:41 [async_llm.py:261] Added request cmpl-b7c73e363b61485d86a6cf878ed12a50-0.
INFO 03-01 23:39:42 [logger.py:42] Received request cmpl-d660771dba44486ab6911cd63df38d51-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:39:42 [async_llm.py:261] Added request cmpl-d660771dba44486ab6911cd63df38d51-0.
INFO 03-01 23:39:43 [logger.py:42] Received request cmpl-21bc3c53ba214c93b5f14b22c18de9de-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:39:43 [async_llm.py:261] Added request cmpl-21bc3c53ba214c93b5f14b22c18de9de-0.
INFO 03-01 23:39:44 [logger.py:42] Received request cmpl-f4fe38b017a546778c35e2d5e6053c76-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:39:44 [async_llm.py:261] Added request cmpl-f4fe38b017a546778c35e2d5e6053c76-0.
INFO 03-01 23:39:46 [logger.py:42] Received request cmpl-86def2eae213482c889a6782b8af0394-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:39:46 [async_llm.py:261] Added request cmpl-86def2eae213482c889a6782b8af0394-0.
INFO 03-01 23:39:47 [logger.py:42] Received request cmpl-4c6512c3c0114cd3abdf1294aee5f64b-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:39:47 [async_llm.py:261] Added request cmpl-4c6512c3c0114cd3abdf1294aee5f64b-0.
INFO 03-01 23:39:47 [loggers.py:116] Engine 000: Avg prompt throughput: 27.9 tokens/s, Avg generation throughput: 4.5 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.4%
INFO 03-01 23:39:48 [logger.py:42] Received request cmpl-80ab768a258740d8a380cd0157d782ff-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:39:48 [async_llm.py:261] Added request cmpl-80ab768a258740d8a380cd0157d782ff-0.
INFO 03-01 23:39:49 [logger.py:42] Received request cmpl-511500e709a74f20842822a8f0d48faf-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:39:49 [async_llm.py:261] Added request cmpl-511500e709a74f20842822a8f0d48faf-0.
INFO 03-01 23:39:50 [logger.py:42] Received request cmpl-466dc48cfc3b4ef5830e401e08697294-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:39:50 [async_llm.py:261] Added request cmpl-466dc48cfc3b4ef5830e401e08697294-0.
INFO 03-01 23:39:51 [logger.py:42] Received request cmpl-42e18971687943969d3f983bbe4c2305-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:39:51 [async_llm.py:261] Added request cmpl-42e18971687943969d3f983bbe4c2305-0.
INFO 03-01 23:39:53 [logger.py:42] Received request cmpl-f0a79dc9aeb24cf6a2c9c7fd06d2d4b7-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:39:53 [async_llm.py:261] Added request cmpl-f0a79dc9aeb24cf6a2c9c7fd06d2d4b7-0.
INFO 03-01 23:39:54 [logger.py:42] Received request cmpl-6d80d0a1f2294b5bb512a4fc6ce1df3a-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:39:54 [async_llm.py:261] Added request cmpl-6d80d0a1f2294b5bb512a4fc6ce1df3a-0.
INFO 03-01 23:39:55 [logger.py:42] Received request cmpl-710697da821e470e847b4c7faba21d2a-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:39:55 [async_llm.py:261] Added request cmpl-710697da821e470e847b4c7faba21d2a-0.
INFO 03-01 23:39:56 [logger.py:42] Received request cmpl-87f626cce5a5467d8ea5a59f63fc5362-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:39:56 [async_llm.py:261] Added request cmpl-87f626cce5a5467d8ea5a59f63fc5362-0.
INFO 03-01 23:39:57 [logger.py:42] Received request cmpl-40f9e38952f5401c847a358243a572cc-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:39:57 [async_llm.py:261] Added request cmpl-40f9e38952f5401c847a358243a572cc-0.
INFO 03-01 23:39:57 [loggers.py:116] Engine 000: Avg prompt throughput: 27.9 tokens/s, Avg generation throughput: 4.5 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.4%
INFO 03-01 23:39:58 [logger.py:42] Received request cmpl-565d99475d534211ac9b3a89fed2885e-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:39:58 [async_llm.py:261] Added request cmpl-565d99475d534211ac9b3a89fed2885e-0.
INFO 03-01 23:39:59 [logger.py:42] Received request cmpl-9aebaa82351d44e78be01a005f69871f-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:39:59 [async_llm.py:261] Added request cmpl-9aebaa82351d44e78be01a005f69871f-0.
INFO 03-01 23:40:01 [logger.py:42] Received request cmpl-4e256c30cf1a49bb9fc7057f8839a71a-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:40:01 [async_llm.py:261] Added request cmpl-4e256c30cf1a49bb9fc7057f8839a71a-0.
INFO 03-01 23:40:02 [logger.py:42] Received request cmpl-a834deec9e0f448faa4fb90c6812f404-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:40:02 [async_llm.py:261] Added request cmpl-a834deec9e0f448faa4fb90c6812f404-0.
INFO 03-01 23:40:03 [logger.py:42] Received request cmpl-bf65452bbcd74ff98e5e5bd2269c7bf2-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:40:03 [async_llm.py:261] Added request cmpl-bf65452bbcd74ff98e5e5bd2269c7bf2-0.
INFO 03-01 23:40:04 [logger.py:42] Received request cmpl-88c7383f14b645e6a3c4e1faf0a56a79-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:40:04 [async_llm.py:261] Added request cmpl-88c7383f14b645e6a3c4e1faf0a56a79-0.
INFO 03-01 23:40:05 [logger.py:42] Received request cmpl-1cab129c563e4e21b82364df6cf1c892-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:40:05 [async_llm.py:261] Added request cmpl-1cab129c563e4e21b82364df6cf1c892-0.
INFO 03-01 23:40:06 [logger.py:42] Received request cmpl-4cc030fe748c432fa749b3f774476a04-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:40:06 [async_llm.py:261] Added request cmpl-4cc030fe748c432fa749b3f774476a04-0.
INFO 03-01 23:40:07 [loggers.py:116] Engine 000: Avg prompt throughput: 24.8 tokens/s, Avg generation throughput: 4.0 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.4%
INFO 03-01 23:40:08 [logger.py:42] Received request cmpl-560fde17f71d4ab2a5bd59c58174962a-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:40:08 [async_llm.py:261] Added request cmpl-560fde17f71d4ab2a5bd59c58174962a-0.
INFO 03-01 23:40:09 [logger.py:42] Received request cmpl-edd85af4c2044192b731f669d7bf6628-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:40:09 [async_llm.py:261] Added request cmpl-edd85af4c2044192b731f669d7bf6628-0.
INFO 03-01 23:40:10 [logger.py:42] Received request cmpl-29cdd0105c5747af8e769cd77935d6d9-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:40:10 [async_llm.py:261] Added request cmpl-29cdd0105c5747af8e769cd77935d6d9-0.
INFO 03-01 23:40:11 [logger.py:42] Received request cmpl-307a8d68752648c09261eac22ada28dd-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:40:11 [async_llm.py:261] Added request cmpl-307a8d68752648c09261eac22ada28dd-0.
INFO 03-01 23:40:12 [logger.py:42] Received request cmpl-3dd7834526754dd38d9fdaae1bf8ba36-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:40:12 [async_llm.py:261] Added request cmpl-3dd7834526754dd38d9fdaae1bf8ba36-0.
INFO 03-01 23:40:13 [logger.py:42] Received request cmpl-d657a030ca60471d8355bef912460ef6-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:40:13 [async_llm.py:261] Added request cmpl-d657a030ca60471d8355bef912460ef6-0.
INFO 03-01 23:40:15 [logger.py:42] Received request cmpl-1c6b40c841ef460aaf63692eb02aff61-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:40:15 [async_llm.py:261] Added request cmpl-1c6b40c841ef460aaf63692eb02aff61-0.
INFO 03-01 23:40:16 [logger.py:42] Received request cmpl-e768bf79d9bc4206864f5c65149bed1d-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:40:16 [async_llm.py:261] Added request cmpl-e768bf79d9bc4206864f5c65149bed1d-0.
INFO 03-01 23:40:17 [logger.py:42] Received request cmpl-097ad33d2ddc4c16a6d84cdb6dd67283-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:40:17 [async_llm.py:261] Added request cmpl-097ad33d2ddc4c16a6d84cdb6dd67283-0.
INFO 03-01 23:40:17 [loggers.py:116] Engine 000: Avg prompt throughput: 27.9 tokens/s, Avg generation throughput: 4.5 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.4%
INFO 03-01 23:40:18 [logger.py:42] Received request cmpl-b31e54f9fe6c4830abfea8967ff3fc1c-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:40:18 [async_llm.py:261] Added request cmpl-b31e54f9fe6c4830abfea8967ff3fc1c-0.
INFO 03-01 23:40:19 [logger.py:42] Received request cmpl-f6467a2264b9454abc7687f983e6507b-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:40:19 [async_llm.py:261] Added request cmpl-f6467a2264b9454abc7687f983e6507b-0.
INFO 03-01 23:40:20 [logger.py:42] Received request cmpl-9bd7b336f01a4072819c5680f3c989a9-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:40:20 [async_llm.py:261] Added request cmpl-9bd7b336f01a4072819c5680f3c989a9-0.
INFO 03-01 23:40:21 [logger.py:42] Received request cmpl-be6c75e428534909a9200c9298ea941c-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:40:21 [async_llm.py:261] Added request cmpl-be6c75e428534909a9200c9298ea941c-0.
INFO 03-01 23:40:23 [logger.py:42] Received request cmpl-56caab4517d04c1991b472c0aee04f8c-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:40:23 [async_llm.py:261] Added request cmpl-56caab4517d04c1991b472c0aee04f8c-0.
INFO 03-01 23:40:24 [logger.py:42] Received request cmpl-40991110f95f4dea854ac6e65cc11813-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:40:24 [async_llm.py:261] Added request cmpl-40991110f95f4dea854ac6e65cc11813-0.
INFO 03-01 23:40:25 [logger.py:42] Received request cmpl-3811e3a72dd146c6a7af2cb2df490d94-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:40:25 [async_llm.py:261] Added request cmpl-3811e3a72dd146c6a7af2cb2df490d94-0.
INFO 03-01 23:40:26 [logger.py:42] Received request cmpl-a08b8ded164944818d2bd9636ccec675-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:40:26 [async_llm.py:261] Added request cmpl-a08b8ded164944818d2bd9636ccec675-0.
INFO 03-01 23:40:27 [logger.py:42] Received request cmpl-2eda3d5316d7410f9d9abb5a6005f9f2-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:40:27 [async_llm.py:261] Added request cmpl-2eda3d5316d7410f9d9abb5a6005f9f2-0.
INFO 03-01 23:40:27 [loggers.py:116] Engine 000: Avg prompt throughput: 27.9 tokens/s, Avg generation throughput: 4.5 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.4%
INFO 03-01 23:40:28 [logger.py:42] Received request cmpl-75dabc01436442f68af63df894fd50b9-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:40:28 [async_llm.py:261] Added request cmpl-75dabc01436442f68af63df894fd50b9-0.
INFO 03-01 23:40:30 [logger.py:42] Received request cmpl-655e82486e534accbf031609ed4e2451-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:40:30 [async_llm.py:261] Added request cmpl-655e82486e534accbf031609ed4e2451-0.
INFO 03-01 23:40:31 [logger.py:42] Received request cmpl-67bc76e5af1141ae83a5060d9d46f9ef-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:40:31 [async_llm.py:261] Added request cmpl-67bc76e5af1141ae83a5060d9d46f9ef-0.
INFO 03-01 23:40:32 [logger.py:42] Received request cmpl-ed31e193d3204b9982f4c3dcb6170412-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:40:32 [async_llm.py:261] Added request cmpl-ed31e193d3204b9982f4c3dcb6170412-0.
INFO 03-01 23:40:33 [logger.py:42] Received request cmpl-40123d9bf0f64dc9afd8f8513ffbe597-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:40:33 [async_llm.py:261] Added request cmpl-40123d9bf0f64dc9afd8f8513ffbe597-0.
INFO 03-01 23:40:34 [logger.py:42] Received request cmpl-3893fe5fb5b44fe68aab14f46ea2a776-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:40:34 [async_llm.py:261] Added request cmpl-3893fe5fb5b44fe68aab14f46ea2a776-0.
INFO 03-01 23:40:35 [logger.py:42] Received request cmpl-6c8819fb020d4adca550d168ff4efa44-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:40:35 [async_llm.py:261] Added request cmpl-6c8819fb020d4adca550d168ff4efa44-0.
INFO 03-01 23:40:36 [logger.py:42] Received request cmpl-225e5c8aa70e4cb0b1fce30e6bb524e7-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:40:36 [async_llm.py:261] Added request cmpl-225e5c8aa70e4cb0b1fce30e6bb524e7-0.
INFO 03-01 23:40:37 [loggers.py:116] Engine 000: Avg prompt throughput: 24.8 tokens/s, Avg generation throughput: 4.0 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.4%
INFO 03-01 23:40:38 [logger.py:42] Received request cmpl-6ff1d9ac009743888b179360daae6c7c-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:40:38 [async_llm.py:261] Added request cmpl-6ff1d9ac009743888b179360daae6c7c-0.
INFO 03-01 23:40:39 [logger.py:42] Received request cmpl-7a004727a6584d57b345fc2c82107207-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:40:39 [async_llm.py:261] Added request cmpl-7a004727a6584d57b345fc2c82107207-0.
INFO 03-01 23:40:40 [logger.py:42] Received request cmpl-758a790886d941988cc07038cdc8697f-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:40:40 [async_llm.py:261] Added request cmpl-758a790886d941988cc07038cdc8697f-0.
INFO 03-01 23:40:41 [logger.py:42] Received request cmpl-6e237f158f8a4287a2ea9b65214937e2-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:40:41 [async_llm.py:261] Added request cmpl-6e237f158f8a4287a2ea9b65214937e2-0.
INFO 03-01 23:40:42 [logger.py:42] Received request cmpl-f9f811a2e935424cb3a17e274d569922-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:40:42 [async_llm.py:261] Added request cmpl-f9f811a2e935424cb3a17e274d569922-0.
INFO 03-01 23:40:43 [logger.py:42] Received request cmpl-dd06ad97befb4aac98473a0f667bbe15-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:40:43 [async_llm.py:261] Added request cmpl-dd06ad97befb4aac98473a0f667bbe15-0.
INFO 03-01 23:40:45 [logger.py:42] Received request cmpl-aecd48e63d6f4f619809090488ef3c99-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:40:45 [async_llm.py:261] Added request cmpl-aecd48e63d6f4f619809090488ef3c99-0.
INFO 03-01 23:40:46 [logger.py:42] Received request cmpl-31a6e2991c0b407492ac92484ec826de-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:40:46 [async_llm.py:261] Added request cmpl-31a6e2991c0b407492ac92484ec826de-0.
INFO 03-01 23:40:47 [logger.py:42] Received request cmpl-8c5e4bb2c2c243128323872239130756-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:40:47 [async_llm.py:261] Added request cmpl-8c5e4bb2c2c243128323872239130756-0.
INFO 03-01 23:40:47 [loggers.py:116] Engine 000: Avg prompt throughput: 27.9 tokens/s, Avg generation throughput: 4.5 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.4%
INFO 03-01 23:40:48 [logger.py:42] Received request cmpl-7c2fc5ae4213403c9d2a1126451f056c-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:40:48 [async_llm.py:261] Added request cmpl-7c2fc5ae4213403c9d2a1126451f056c-0.
INFO 03-01 23:40:49 [logger.py:42] Received request cmpl-04e614a063584d089aee35904dce4e9a-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:40:49 [async_llm.py:261] Added request cmpl-04e614a063584d089aee35904dce4e9a-0.
INFO 03-01 23:40:50 [logger.py:42] Received request cmpl-f7312909b8814a519d5898867bc2a292-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:40:50 [async_llm.py:261] Added request cmpl-f7312909b8814a519d5898867bc2a292-0.
INFO 03-01 23:40:51 [logger.py:42] Received request cmpl-f3050de28dc04f63bc554ba8ad9a4fe0-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:40:51 [async_llm.py:261] Added request cmpl-f3050de28dc04f63bc554ba8ad9a4fe0-0.
INFO 03-01 23:40:53 [logger.py:42] Received request cmpl-ae2665810766451aa13f80b306e000b0-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:40:53 [async_llm.py:261] Added request cmpl-ae2665810766451aa13f80b306e000b0-0.
INFO 03-01 23:40:54 [logger.py:42] Received request cmpl-5dff560b261b43cc89054475f8390c66-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:40:54 [async_llm.py:261] Added request cmpl-5dff560b261b43cc89054475f8390c66-0.
INFO 03-01 23:40:55 [logger.py:42] Received request cmpl-b2389e75915140ac87308c838f7f5d88-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:40:55 [async_llm.py:261] Added request cmpl-b2389e75915140ac87308c838f7f5d88-0.
INFO 03-01 23:40:56 [logger.py:42] Received request cmpl-c087a8d296d14836b23933e0388d688e-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:40:56 [async_llm.py:261] Added request cmpl-c087a8d296d14836b23933e0388d688e-0.
INFO 03-01 23:40:57 [logger.py:42] Received request cmpl-4fc9c92082664ca19c1301580eccbf3f-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:40:57 [async_llm.py:261] Added request cmpl-4fc9c92082664ca19c1301580eccbf3f-0.
INFO 03-01 23:40:57 [loggers.py:116] Engine 000: Avg prompt throughput: 27.9 tokens/s, Avg generation throughput: 4.5 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.5%
INFO 03-01 23:40:58 [logger.py:42] Received request cmpl-dcc477c2e1be42b1991bd55799c37fd8-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:40:58 [async_llm.py:261] Added request cmpl-dcc477c2e1be42b1991bd55799c37fd8-0.
INFO 03-01 23:41:00 [logger.py:42] Received request cmpl-f81bb4d34a04486399b82aa55d155b39-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:41:00 [async_llm.py:261] Added request cmpl-f81bb4d34a04486399b82aa55d155b39-0.
INFO 03-01 23:41:01 [logger.py:42] Received request cmpl-07a3c0a9593f40ef8fd761bd9c7abd78-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:41:01 [async_llm.py:261] Added request cmpl-07a3c0a9593f40ef8fd761bd9c7abd78-0.
INFO 03-01 23:41:02 [logger.py:42] Received request cmpl-5deae0c0aa27480c95ce780303aaffca-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:41:02 [async_llm.py:261] Added request cmpl-5deae0c0aa27480c95ce780303aaffca-0.
INFO 03-01 23:41:03 [logger.py:42] Received request cmpl-11d28efb8a3d48dd99417b6b34f5d0f9-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:41:03 [async_llm.py:261] Added request cmpl-11d28efb8a3d48dd99417b6b34f5d0f9-0.
INFO 03-01 23:41:04 [logger.py:42] Received request cmpl-8f19f54770a842d394b25434d367a480-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:41:04 [async_llm.py:261] Added request cmpl-8f19f54770a842d394b25434d367a480-0.
INFO 03-01 23:41:05 [logger.py:42] Received request cmpl-e4aa37c2401743dc9eb4200483d8ce84-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:41:05 [async_llm.py:261] Added request cmpl-e4aa37c2401743dc9eb4200483d8ce84-0.
INFO 03-01 23:41:06 [logger.py:42] Received request cmpl-916fbd44643c483299011379f704955c-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:41:06 [async_llm.py:261] Added request cmpl-916fbd44643c483299011379f704955c-0.
INFO 03-01 23:41:07 [loggers.py:116] Engine 000: Avg prompt throughput: 24.8 tokens/s, Avg generation throughput: 4.0 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.5%
INFO 03-01 23:41:08 [logger.py:42] Received request cmpl-aa39fafe0ea649e484ff0962a5dc5529-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:41:08 [async_llm.py:261] Added request cmpl-aa39fafe0ea649e484ff0962a5dc5529-0.
INFO 03-01 23:41:09 [logger.py:42] Received request cmpl-4ee90c3447494110907d54a718c75d8e-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:41:09 [async_llm.py:261] Added request cmpl-4ee90c3447494110907d54a718c75d8e-0.
INFO 03-01 23:41:10 [logger.py:42] Received request cmpl-d7fef6b7bf5a4cc19d8a388d3550942a-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:41:10 [async_llm.py:261] Added request cmpl-d7fef6b7bf5a4cc19d8a388d3550942a-0.
INFO 03-01 23:41:11 [logger.py:42] Received request cmpl-c7e843865ee942b5acf973ed442966ab-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:41:11 [async_llm.py:261] Added request cmpl-c7e843865ee942b5acf973ed442966ab-0.
INFO 03-01 23:41:12 [logger.py:42] Received request cmpl-97abffa893fc4c5d93f4b0c48c9f55a2-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:41:12 [async_llm.py:261] Added request cmpl-97abffa893fc4c5d93f4b0c48c9f55a2-0.
INFO 03-01 23:41:13 [logger.py:42] Received request cmpl-b5236f2e02a041e5a34c42a386626357-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:41:13 [async_llm.py:261] Added request cmpl-b5236f2e02a041e5a34c42a386626357-0.
INFO 03-01 23:41:15 [logger.py:42] Received request cmpl-9ff15ae3d0884d3281818d8f5495853c-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:41:15 [async_llm.py:261] Added request cmpl-9ff15ae3d0884d3281818d8f5495853c-0.
INFO 03-01 23:41:16 [logger.py:42] Received request cmpl-2d50c9207a794835ad065c35a52d8cdf-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:41:16 [async_llm.py:261] Added request cmpl-2d50c9207a794835ad065c35a52d8cdf-0.
INFO 03-01 23:41:17 [logger.py:42] Received request cmpl-fc456165a8d14632aabd405607d67a17-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:41:17 [async_llm.py:261] Added request cmpl-fc456165a8d14632aabd405607d67a17-0.
INFO 03-01 23:41:17 [loggers.py:116] Engine 000: Avg prompt throughput: 27.9 tokens/s, Avg generation throughput: 4.5 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.5%
INFO 03-01 23:41:18 [logger.py:42] Received request cmpl-777a3214b287494cb9feac2fbe213228-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:41:18 [async_llm.py:261] Added request cmpl-777a3214b287494cb9feac2fbe213228-0.
INFO 03-01 23:41:19 [logger.py:42] Received request cmpl-2cf28b0bac0d4ce096afb2020a04d198-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:41:19 [async_llm.py:261] Added request cmpl-2cf28b0bac0d4ce096afb2020a04d198-0.
INFO 03-01 23:41:20 [logger.py:42] Received request cmpl-fcbaeab05e414fc3a9bfd1b01906e328-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:41:20 [async_llm.py:261] Added request cmpl-fcbaeab05e414fc3a9bfd1b01906e328-0.
INFO 03-01 23:41:22 [logger.py:42] Received request cmpl-abb989f2ad8f42a2964768071f6142de-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:41:22 [async_llm.py:261] Added request cmpl-abb989f2ad8f42a2964768071f6142de-0.
INFO 03-01 23:41:23 [logger.py:42] Received request cmpl-b289736218654b76996e0901346d430d-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:41:23 [async_llm.py:261] Added request cmpl-b289736218654b76996e0901346d430d-0.
INFO 03-01 23:41:24 [logger.py:42] Received request cmpl-5d0582db04e24c84b8e4fee498dc76c3-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:41:24 [async_llm.py:261] Added request cmpl-5d0582db04e24c84b8e4fee498dc76c3-0.
INFO 03-01 23:41:25 [logger.py:42] Received request cmpl-2b1a67af149241aa9f9727ac10c82951-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:41:25 [async_llm.py:261] Added request cmpl-2b1a67af149241aa9f9727ac10c82951-0.
INFO 03-01 23:41:26 [logger.py:42] Received request cmpl-8b8572e61c5e44e58950170749de99bd-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:41:26 [async_llm.py:261] Added request cmpl-8b8572e61c5e44e58950170749de99bd-0.
INFO 03-01 23:41:27 [logger.py:42] Received request cmpl-46d0b95a385344be8eace1d8bdba1e73-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:41:27 [async_llm.py:261] Added request cmpl-46d0b95a385344be8eace1d8bdba1e73-0.
INFO 03-01 23:41:27 [loggers.py:116] Engine 000: Avg prompt throughput: 27.9 tokens/s, Avg generation throughput: 4.5 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.5%
INFO 03-01 23:41:28 [logger.py:42] Received request cmpl-5d22e79c17e740e28b2aaad719bb214d-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:41:28 [async_llm.py:261] Added request cmpl-5d22e79c17e740e28b2aaad719bb214d-0.
INFO 03-01 23:41:30 [logger.py:42] Received request cmpl-1c1efd77ee9e4084a80e9682d5638584-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:41:30 [async_llm.py:261] Added request cmpl-1c1efd77ee9e4084a80e9682d5638584-0.
INFO 03-01 23:41:31 [logger.py:42] Received request cmpl-5e756e7ab53a484cb254c963122e6c34-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:41:31 [async_llm.py:261] Added request cmpl-5e756e7ab53a484cb254c963122e6c34-0.
INFO 03-01 23:41:32 [logger.py:42] Received request cmpl-602fd2c0029e493e80f328dcc8f2506b-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:41:32 [async_llm.py:261] Added request cmpl-602fd2c0029e493e80f328dcc8f2506b-0.
INFO 03-01 23:41:33 [logger.py:42] Received request cmpl-f372e7089c714114829b8e45a411a52b-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:41:33 [async_llm.py:261] Added request cmpl-f372e7089c714114829b8e45a411a52b-0.
INFO 03-01 23:41:34 [logger.py:42] Received request cmpl-63d16e5650944be9a9b621abe8bb412f-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:41:34 [async_llm.py:261] Added request cmpl-63d16e5650944be9a9b621abe8bb412f-0.
INFO 03-01 23:41:35 [logger.py:42] Received request cmpl-6f9834621d8e4fd5b6b3c83d25b51ab0-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:41:35 [async_llm.py:261] Added request cmpl-6f9834621d8e4fd5b6b3c83d25b51ab0-0.
INFO 03-01 23:41:37 [logger.py:42] Received request cmpl-36899538630443fe874aaf4e3c71d095-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:41:37 [async_llm.py:261] Added request cmpl-36899538630443fe874aaf4e3c71d095-0.
INFO 03-01 23:41:37 [loggers.py:116] Engine 000: Avg prompt throughput: 24.8 tokens/s, Avg generation throughput: 4.0 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.5%
INFO 03-01 23:41:38 [logger.py:42] Received request cmpl-3dc1b6400a294f8189405fd3287e2ad7-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:41:38 [async_llm.py:261] Added request cmpl-3dc1b6400a294f8189405fd3287e2ad7-0.
INFO 03-01 23:41:39 [logger.py:42] Received request cmpl-230a74d3c40646bc93b0ac0019115a0e-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:41:39 [async_llm.py:261] Added request cmpl-230a74d3c40646bc93b0ac0019115a0e-0.
INFO 03-01 23:41:40 [logger.py:42] Received request cmpl-266103bd02f84590994d807afeb7e86c-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:41:40 [async_llm.py:261] Added request cmpl-266103bd02f84590994d807afeb7e86c-0.
INFO 03-01 23:41:41 [logger.py:42] Received request cmpl-f0b297c386c44ae6a1902b6bcf000d62-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:41:41 [async_llm.py:261] Added request cmpl-f0b297c386c44ae6a1902b6bcf000d62-0.
INFO 03-01 23:41:42 [logger.py:42] Received request cmpl-d4f44f59855e44229dc453b2639a489e-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:41:42 [async_llm.py:261] Added request cmpl-d4f44f59855e44229dc453b2639a489e-0.
INFO 03-01 23:41:43 [logger.py:42] Received request cmpl-6dc43a07ca264db0bd5ab12b5badd7e7-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:41:43 [async_llm.py:261] Added request cmpl-6dc43a07ca264db0bd5ab12b5badd7e7-0.
INFO 03-01 23:41:45 [logger.py:42] Received request cmpl-c1ff3bfaaa7a4057a1c970ee4dad9912-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:41:45 [async_llm.py:261] Added request cmpl-c1ff3bfaaa7a4057a1c970ee4dad9912-0.
INFO 03-01 23:41:46 [logger.py:42] Received request cmpl-a69de42bb03a4567a9a7935910277570-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:41:46 [async_llm.py:261] Added request cmpl-a69de42bb03a4567a9a7935910277570-0.
INFO 03-01 23:41:47 [logger.py:42] Received request cmpl-1b3e6541284b4cf592bb2fd6b597eb27-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:41:47 [async_llm.py:261] Added request cmpl-1b3e6541284b4cf592bb2fd6b597eb27-0.
INFO 03-01 23:41:47 [loggers.py:116] Engine 000: Avg prompt throughput: 27.9 tokens/s, Avg generation throughput: 4.5 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.5%
INFO 03-01 23:41:48 [logger.py:42] Received request cmpl-476134199b584ba0bcd228235f17391a-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:41:48 [async_llm.py:261] Added request cmpl-476134199b584ba0bcd228235f17391a-0.
INFO 03-01 23:41:49 [logger.py:42] Received request cmpl-3f81cfa6e59b432c92d7438095f210a4-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:41:49 [async_llm.py:261] Added request cmpl-3f81cfa6e59b432c92d7438095f210a4-0.
INFO 03-01 23:41:50 [logger.py:42] Received request cmpl-6614a96ec8c94efb8f861ac4a3523b7f-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:41:50 [async_llm.py:261] Added request cmpl-6614a96ec8c94efb8f861ac4a3523b7f-0.
INFO 03-01 23:41:52 [logger.py:42] Received request cmpl-b5ccaab7ea334590b6d16d5ccd8a4e58-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:41:52 [async_llm.py:261] Added request cmpl-b5ccaab7ea334590b6d16d5ccd8a4e58-0.
INFO 03-01 23:41:53 [logger.py:42] Received request cmpl-bd2f8ea11acd46879460740f40d8584e-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:41:53 [async_llm.py:261] Added request cmpl-bd2f8ea11acd46879460740f40d8584e-0.
INFO 03-01 23:41:54 [logger.py:42] Received request cmpl-6dac31d94c7343649ec8122465e18e80-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:41:54 [async_llm.py:261] Added request cmpl-6dac31d94c7343649ec8122465e18e80-0.
INFO 03-01 23:41:55 [logger.py:42] Received request cmpl-8d35a86a159744bba2ab78f70a2b8a75-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:41:55 [async_llm.py:261] Added request cmpl-8d35a86a159744bba2ab78f70a2b8a75-0.
INFO 03-01 23:41:56 [logger.py:42] Received request cmpl-9f50d1f0935e4602ad906327e7383edb-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:41:56 [async_llm.py:261] Added request cmpl-9f50d1f0935e4602ad906327e7383edb-0.
INFO 03-01 23:41:57 [logger.py:42] Received request cmpl-ee4ee66d9b004fbf8fb9fd233831c087-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:41:57 [async_llm.py:261] Added request cmpl-ee4ee66d9b004fbf8fb9fd233831c087-0.
INFO 03-01 23:41:57 [loggers.py:116] Engine 000: Avg prompt throughput: 27.9 tokens/s, Avg generation throughput: 4.5 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.5%
INFO 03-01 23:41:58 [logger.py:42] Received request cmpl-378af6d0ca9840b590dd42e7d91b10d8-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:41:58 [async_llm.py:261] Added request cmpl-378af6d0ca9840b590dd42e7d91b10d8-0.
INFO 03-01 23:42:00 [logger.py:42] Received request cmpl-fd10b50300f64215a7c5ee615fcb8d3f-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:42:00 [async_llm.py:261] Added request cmpl-fd10b50300f64215a7c5ee615fcb8d3f-0.
INFO 03-01 23:42:01 [logger.py:42] Received request cmpl-28204ea0cea1445abf25cb0faf4064b8-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:42:01 [async_llm.py:261] Added request cmpl-28204ea0cea1445abf25cb0faf4064b8-0.
INFO 03-01 23:42:02 [logger.py:42] Received request cmpl-b587f151d0494b938995844adcac1048-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:42:02 [async_llm.py:261] Added request cmpl-b587f151d0494b938995844adcac1048-0.
INFO 03-01 23:42:03 [logger.py:42] Received request cmpl-45aa847af2cf497498726daa14507886-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:42:03 [async_llm.py:261] Added request cmpl-45aa847af2cf497498726daa14507886-0.
INFO 03-01 23:42:04 [logger.py:42] Received request cmpl-38304a0ed5904eb096d4db75c1581c6f-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:42:04 [async_llm.py:261] Added request cmpl-38304a0ed5904eb096d4db75c1581c6f-0.
INFO 03-01 23:42:05 [logger.py:42] Received request cmpl-cd039582d99e47158ffa1e816fac261d-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:42:05 [async_llm.py:261] Added request cmpl-cd039582d99e47158ffa1e816fac261d-0.
INFO 03-01 23:42:07 [logger.py:42] Received request cmpl-76dc3a2a96684daf982ed9f334806f7a-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:42:07 [async_llm.py:261] Added request cmpl-76dc3a2a96684daf982ed9f334806f7a-0.
INFO 03-01 23:42:07 [loggers.py:116] Engine 000: Avg prompt throughput: 24.8 tokens/s, Avg generation throughput: 4.0 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.5%
INFO 03-01 23:42:08 [logger.py:42] Received request cmpl-2d6d1af586d74f75a6958c0c2c3fcae6-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:42:08 [async_llm.py:261] Added request cmpl-2d6d1af586d74f75a6958c0c2c3fcae6-0.
INFO 03-01 23:42:09 [logger.py:42] Received request cmpl-5696e676be894a23956052f85f296d64-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:42:09 [async_llm.py:261] Added request cmpl-5696e676be894a23956052f85f296d64-0.
INFO 03-01 23:42:10 [logger.py:42] Received request cmpl-95cd24df58c541a1b4c26cfe2022ca34-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:42:10 [async_llm.py:261] Added request cmpl-95cd24df58c541a1b4c26cfe2022ca34-0.
INFO 03-01 23:42:11 [logger.py:42] Received request cmpl-556f94ac657d4f4ebd0c12afacb6f0e5-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:42:11 [async_llm.py:261] Added request cmpl-556f94ac657d4f4ebd0c12afacb6f0e5-0.
INFO 03-01 23:42:12 [logger.py:42] Received request cmpl-a079f2a395a74818a6511009eda969f9-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:42:12 [async_llm.py:261] Added request cmpl-a079f2a395a74818a6511009eda969f9-0.
INFO 03-01 23:42:14 [logger.py:42] Received request cmpl-ca507585a2ae404f9c8ef863a1b8896c-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:42:14 [async_llm.py:261] Added request cmpl-ca507585a2ae404f9c8ef863a1b8896c-0.
INFO 03-01 23:42:15 [logger.py:42] Received request cmpl-1d41b06a94e5406ebb7d424024e3d17b-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:42:15 [async_llm.py:261] Added request cmpl-1d41b06a94e5406ebb7d424024e3d17b-0.
INFO 03-01 23:42:16 [logger.py:42] Received request cmpl-025083917f3d458e8fc6fa4f255ebce7-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:42:16 [async_llm.py:261] Added request cmpl-025083917f3d458e8fc6fa4f255ebce7-0.
INFO 03-01 23:42:17 [logger.py:42] Received request cmpl-2ca8e554189d483cae6f0f243eee31f8-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:42:17 [async_llm.py:261] Added request cmpl-2ca8e554189d483cae6f0f243eee31f8-0.
INFO 03-01 23:42:17 [loggers.py:116] Engine 000: Avg prompt throughput: 27.9 tokens/s, Avg generation throughput: 4.5 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.5%
INFO 03-01 23:42:18 [logger.py:42] Received request cmpl-07e216e92afb4b51a546ba19914a645e-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:42:18 [async_llm.py:261] Added request cmpl-07e216e92afb4b51a546ba19914a645e-0.
INFO 03-01 23:42:19 [logger.py:42] Received request cmpl-b975d4bc58dc490aa40ddfa885bcf339-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:42:19 [async_llm.py:261] Added request cmpl-b975d4bc58dc490aa40ddfa885bcf339-0.
INFO 03-01 23:42:20 [logger.py:42] Received request cmpl-c52aa1ab99e6413a8c72641b3809e1fa-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:42:20 [async_llm.py:261] Added request cmpl-c52aa1ab99e6413a8c72641b3809e1fa-0.
INFO 03-01 23:42:22 [logger.py:42] Received request cmpl-a663730d7f1247ccbd7447f72b23af09-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:42:22 [async_llm.py:261] Added request cmpl-a663730d7f1247ccbd7447f72b23af09-0.
INFO 03-01 23:42:23 [logger.py:42] Received request cmpl-efcbca24cd12415792f46eefc4697a0d-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:42:23 [async_llm.py:261] Added request cmpl-efcbca24cd12415792f46eefc4697a0d-0.
INFO 03-01 23:42:24 [logger.py:42] Received request cmpl-06fbd391e9ec4cb495bfa5d7dd4357fb-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:42:24 [async_llm.py:261] Added request cmpl-06fbd391e9ec4cb495bfa5d7dd4357fb-0.
INFO 03-01 23:42:25 [logger.py:42] Received request cmpl-aa8fd1e988244e83874ea82e7a464472-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:42:25 [async_llm.py:261] Added request cmpl-aa8fd1e988244e83874ea82e7a464472-0.
INFO 03-01 23:42:26 [logger.py:42] Received request cmpl-7fcf823abe044a0bb992843c739935db-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:42:26 [async_llm.py:261] Added request cmpl-7fcf823abe044a0bb992843c739935db-0.
INFO 03-01 23:42:27 [logger.py:42] Received request cmpl-342e469b23fa4da69dd5fb7718590f40-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:42:27 [async_llm.py:261] Added request cmpl-342e469b23fa4da69dd5fb7718590f40-0.
INFO 03-01 23:42:27 [loggers.py:116] Engine 000: Avg prompt throughput: 27.9 tokens/s, Avg generation throughput: 4.4 tokens/s, Running: 1 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.4%, Prefix cache hit rate: 51.5%
INFO 03-01 23:42:29 [logger.py:42] Received request cmpl-ae091d30805741f1a3a7975f03926bc2-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:42:29 [async_llm.py:261] Added request cmpl-ae091d30805741f1a3a7975f03926bc2-0.
INFO 03-01 23:42:30 [logger.py:42] Received request cmpl-12ce8ae7713743b89291b9d163a474b3-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:42:30 [async_llm.py:261] Added request cmpl-12ce8ae7713743b89291b9d163a474b3-0.
INFO 03-01 23:42:31 [logger.py:42] Received request cmpl-380049b225784aa98529efe1b3122466-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:42:31 [async_llm.py:261] Added request cmpl-380049b225784aa98529efe1b3122466-0.
INFO 03-01 23:42:32 [logger.py:42] Received request cmpl-76a6545c920941289fe19a0eae08a71b-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:42:32 [async_llm.py:261] Added request cmpl-76a6545c920941289fe19a0eae08a71b-0.
INFO 03-01 23:42:33 [logger.py:42] Received request cmpl-9483c24c46aa4f2780c7141827a6998a-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:42:33 [async_llm.py:261] Added request cmpl-9483c24c46aa4f2780c7141827a6998a-0.
INFO 03-01 23:42:34 [logger.py:42] Received request cmpl-54cd915b443a4c6e9e5c82a647756463-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:42:34 [async_llm.py:261] Added request cmpl-54cd915b443a4c6e9e5c82a647756463-0.
INFO 03-01 23:42:35 [logger.py:42] Received request cmpl-7c548896c06343058582e4046708967d-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:42:35 [async_llm.py:261] Added request cmpl-7c548896c06343058582e4046708967d-0.
INFO 03-01 23:42:37 [logger.py:42] Received request cmpl-b8cdd85da7ce461a898a6efef477c8d3-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:42:37 [async_llm.py:261] Added request cmpl-b8cdd85da7ce461a898a6efef477c8d3-0.
INFO 03-01 23:42:37 [loggers.py:116] Engine 000: Avg prompt throughput: 24.8 tokens/s, Avg generation throughput: 4.1 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.5%
INFO 03-01 23:42:38 [logger.py:42] Received request cmpl-5f44e977a8fc48849f4c757e99567750-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:42:38 [async_llm.py:261] Added request cmpl-5f44e977a8fc48849f4c757e99567750-0.
INFO 03-01 23:42:39 [logger.py:42] Received request cmpl-c81e50bc849041aab07a8c9d7df5a6d5-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:42:39 [async_llm.py:261] Added request cmpl-c81e50bc849041aab07a8c9d7df5a6d5-0.
INFO 03-01 23:42:40 [logger.py:42] Received request cmpl-5cba948b85db48c8878f1e21970c1ee3-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:42:40 [async_llm.py:261] Added request cmpl-5cba948b85db48c8878f1e21970c1ee3-0.
INFO 03-01 23:42:41 [logger.py:42] Received request cmpl-8b20dd3103a740b7ae3128f9cc94fd01-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:42:41 [async_llm.py:261] Added request cmpl-8b20dd3103a740b7ae3128f9cc94fd01-0.
INFO 03-01 23:42:42 [logger.py:42] Received request cmpl-dd4bd8436741450c8a132947ce0c641b-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:42:42 [async_llm.py:261] Added request cmpl-dd4bd8436741450c8a132947ce0c641b-0.
INFO 03-01 23:42:44 [logger.py:42] Received request cmpl-0e09eef3cb2c487ca1d16979cc91776e-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:42:44 [async_llm.py:261] Added request cmpl-0e09eef3cb2c487ca1d16979cc91776e-0.
INFO 03-01 23:42:45 [logger.py:42] Received request cmpl-899548c11c434fc0ac5eb1f43fa5ad04-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:42:45 [async_llm.py:261] Added request cmpl-899548c11c434fc0ac5eb1f43fa5ad04-0.
INFO 03-01 23:42:46 [logger.py:42] Received request cmpl-5ed754986d754a1983a936eaaf5f1a52-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:42:46 [async_llm.py:261] Added request cmpl-5ed754986d754a1983a936eaaf5f1a52-0.
INFO 03-01 23:42:47 [logger.py:42] Received request cmpl-a9cfaf2271d64d4ab144073c135ce399-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:42:47 [async_llm.py:261] Added request cmpl-a9cfaf2271d64d4ab144073c135ce399-0.
INFO 03-01 23:42:48 [loggers.py:116] Engine 000: Avg prompt throughput: 27.9 tokens/s, Avg generation throughput: 4.5 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.5%
INFO 03-01 23:42:48 [logger.py:42] Received request cmpl-6ff0f6e7b72b48b697d44b72ec5c0537-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:42:48 [async_llm.py:261] Added request cmpl-6ff0f6e7b72b48b697d44b72ec5c0537-0.
INFO 03-01 23:42:49 [logger.py:42] Received request cmpl-d063b6f9c66f4ae38fbb41bc3faee594-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:42:49 [async_llm.py:261] Added request cmpl-d063b6f9c66f4ae38fbb41bc3faee594-0.
INFO 03-01 23:42:50 [logger.py:42] Received request cmpl-e1b477ff3411421e9f17d2fea8ecd742-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:42:50 [async_llm.py:261] Added request cmpl-e1b477ff3411421e9f17d2fea8ecd742-0.
INFO 03-01 23:42:52 [logger.py:42] Received request cmpl-8d6cc8f0158b4893b046a483c1e64297-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:42:52 [async_llm.py:261] Added request cmpl-8d6cc8f0158b4893b046a483c1e64297-0.
INFO 03-01 23:42:53 [logger.py:42] Received request cmpl-5a29de4151e446e0a8df5cbeda60e2db-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:42:53 [async_llm.py:261] Added request cmpl-5a29de4151e446e0a8df5cbeda60e2db-0.
INFO 03-01 23:42:54 [logger.py:42] Received request cmpl-50d5412ee4cf4a1093743e0acfa2793c-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:42:54 [async_llm.py:261] Added request cmpl-50d5412ee4cf4a1093743e0acfa2793c-0.
INFO 03-01 23:42:55 [logger.py:42] Received request cmpl-9d8023ffaf0d42fb8e8c7f944fb2f7b9-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:42:55 [async_llm.py:261] Added request cmpl-9d8023ffaf0d42fb8e8c7f944fb2f7b9-0.
INFO 03-01 23:42:56 [logger.py:42] Received request cmpl-002ed52400b54e47b17a7ad9581fbf49-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:42:56 [async_llm.py:261] Added request cmpl-002ed52400b54e47b17a7ad9581fbf49-0.
INFO 03-01 23:42:57 [logger.py:42] Received request cmpl-3ad033a19e6649d98e38a458ff3513ae-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:42:57 [async_llm.py:261] Added request cmpl-3ad033a19e6649d98e38a458ff3513ae-0.
INFO 03-01 23:42:58 [loggers.py:116] Engine 000: Avg prompt throughput: 27.9 tokens/s, Avg generation throughput: 4.2 tokens/s, Running: 1 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.3%, Prefix cache hit rate: 51.5%
INFO 03-01 23:42:59 [logger.py:42] Received request cmpl-ce6a19d483714991b9a21110550a0dd5-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:42:59 [async_llm.py:261] Added request cmpl-ce6a19d483714991b9a21110550a0dd5-0.
INFO 03-01 23:43:00 [logger.py:42] Received request cmpl-1b612e17805841c1bad5ac4af5da038b-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:43:00 [async_llm.py:261] Added request cmpl-1b612e17805841c1bad5ac4af5da038b-0.
INFO 03-01 23:43:01 [logger.py:42] Received request cmpl-503119ab22ba4756893d3105ef0d49f5-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:43:01 [async_llm.py:261] Added request cmpl-503119ab22ba4756893d3105ef0d49f5-0.
INFO 03-01 23:43:02 [logger.py:42] Received request cmpl-e4e837c194934ca6adadfbe344c68a91-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:43:02 [async_llm.py:261] Added request cmpl-e4e837c194934ca6adadfbe344c68a91-0.
INFO 03-01 23:43:03 [logger.py:42] Received request cmpl-36051340ba2a4e6ba6f1422460a56306-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:43:03 [async_llm.py:261] Added request cmpl-36051340ba2a4e6ba6f1422460a56306-0.
INFO 03-01 23:43:04 [logger.py:42] Received request cmpl-6bdcdbaa80a1471ea9f29c2bba93dc62-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:43:04 [async_llm.py:261] Added request cmpl-6bdcdbaa80a1471ea9f29c2bba93dc62-0.
INFO 03-01 23:43:06 [logger.py:42] Received request cmpl-c668183bf655456a9f8e822b09387ed5-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:43:06 [async_llm.py:261] Added request cmpl-c668183bf655456a9f8e822b09387ed5-0.
INFO 03-01 23:43:07 [logger.py:42] Received request cmpl-cf6a2d0f458c4b4a97773ebd3e368891-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:43:07 [async_llm.py:261] Added request cmpl-cf6a2d0f458c4b4a97773ebd3e368891-0.
INFO 03-01 23:43:08 [loggers.py:116] Engine 000: Avg prompt throughput: 24.8 tokens/s, Avg generation throughput: 4.3 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.5%
INFO 03-01 23:43:08 [logger.py:42] Received request cmpl-87e32a1838d446ca9a7dc46b1a930766-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:43:08 [async_llm.py:261] Added request cmpl-87e32a1838d446ca9a7dc46b1a930766-0.
INFO 03-01 23:43:09 [logger.py:42] Received request cmpl-e1df8df7f5e644f3b8503b008f2850f4-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:43:09 [async_llm.py:261] Added request cmpl-e1df8df7f5e644f3b8503b008f2850f4-0.
INFO 03-01 23:43:10 [logger.py:42] Received request cmpl-aa92a479700b48389dd7f50795c2737c-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:43:10 [async_llm.py:261] Added request cmpl-aa92a479700b48389dd7f50795c2737c-0.
INFO 03-01 23:43:11 [logger.py:42] Received request cmpl-371f131e41a349d6b05d7ed993b2c388-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:43:11 [async_llm.py:261] Added request cmpl-371f131e41a349d6b05d7ed993b2c388-0.
INFO 03-01 23:43:12 [logger.py:42] Received request cmpl-5eab7b3782b74985b292b88c28302f4d-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:43:12 [async_llm.py:261] Added request cmpl-5eab7b3782b74985b292b88c28302f4d-0.
INFO 03-01 23:43:14 [logger.py:42] Received request cmpl-1c001bb521f84bd590464cb3a026eabd-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:43:14 [async_llm.py:261] Added request cmpl-1c001bb521f84bd590464cb3a026eabd-0.
INFO 03-01 23:43:15 [logger.py:42] Received request cmpl-40b9ab3379d4413f84e34e304600b368-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:43:15 [async_llm.py:261] Added request cmpl-40b9ab3379d4413f84e34e304600b368-0.
INFO 03-01 23:43:16 [logger.py:42] Received request cmpl-41e14dfa2c0f459f8d03cfb9a8d61fb6-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:43:16 [async_llm.py:261] Added request cmpl-41e14dfa2c0f459f8d03cfb9a8d61fb6-0.
INFO 03-01 23:43:17 [logger.py:42] Received request cmpl-603178d6e33a48cca5d7596cf401c8c2-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:43:17 [async_llm.py:261] Added request cmpl-603178d6e33a48cca5d7596cf401c8c2-0.
INFO 03-01 23:43:18 [loggers.py:116] Engine 000: Avg prompt throughput: 27.9 tokens/s, Avg generation throughput: 4.5 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.5%
INFO 03-01 23:43:18 [logger.py:42] Received request cmpl-c427b28f780145d7a9120809942d890e-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:43:18 [async_llm.py:261] Added request cmpl-c427b28f780145d7a9120809942d890e-0.
INFO 03-01 23:43:19 [logger.py:42] Received request cmpl-70790424b8cb44d4ba2fc667cd796f17-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:43:19 [async_llm.py:261] Added request cmpl-70790424b8cb44d4ba2fc667cd796f17-0.
INFO 03-01 23:43:21 [logger.py:42] Received request cmpl-b1e04259ec3f45549f999db7f436d16e-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:43:21 [async_llm.py:261] Added request cmpl-b1e04259ec3f45549f999db7f436d16e-0.
INFO 03-01 23:43:22 [logger.py:42] Received request cmpl-bd3f3e88e27348fbb0c89bae286e2e50-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:43:22 [async_llm.py:261] Added request cmpl-bd3f3e88e27348fbb0c89bae286e2e50-0.
INFO 03-01 23:43:23 [logger.py:42] Received request cmpl-cbf99ff8f54d4f1aa055be2341940547-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:43:23 [async_llm.py:261] Added request cmpl-cbf99ff8f54d4f1aa055be2341940547-0.
INFO 03-01 23:43:24 [logger.py:42] Received request cmpl-8861ae789a564298872b60a241ffc459-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:43:24 [async_llm.py:261] Added request cmpl-8861ae789a564298872b60a241ffc459-0.
INFO 03-01 23:43:25 [logger.py:42] Received request cmpl-cf17e542a84641a480509672474ab327-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:43:25 [async_llm.py:261] Added request cmpl-cf17e542a84641a480509672474ab327-0.
INFO 03-01 23:43:26 [logger.py:42] Received request cmpl-eb0b0c9235154080894d6cdb729792ce-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:43:26 [async_llm.py:261] Added request cmpl-eb0b0c9235154080894d6cdb729792ce-0.
INFO 03-01 23:43:27 [logger.py:42] Received request cmpl-2a0a2f6282b045358a1f14834228901c-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:43:27 [async_llm.py:261] Added request cmpl-2a0a2f6282b045358a1f14834228901c-0.
INFO 03-01 23:43:28 [loggers.py:116] Engine 000: Avg prompt throughput: 27.9 tokens/s, Avg generation throughput: 4.1 tokens/s, Running: 1 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.3%, Prefix cache hit rate: 51.5%
INFO 03-01 23:43:29 [logger.py:42] Received request cmpl-13b7c90d8d914adda424e9a059cc8c9b-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:43:29 [async_llm.py:261] Added request cmpl-13b7c90d8d914adda424e9a059cc8c9b-0.
INFO 03-01 23:43:30 [logger.py:42] Received request cmpl-3c184a4d29134777b481017aa89505b0-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:43:30 [async_llm.py:261] Added request cmpl-3c184a4d29134777b481017aa89505b0-0.
INFO 03-01 23:43:31 [logger.py:42] Received request cmpl-ea568c8b6d4c47818cd52078a441ed3b-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:43:31 [async_llm.py:261] Added request cmpl-ea568c8b6d4c47818cd52078a441ed3b-0.
INFO 03-01 23:43:32 [logger.py:42] Received request cmpl-2d002fb232f44e24b67541219847e19b-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:43:32 [async_llm.py:261] Added request cmpl-2d002fb232f44e24b67541219847e19b-0.
INFO 03-01 23:43:33 [logger.py:42] Received request cmpl-6db889a2ad09413e90883c0782f24921-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:43:33 [async_llm.py:261] Added request cmpl-6db889a2ad09413e90883c0782f24921-0.
INFO 03-01 23:43:34 [logger.py:42] Received request cmpl-59933d616bd5432fbbaa127adcb0c19b-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:43:34 [async_llm.py:261] Added request cmpl-59933d616bd5432fbbaa127adcb0c19b-0.
INFO 03-01 23:43:36 [logger.py:42] Received request cmpl-ff093f4e8cb74b40978273ad659d1e31-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:43:36 [async_llm.py:261] Added request cmpl-ff093f4e8cb74b40978273ad659d1e31-0.
INFO 03-01 23:43:37 [logger.py:42] Received request cmpl-5a2e49431a0e4cf58096e4f2ad1d1067-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:43:37 [async_llm.py:261] Added request cmpl-5a2e49431a0e4cf58096e4f2ad1d1067-0.
INFO 03-01 23:43:38 [loggers.py:116] Engine 000: Avg prompt throughput: 24.8 tokens/s, Avg generation throughput: 4.4 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.5%
INFO 03-01 23:43:38 [logger.py:42] Received request cmpl-79260000c9774ebe9333d9f9c322b205-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:43:38 [async_llm.py:261] Added request cmpl-79260000c9774ebe9333d9f9c322b205-0.
INFO 03-01 23:43:39 [logger.py:42] Received request cmpl-c9370203b0364cfcba3a5e12a14f44dc-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:43:39 [async_llm.py:261] Added request cmpl-c9370203b0364cfcba3a5e12a14f44dc-0.
INFO 03-01 23:43:40 [logger.py:42] Received request cmpl-5a55d153b59d4864b6de307945b5bd31-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:43:40 [async_llm.py:261] Added request cmpl-5a55d153b59d4864b6de307945b5bd31-0.
INFO 03-01 23:43:41 [logger.py:42] Received request cmpl-266c6686fef34328b0a0fcce40ca1c11-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:43:41 [async_llm.py:261] Added request cmpl-266c6686fef34328b0a0fcce40ca1c11-0.
INFO 03-01 23:43:42 [logger.py:42] Received request cmpl-11d6f667135943a2ac178694f9d7cbf3-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:43:42 [async_llm.py:261] Added request cmpl-11d6f667135943a2ac178694f9d7cbf3-0.
INFO 03-01 23:43:44 [logger.py:42] Received request cmpl-6eaf1363b5a94bd88022472edf2eedcf-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:43:44 [async_llm.py:261] Added request cmpl-6eaf1363b5a94bd88022472edf2eedcf-0.
INFO 03-01 23:43:45 [logger.py:42] Received request cmpl-86c44805e84f428cadeddf0e297bbc78-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:43:45 [async_llm.py:261] Added request cmpl-86c44805e84f428cadeddf0e297bbc78-0.
INFO 03-01 23:43:46 [logger.py:42] Received request cmpl-68fc18dc961c4e47b09e83694b7fdec2-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:43:46 [async_llm.py:261] Added request cmpl-68fc18dc961c4e47b09e83694b7fdec2-0.
INFO 03-01 23:43:47 [logger.py:42] Received request cmpl-311db78b11374c2c8f957860d95b766f-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:43:47 [async_llm.py:261] Added request cmpl-311db78b11374c2c8f957860d95b766f-0.
INFO 03-01 23:43:48 [loggers.py:116] Engine 000: Avg prompt throughput: 27.9 tokens/s, Avg generation throughput: 4.5 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.5%
INFO 03-01 23:43:48 [logger.py:42] Received request cmpl-444f7aec658b4721919ec680f36e1c5d-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:43:48 [async_llm.py:261] Added request cmpl-444f7aec658b4721919ec680f36e1c5d-0.
INFO 03-01 23:43:49 [logger.py:42] Received request cmpl-4e782dfd1ad14ac3b7d824ea8c8f2d38-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:43:49 [async_llm.py:261] Added request cmpl-4e782dfd1ad14ac3b7d824ea8c8f2d38-0.
INFO 03-01 23:43:51 [logger.py:42] Received request cmpl-812b5dbfbcec425389a1aa2cc724d55f-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:43:51 [async_llm.py:261] Added request cmpl-812b5dbfbcec425389a1aa2cc724d55f-0.
INFO 03-01 23:43:52 [logger.py:42] Received request cmpl-cb99c07c4a56487cb7ccafd0f068f916-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:43:52 [async_llm.py:261] Added request cmpl-cb99c07c4a56487cb7ccafd0f068f916-0.
INFO 03-01 23:43:53 [logger.py:42] Received request cmpl-0c48a36ddd384963b06c500ffe0ee51f-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:43:53 [async_llm.py:261] Added request cmpl-0c48a36ddd384963b06c500ffe0ee51f-0.
INFO 03-01 23:43:54 [logger.py:42] Received request cmpl-c3771f41487a4a23a3b5ab2550325d6d-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:43:54 [async_llm.py:261] Added request cmpl-c3771f41487a4a23a3b5ab2550325d6d-0.
INFO 03-01 23:43:55 [logger.py:42] Received request cmpl-b7bd543b090245768bc9a062531c0f83-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:43:55 [async_llm.py:261] Added request cmpl-b7bd543b090245768bc9a062531c0f83-0.
INFO 03-01 23:43:56 [logger.py:42] Received request cmpl-f9fc7498819d40f288e0a9d065e37e44-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:43:56 [async_llm.py:261] Added request cmpl-f9fc7498819d40f288e0a9d065e37e44-0.
INFO 03-01 23:43:58 [loggers.py:116] Engine 000: Avg prompt throughput: 24.8 tokens/s, Avg generation throughput: 4.0 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.5%
INFO 03-01 23:43:58 [logger.py:42] Received request cmpl-f566170d4b1743febefef3132f9f6a22-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:43:58 [async_llm.py:261] Added request cmpl-f566170d4b1743febefef3132f9f6a22-0.
INFO 03-01 23:43:59 [logger.py:42] Received request cmpl-0444a7f9ef95499aa188eda87e2010bd-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:43:59 [async_llm.py:261] Added request cmpl-0444a7f9ef95499aa188eda87e2010bd-0.
INFO 03-01 23:44:00 [logger.py:42] Received request cmpl-c3065aa5de204ac5baa2c51f51767785-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:44:00 [async_llm.py:261] Added request cmpl-c3065aa5de204ac5baa2c51f51767785-0.
INFO 03-01 23:44:01 [logger.py:42] Received request cmpl-b7104cb543504f4ba6cdb1e210ed6dea-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:44:01 [async_llm.py:261] Added request cmpl-b7104cb543504f4ba6cdb1e210ed6dea-0.
INFO 03-01 23:44:02 [logger.py:42] Received request cmpl-c05a06bf57ba49048e50b5fa51e0ced3-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:44:02 [async_llm.py:261] Added request cmpl-c05a06bf57ba49048e50b5fa51e0ced3-0.
INFO 03-01 23:44:03 [logger.py:42] Received request cmpl-74db9b35e13a4cc3b116b50a41cd889a-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:44:03 [async_llm.py:261] Added request cmpl-74db9b35e13a4cc3b116b50a41cd889a-0.
INFO 03-01 23:44:04 [logger.py:42] Received request cmpl-ed537b959e4b42beb4937dc0dfd0954d-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:44:04 [async_llm.py:261] Added request cmpl-ed537b959e4b42beb4937dc0dfd0954d-0.
INFO 03-01 23:44:06 [logger.py:42] Received request cmpl-861a576ec4414d3782005074aa066ac7-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:44:06 [async_llm.py:261] Added request cmpl-861a576ec4414d3782005074aa066ac7-0.
INFO 03-01 23:44:07 [logger.py:42] Received request cmpl-6e15d6ab76594e59bdae4f17033085c6-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:44:07 [async_llm.py:261] Added request cmpl-6e15d6ab76594e59bdae4f17033085c6-0.
INFO 03-01 23:44:08 [loggers.py:116] Engine 000: Avg prompt throughput: 27.9 tokens/s, Avg generation throughput: 4.5 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.5%
INFO 03-01 23:44:08 [logger.py:42] Received request cmpl-01dec02b46d1485a8e35f36452d4938f-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:44:08 [async_llm.py:261] Added request cmpl-01dec02b46d1485a8e35f36452d4938f-0.
INFO 03-01 23:44:09 [logger.py:42] Received request cmpl-f1b672bf48c8467d87d7737559507668-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:44:09 [async_llm.py:261] Added request cmpl-f1b672bf48c8467d87d7737559507668-0.
INFO 03-01 23:44:10 [logger.py:42] Received request cmpl-6c0649b6763d4a5e971707834eeb0c02-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:44:10 [async_llm.py:261] Added request cmpl-6c0649b6763d4a5e971707834eeb0c02-0.
INFO 03-01 23:44:11 [logger.py:42] Received request cmpl-f7674a704f5b4fa29b5f2cfda8f0d0c1-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:44:11 [async_llm.py:261] Added request cmpl-f7674a704f5b4fa29b5f2cfda8f0d0c1-0.
INFO 03-01 23:44:13 [logger.py:42] Received request cmpl-b601130653f742e6af022d01fb4e9b2c-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:44:13 [async_llm.py:261] Added request cmpl-b601130653f742e6af022d01fb4e9b2c-0.
INFO 03-01 23:44:14 [logger.py:42] Received request cmpl-0014f7ca1e1f4f6299742a83bbfb97cb-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:44:14 [async_llm.py:261] Added request cmpl-0014f7ca1e1f4f6299742a83bbfb97cb-0.
INFO 03-01 23:44:15 [logger.py:42] Received request cmpl-761447c714684d5fae447aba77756fa7-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:44:15 [async_llm.py:261] Added request cmpl-761447c714684d5fae447aba77756fa7-0.
INFO 03-01 23:44:16 [logger.py:42] Received request cmpl-388e03b543b04f368dad79345b712104-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:44:16 [async_llm.py:261] Added request cmpl-388e03b543b04f368dad79345b712104-0.
INFO 03-01 23:44:17 [logger.py:42] Received request cmpl-30e8e52044aa47128a9158c5976678c3-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:44:17 [async_llm.py:261] Added request cmpl-30e8e52044aa47128a9158c5976678c3-0.
INFO 03-01 23:44:18 [loggers.py:116] Engine 000: Avg prompt throughput: 27.9 tokens/s, Avg generation throughput: 4.5 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.5%
INFO 03-01 23:44:18 [logger.py:42] Received request cmpl-d7cd86cf2ab14a8581f9095fb6e6f4b2-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:44:18 [async_llm.py:261] Added request cmpl-d7cd86cf2ab14a8581f9095fb6e6f4b2-0.
INFO 03-01 23:44:19 [logger.py:42] Received request cmpl-bc82bf832d184e3abf94a33c38ec1ebf-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:44:19 [async_llm.py:261] Added request cmpl-bc82bf832d184e3abf94a33c38ec1ebf-0.
INFO 03-01 23:44:21 [logger.py:42] Received request cmpl-74c11f0d8a1142ad91ff3a8606f1c8d9-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:44:21 [async_llm.py:261] Added request cmpl-74c11f0d8a1142ad91ff3a8606f1c8d9-0.
INFO 03-01 23:44:22 [logger.py:42] Received request cmpl-4b060802ba3840d383a2d0c37d0d26a9-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:44:22 [async_llm.py:261] Added request cmpl-4b060802ba3840d383a2d0c37d0d26a9-0.
INFO 03-01 23:44:23 [logger.py:42] Received request cmpl-a36bf922e53f41a3abf6380488eff90d-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:44:23 [async_llm.py:261] Added request cmpl-a36bf922e53f41a3abf6380488eff90d-0.
INFO 03-01 23:44:24 [logger.py:42] Received request cmpl-4a686423b54243fd955813b40a064d19-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:44:24 [async_llm.py:261] Added request cmpl-4a686423b54243fd955813b40a064d19-0.
INFO 03-01 23:44:25 [logger.py:42] Received request cmpl-560a120195ad489c9856ebd13f76d7ae-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:44:25 [async_llm.py:261] Added request cmpl-560a120195ad489c9856ebd13f76d7ae-0.
INFO 03-01 23:44:26 [logger.py:42] Received request cmpl-6e33fe2e7b7c4bf1ba2edbcb28fdebfb-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:44:26 [async_llm.py:261] Added request cmpl-6e33fe2e7b7c4bf1ba2edbcb28fdebfb-0.
INFO 03-01 23:44:28 [loggers.py:116] Engine 000: Avg prompt throughput: 24.8 tokens/s, Avg generation throughput: 4.0 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.5%
INFO 03-01 23:44:28 [logger.py:42] Received request cmpl-444514f93f154321be6121fdb33a1de9-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:44:28 [async_llm.py:261] Added request cmpl-444514f93f154321be6121fdb33a1de9-0.
INFO 03-01 23:44:29 [logger.py:42] Received request cmpl-249078c0e70d41d8bafde051cb343d72-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:44:29 [async_llm.py:261] Added request cmpl-249078c0e70d41d8bafde051cb343d72-0.
INFO 03-01 23:44:30 [logger.py:42] Received request cmpl-2c490a5698e74ef5b830e2a101afecbe-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:44:30 [async_llm.py:261] Added request cmpl-2c490a5698e74ef5b830e2a101afecbe-0.
INFO 03-01 23:44:31 [logger.py:42] Received request cmpl-35180304be0d4c52af4efa3d7002309f-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:44:31 [async_llm.py:261] Added request cmpl-35180304be0d4c52af4efa3d7002309f-0.
INFO 03-01 23:44:32 [logger.py:42] Received request cmpl-aaf35cdd2d8e4bf496db0e641bdb365c-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:44:32 [async_llm.py:261] Added request cmpl-aaf35cdd2d8e4bf496db0e641bdb365c-0.
INFO 03-01 23:44:33 [logger.py:42] Received request cmpl-bc6f76ee382243dd845792a3a6c37a84-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:44:33 [async_llm.py:261] Added request cmpl-bc6f76ee382243dd845792a3a6c37a84-0.
INFO 03-01 23:44:34 [logger.py:42] Received request cmpl-419650f17e264b8ea302ef8541756fe8-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:44:34 [async_llm.py:261] Added request cmpl-419650f17e264b8ea302ef8541756fe8-0.
INFO 03-01 23:44:36 [logger.py:42] Received request cmpl-4c4e2d5e08cd45e8b6b922764dd7d833-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:44:36 [async_llm.py:261] Added request cmpl-4c4e2d5e08cd45e8b6b922764dd7d833-0.
INFO 03-01 23:44:37 [logger.py:42] Received request cmpl-ec4b3615ca7f4f8cbd6ddde1be852214-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:44:37 [async_llm.py:261] Added request cmpl-ec4b3615ca7f4f8cbd6ddde1be852214-0.
INFO 03-01 23:44:38 [loggers.py:116] Engine 000: Avg prompt throughput: 27.9 tokens/s, Avg generation throughput: 4.5 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.5%
INFO 03-01 23:44:38 [logger.py:42] Received request cmpl-4377d43db84841329ef7c0859e0bb8e0-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:44:38 [async_llm.py:261] Added request cmpl-4377d43db84841329ef7c0859e0bb8e0-0.
INFO 03-01 23:44:39 [logger.py:42] Received request cmpl-3cbf4a43fb73450fa832b0261c62370f-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:44:39 [async_llm.py:261] Added request cmpl-3cbf4a43fb73450fa832b0261c62370f-0.
INFO 03-01 23:44:40 [logger.py:42] Received request cmpl-5d4dc881b6334c83b4908a20a34dd8ce-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:44:40 [async_llm.py:261] Added request cmpl-5d4dc881b6334c83b4908a20a34dd8ce-0.
INFO 03-01 23:44:41 [logger.py:42] Received request cmpl-b90ae3ae866846259b200c12ac8182f8-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:44:41 [async_llm.py:261] Added request cmpl-b90ae3ae866846259b200c12ac8182f8-0.
INFO 03-01 23:44:43 [logger.py:42] Received request cmpl-275cbdff52484ae3969923df3511135a-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:44:43 [async_llm.py:261] Added request cmpl-275cbdff52484ae3969923df3511135a-0.
INFO 03-01 23:44:44 [logger.py:42] Received request cmpl-31b38dcdde3b449ab782e88ec60c50d7-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:44:44 [async_llm.py:261] Added request cmpl-31b38dcdde3b449ab782e88ec60c50d7-0.
INFO 03-01 23:44:45 [logger.py:42] Received request cmpl-6349862910f74e6fbd714806f5cde66c-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:44:45 [async_llm.py:261] Added request cmpl-6349862910f74e6fbd714806f5cde66c-0.
INFO 03-01 23:44:46 [logger.py:42] Received request cmpl-451de749530b44499fea5b197ea1a15c-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:44:46 [async_llm.py:261] Added request cmpl-451de749530b44499fea5b197ea1a15c-0.
INFO 03-01 23:44:47 [logger.py:42] Received request cmpl-a84e90f8f0854b53bfae5b518eb3c384-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:44:47 [async_llm.py:261] Added request cmpl-a84e90f8f0854b53bfae5b518eb3c384-0.
INFO 03-01 23:44:48 [loggers.py:116] Engine 000: Avg prompt throughput: 27.9 tokens/s, Avg generation throughput: 4.5 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.5%
INFO 03-01 23:44:48 [logger.py:42] Received request cmpl-e5b1e41794de479e84285d3ac783a1ba-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:44:48 [async_llm.py:261] Added request cmpl-e5b1e41794de479e84285d3ac783a1ba-0.
INFO 03-01 23:44:49 [logger.py:42] Received request cmpl-7ec6e5eadf624137b23363ea283aee31-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:44:49 [async_llm.py:261] Added request cmpl-7ec6e5eadf624137b23363ea283aee31-0.
INFO 03-01 23:44:51 [logger.py:42] Received request cmpl-724a4c81793647d490f3bd11b63efbd4-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:44:51 [async_llm.py:261] Added request cmpl-724a4c81793647d490f3bd11b63efbd4-0.
INFO 03-01 23:44:52 [logger.py:42] Received request cmpl-1e4a0824b56c4d3482cce9191c1124ff-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:44:52 [async_llm.py:261] Added request cmpl-1e4a0824b56c4d3482cce9191c1124ff-0.
INFO 03-01 23:44:53 [logger.py:42] Received request cmpl-15b441ebc4f14e7ba53ce913a08c0109-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:44:53 [async_llm.py:261] Added request cmpl-15b441ebc4f14e7ba53ce913a08c0109-0.
INFO 03-01 23:44:54 [logger.py:42] Received request cmpl-06f6c2c6574f47839e03762c469a9299-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:44:54 [async_llm.py:261] Added request cmpl-06f6c2c6574f47839e03762c469a9299-0.
INFO 03-01 23:44:55 [logger.py:42] Received request cmpl-4e5e7ce99930478788ab8749c2886c91-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:44:55 [async_llm.py:261] Added request cmpl-4e5e7ce99930478788ab8749c2886c91-0.
INFO 03-01 23:44:56 [logger.py:42] Received request cmpl-270642482e484a54a522d8455bc2dbb5-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:44:56 [async_llm.py:261] Added request cmpl-270642482e484a54a522d8455bc2dbb5-0.
INFO 03-01 23:44:58 [loggers.py:116] Engine 000: Avg prompt throughput: 24.8 tokens/s, Avg generation throughput: 4.0 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.5%
INFO 03-01 23:44:58 [logger.py:42] Received request cmpl-ddca4f12255d48ffb773524838aee506-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:44:58 [async_llm.py:261] Added request cmpl-ddca4f12255d48ffb773524838aee506-0.
INFO 03-01 23:44:59 [logger.py:42] Received request cmpl-2a92f3b769234fc28ade2c24656b204f-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:44:59 [async_llm.py:261] Added request cmpl-2a92f3b769234fc28ade2c24656b204f-0.
INFO 03-01 23:45:00 [logger.py:42] Received request cmpl-f13b5199c6864733a6048b9426da6150-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:45:00 [async_llm.py:261] Added request cmpl-f13b5199c6864733a6048b9426da6150-0.
INFO 03-01 23:45:01 [logger.py:42] Received request cmpl-5d3c18bcfc7f4cbab42d14ff7d54ee4b-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:45:01 [async_llm.py:261] Added request cmpl-5d3c18bcfc7f4cbab42d14ff7d54ee4b-0.
INFO 03-01 23:45:02 [logger.py:42] Received request cmpl-ade853c9cf504cc5843f3000ccc2048d-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:45:02 [async_llm.py:261] Added request cmpl-ade853c9cf504cc5843f3000ccc2048d-0.
INFO 03-01 23:45:03 [logger.py:42] Received request cmpl-e06d9d5aaeb24ae7a5515c9d2ace2f94-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:45:03 [async_llm.py:261] Added request cmpl-e06d9d5aaeb24ae7a5515c9d2ace2f94-0.
INFO 03-01 23:45:05 [logger.py:42] Received request cmpl-40ac5fa1698c4dccb6774bc5cc748df1-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:45:05 [async_llm.py:261] Added request cmpl-40ac5fa1698c4dccb6774bc5cc748df1-0.
INFO 03-01 23:45:06 [logger.py:42] Received request cmpl-03a6f16d78924059a39b343fb34ccf97-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:45:06 [async_llm.py:261] Added request cmpl-03a6f16d78924059a39b343fb34ccf97-0.
INFO 03-01 23:45:07 [logger.py:42] Received request cmpl-45f50162e4b94a69b2482ed76192f99e-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:45:07 [async_llm.py:261] Added request cmpl-45f50162e4b94a69b2482ed76192f99e-0.
INFO 03-01 23:45:08 [loggers.py:116] Engine 000: Avg prompt throughput: 27.9 tokens/s, Avg generation throughput: 4.5 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.5%
INFO 03-01 23:45:08 [logger.py:42] Received request cmpl-d4296fb6a022447fa5a81c0413059de0-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:45:08 [async_llm.py:261] Added request cmpl-d4296fb6a022447fa5a81c0413059de0-0.
INFO 03-01 23:45:09 [logger.py:42] Received request cmpl-a65d3908ff474374bb9221bd2ccb9a95-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:45:09 [async_llm.py:261] Added request cmpl-a65d3908ff474374bb9221bd2ccb9a95-0.
INFO 03-01 23:45:10 [logger.py:42] Received request cmpl-4e44f00daca44ebfa4ec58e983611957-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:45:10 [async_llm.py:261] Added request cmpl-4e44f00daca44ebfa4ec58e983611957-0.
INFO 03-01 23:45:11 [logger.py:42] Received request cmpl-ea0dcbd0c82d4e839fea0abe0628b752-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:45:11 [async_llm.py:261] Added request cmpl-ea0dcbd0c82d4e839fea0abe0628b752-0.
INFO 03-01 23:45:13 [logger.py:42] Received request cmpl-368b0582cdf64589be7a25a67fa25e07-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:45:13 [async_llm.py:261] Added request cmpl-368b0582cdf64589be7a25a67fa25e07-0.
INFO 03-01 23:45:14 [logger.py:42] Received request cmpl-5d10b6c19e994997945f03b28af2db5f-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:45:14 [async_llm.py:261] Added request cmpl-5d10b6c19e994997945f03b28af2db5f-0.
INFO 03-01 23:45:15 [logger.py:42] Received request cmpl-92ebfd89f1c7433d9daf4f7e754ec6d7-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:45:15 [async_llm.py:261] Added request cmpl-92ebfd89f1c7433d9daf4f7e754ec6d7-0.
INFO 03-01 23:45:16 [logger.py:42] Received request cmpl-57241f6f84e14ee3b0c42be8b1561545-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:45:16 [async_llm.py:261] Added request cmpl-57241f6f84e14ee3b0c42be8b1561545-0.
INFO 03-01 23:45:17 [logger.py:42] Received request cmpl-017925a26e3840a78d3dfa5d776eae9a-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:45:17 [async_llm.py:261] Added request cmpl-017925a26e3840a78d3dfa5d776eae9a-0.
INFO 03-01 23:45:18 [loggers.py:116] Engine 000: Avg prompt throughput: 27.9 tokens/s, Avg generation throughput: 4.5 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.5%
INFO 03-01 23:45:18 [logger.py:42] Received request cmpl-a8010e78335944398414c10fca712d86-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:45:18 [async_llm.py:261] Added request cmpl-a8010e78335944398414c10fca712d86-0.
INFO 03-01 23:45:20 [logger.py:42] Received request cmpl-8203b27998404026b7b0c05631bff849-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:45:20 [async_llm.py:261] Added request cmpl-8203b27998404026b7b0c05631bff849-0.
INFO 03-01 23:45:21 [logger.py:42] Received request cmpl-446cbe9d6ff64ecca139ec7da967f4ca-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:45:21 [async_llm.py:261] Added request cmpl-446cbe9d6ff64ecca139ec7da967f4ca-0.
INFO 03-01 23:45:22 [logger.py:42] Received request cmpl-bc4c9eb121054875a64e586513a2fc33-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:45:22 [async_llm.py:261] Added request cmpl-bc4c9eb121054875a64e586513a2fc33-0.
INFO 03-01 23:45:23 [logger.py:42] Received request cmpl-d2d1aabc79114694997bbec141481a18-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:45:23 [async_llm.py:261] Added request cmpl-d2d1aabc79114694997bbec141481a18-0.
INFO 03-01 23:45:24 [logger.py:42] Received request cmpl-48d16e7a99704f5586e215954e6fd08a-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:45:24 [async_llm.py:261] Added request cmpl-48d16e7a99704f5586e215954e6fd08a-0.
INFO 03-01 23:45:25 [logger.py:42] Received request cmpl-b42b78d64bad40728d656a6110043e21-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:45:25 [async_llm.py:261] Added request cmpl-b42b78d64bad40728d656a6110043e21-0.
INFO 03-01 23:45:26 [logger.py:42] Received request cmpl-f3825316363544caad7fed410d01803e-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:45:26 [async_llm.py:261] Added request cmpl-f3825316363544caad7fed410d01803e-0.
INFO 03-01 23:45:28 [loggers.py:116] Engine 000: Avg prompt throughput: 24.8 tokens/s, Avg generation throughput: 4.0 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.5%
INFO 03-01 23:45:28 [logger.py:42] Received request cmpl-106c494c3f2147f18916aaea41f770b5-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:45:28 [async_llm.py:261] Added request cmpl-106c494c3f2147f18916aaea41f770b5-0.
INFO 03-01 23:45:29 [logger.py:42] Received request cmpl-4e012febd6d245c49cd94efaa8f6786a-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:45:29 [async_llm.py:261] Added request cmpl-4e012febd6d245c49cd94efaa8f6786a-0.
INFO 03-01 23:45:30 [logger.py:42] Received request cmpl-5ea5510baefc4409b037783e5b8960c2-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:45:30 [async_llm.py:261] Added request cmpl-5ea5510baefc4409b037783e5b8960c2-0.
INFO 03-01 23:45:31 [logger.py:42] Received request cmpl-b6295c23cec746d492e1bda7308eae4d-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:45:31 [async_llm.py:261] Added request cmpl-b6295c23cec746d492e1bda7308eae4d-0.
INFO 03-01 23:45:32 [logger.py:42] Received request cmpl-dd939f75fd404995b474dfa8854f5407-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:45:32 [async_llm.py:261] Added request cmpl-dd939f75fd404995b474dfa8854f5407-0.
INFO 03-01 23:45:33 [logger.py:42] Received request cmpl-00ce64700c5547c3862855f47978efe0-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:45:33 [async_llm.py:261] Added request cmpl-00ce64700c5547c3862855f47978efe0-0.
INFO 03-01 23:45:35 [logger.py:42] Received request cmpl-3af9a9a9a1ac4b6ba6ec9f4805029956-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:45:35 [async_llm.py:261] Added request cmpl-3af9a9a9a1ac4b6ba6ec9f4805029956-0.
INFO 03-01 23:45:36 [logger.py:42] Received request cmpl-79bba23274dd4453aa56f36327392542-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:45:36 [async_llm.py:261] Added request cmpl-79bba23274dd4453aa56f36327392542-0.
INFO 03-01 23:45:37 [logger.py:42] Received request cmpl-32e057f76bb34c27a29d4cc736236088-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:45:37 [async_llm.py:261] Added request cmpl-32e057f76bb34c27a29d4cc736236088-0.
INFO 03-01 23:45:38 [loggers.py:116] Engine 000: Avg prompt throughput: 27.9 tokens/s, Avg generation throughput: 4.5 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.5%
INFO 03-01 23:45:38 [logger.py:42] Received request cmpl-99b00354ac224aefbf77eaad189ae9fa-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:45:38 [async_llm.py:261] Added request cmpl-99b00354ac224aefbf77eaad189ae9fa-0.
INFO 03-01 23:45:39 [logger.py:42] Received request cmpl-6b7497e78f1f4cd583d84c7d24fad384-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:45:39 [async_llm.py:261] Added request cmpl-6b7497e78f1f4cd583d84c7d24fad384-0.
INFO 03-01 23:45:40 [logger.py:42] Received request cmpl-c54a60329b3a4bbfb2247d6f84460345-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:45:40 [async_llm.py:261] Added request cmpl-c54a60329b3a4bbfb2247d6f84460345-0.
INFO 03-01 23:45:41 [logger.py:42] Received request cmpl-5c5efe40a7714237ad1089f57add7788-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:45:41 [async_llm.py:261] Added request cmpl-5c5efe40a7714237ad1089f57add7788-0.
INFO 03-01 23:45:43 [logger.py:42] Received request cmpl-65f2413e9ff741a389b1d51d079feeed-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:45:43 [async_llm.py:261] Added request cmpl-65f2413e9ff741a389b1d51d079feeed-0.
INFO 03-01 23:45:44 [logger.py:42] Received request cmpl-a9a8e9ef84e349d6b2e2802f75ddc60b-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:45:44 [async_llm.py:261] Added request cmpl-a9a8e9ef84e349d6b2e2802f75ddc60b-0.
INFO 03-01 23:45:45 [logger.py:42] Received request cmpl-08f4a36f301545aaad7f9e50873e6c78-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:45:45 [async_llm.py:261] Added request cmpl-08f4a36f301545aaad7f9e50873e6c78-0.
INFO 03-01 23:45:46 [logger.py:42] Received request cmpl-84893b41fb5a48fea45db5e77f0040a0-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:45:46 [async_llm.py:261] Added request cmpl-84893b41fb5a48fea45db5e77f0040a0-0.
INFO 03-01 23:45:47 [logger.py:42] Received request cmpl-77cd2fc4be9b47b59d85a3bd844b008a-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:45:47 [async_llm.py:261] Added request cmpl-77cd2fc4be9b47b59d85a3bd844b008a-0.
INFO 03-01 23:45:48 [loggers.py:116] Engine 000: Avg prompt throughput: 27.9 tokens/s, Avg generation throughput: 4.5 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.5%
INFO 03-01 23:45:48 [logger.py:42] Received request cmpl-76818b1fb01142cdaecda1c13a472275-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:45:48 [async_llm.py:261] Added request cmpl-76818b1fb01142cdaecda1c13a472275-0.
INFO 03-01 23:45:50 [logger.py:42] Received request cmpl-8b798c629d204694af100b85332bc3e1-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:45:50 [async_llm.py:261] Added request cmpl-8b798c629d204694af100b85332bc3e1-0.
INFO 03-01 23:45:51 [logger.py:42] Received request cmpl-aae615613d3d412da3345cc97781d22f-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:45:51 [async_llm.py:261] Added request cmpl-aae615613d3d412da3345cc97781d22f-0.
INFO 03-01 23:45:52 [logger.py:42] Received request cmpl-caf4d8db3fee4c04b9add273cf39341c-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:45:52 [async_llm.py:261] Added request cmpl-caf4d8db3fee4c04b9add273cf39341c-0.
INFO 03-01 23:45:53 [logger.py:42] Received request cmpl-919bed8591ab4cf29fd16706a06a0e2f-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:45:53 [async_llm.py:261] Added request cmpl-919bed8591ab4cf29fd16706a06a0e2f-0.
INFO 03-01 23:45:54 [logger.py:42] Received request cmpl-d712e9d1736c44bca13800de1aacfba9-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:45:54 [async_llm.py:261] Added request cmpl-d712e9d1736c44bca13800de1aacfba9-0.
INFO 03-01 23:45:55 [logger.py:42] Received request cmpl-91118d1b56d047189f4091279558ab0c-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:45:55 [async_llm.py:261] Added request cmpl-91118d1b56d047189f4091279558ab0c-0.
INFO 03-01 23:45:57 [logger.py:42] Received request cmpl-1a35877aac0449cc817a4fd36f91cede-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:45:57 [async_llm.py:261] Added request cmpl-1a35877aac0449cc817a4fd36f91cede-0.
INFO 03-01 23:45:58 [loggers.py:116] Engine 000: Avg prompt throughput: 24.8 tokens/s, Avg generation throughput: 4.0 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.5%
INFO 03-01 23:45:58 [logger.py:42] Received request cmpl-f00bbee3951348f7977ee2a295534494-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:45:58 [async_llm.py:261] Added request cmpl-f00bbee3951348f7977ee2a295534494-0.
INFO 03-01 23:45:59 [logger.py:42] Received request cmpl-2307b2c7203446eaaf71aa1538aa9eb0-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:45:59 [async_llm.py:261] Added request cmpl-2307b2c7203446eaaf71aa1538aa9eb0-0.
INFO 03-01 23:46:00 [logger.py:42] Received request cmpl-0521627855f84e818eb6d273ffe3bf90-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:46:00 [async_llm.py:261] Added request cmpl-0521627855f84e818eb6d273ffe3bf90-0.
INFO 03-01 23:46:01 [logger.py:42] Received request cmpl-fccc1b6cc7394293913555fe90891cf5-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:46:01 [async_llm.py:261] Added request cmpl-fccc1b6cc7394293913555fe90891cf5-0.
INFO 03-01 23:46:02 [logger.py:42] Received request cmpl-342d9d47d31345508255da5b168c36e5-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:46:02 [async_llm.py:261] Added request cmpl-342d9d47d31345508255da5b168c36e5-0.
INFO 03-01 23:46:03 [logger.py:42] Received request cmpl-9a485bb9d1d7471aab08f26de8d143cc-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:46:03 [async_llm.py:261] Added request cmpl-9a485bb9d1d7471aab08f26de8d143cc-0.
INFO 03-01 23:46:05 [logger.py:42] Received request cmpl-ef8a0d17d98b4cd4854a9f4ec10a3daf-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:46:05 [async_llm.py:261] Added request cmpl-ef8a0d17d98b4cd4854a9f4ec10a3daf-0.
INFO 03-01 23:46:06 [logger.py:42] Received request cmpl-304e32ec97a14e56b492fbccf39c59bc-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:46:06 [async_llm.py:261] Added request cmpl-304e32ec97a14e56b492fbccf39c59bc-0.
INFO 03-01 23:46:07 [logger.py:42] Received request cmpl-4305690172834a89af9df39f2ee76c7b-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:46:07 [async_llm.py:261] Added request cmpl-4305690172834a89af9df39f2ee76c7b-0.
INFO 03-01 23:46:08 [loggers.py:116] Engine 000: Avg prompt throughput: 27.9 tokens/s, Avg generation throughput: 4.5 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.5%
INFO 03-01 23:46:08 [logger.py:42] Received request cmpl-adda18943d274cce96f030998ca9ef63-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:46:08 [async_llm.py:261] Added request cmpl-adda18943d274cce96f030998ca9ef63-0.
INFO 03-01 23:46:09 [logger.py:42] Received request cmpl-0c0bc25dc1f54f6d9ebf777949eb7760-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:46:09 [async_llm.py:261] Added request cmpl-0c0bc25dc1f54f6d9ebf777949eb7760-0.
INFO 03-01 23:46:10 [logger.py:42] Received request cmpl-4401a77fadc44454913eb938f2a8923d-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:46:10 [async_llm.py:261] Added request cmpl-4401a77fadc44454913eb938f2a8923d-0.
INFO 03-01 23:46:12 [logger.py:42] Received request cmpl-df2c287875ae4ebb879f1661015a0f87-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:46:12 [async_llm.py:261] Added request cmpl-df2c287875ae4ebb879f1661015a0f87-0.
INFO 03-01 23:46:13 [logger.py:42] Received request cmpl-d4899989b84048aa8aae293703bfce86-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:46:13 [async_llm.py:261] Added request cmpl-d4899989b84048aa8aae293703bfce86-0.
INFO 03-01 23:46:14 [logger.py:42] Received request cmpl-bef2fc03ab944de5a4472489e5be7153-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:46:14 [async_llm.py:261] Added request cmpl-bef2fc03ab944de5a4472489e5be7153-0.
INFO 03-01 23:46:15 [logger.py:42] Received request cmpl-c47d3aacc6bf4fb09587ca5659f9bc87-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:46:15 [async_llm.py:261] Added request cmpl-c47d3aacc6bf4fb09587ca5659f9bc87-0.
INFO 03-01 23:46:16 [logger.py:42] Received request cmpl-5bfb0193e68941bbaee622728fd7523c-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:46:16 [async_llm.py:261] Added request cmpl-5bfb0193e68941bbaee622728fd7523c-0.
INFO 03-01 23:46:17 [logger.py:42] Received request cmpl-f8d21de0fd33426cb98f40037edb2ea4-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:46:17 [async_llm.py:261] Added request cmpl-f8d21de0fd33426cb98f40037edb2ea4-0.
INFO 03-01 23:46:18 [loggers.py:116] Engine 000: Avg prompt throughput: 27.9 tokens/s, Avg generation throughput: 4.5 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.5%
INFO 03-01 23:46:18 [logger.py:42] Received request cmpl-a5159cba146b4b359bd12b324a670e12-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:46:18 [async_llm.py:261] Added request cmpl-a5159cba146b4b359bd12b324a670e12-0.
INFO 03-01 23:46:20 [logger.py:42] Received request cmpl-2e5ed9346f9e45489f4e26edaf0374b9-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:46:20 [async_llm.py:261] Added request cmpl-2e5ed9346f9e45489f4e26edaf0374b9-0.
INFO 03-01 23:46:21 [logger.py:42] Received request cmpl-23628660262f4000b665be04d9cbc69f-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:46:21 [async_llm.py:261] Added request cmpl-23628660262f4000b665be04d9cbc69f-0.
INFO 03-01 23:46:22 [logger.py:42] Received request cmpl-40a8400d3c534f02a636dd4286ece5b3-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:46:22 [async_llm.py:261] Added request cmpl-40a8400d3c534f02a636dd4286ece5b3-0.
INFO 03-01 23:46:23 [logger.py:42] Received request cmpl-e2aa06e3034b4b1b916d76971921f79b-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:46:23 [async_llm.py:261] Added request cmpl-e2aa06e3034b4b1b916d76971921f79b-0.
INFO 03-01 23:46:24 [logger.py:42] Received request cmpl-0b02b930cde6465e82eab9e7ff5de08a-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:46:24 [async_llm.py:261] Added request cmpl-0b02b930cde6465e82eab9e7ff5de08a-0.
INFO 03-01 23:46:25 [logger.py:42] Received request cmpl-ce394b44ea3249ad9d0b2922c8204c21-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:46:25 [async_llm.py:261] Added request cmpl-ce394b44ea3249ad9d0b2922c8204c21-0.
INFO 03-01 23:46:27 [logger.py:42] Received request cmpl-3ab40b0e911c410ba6b4ef0298583b6a-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:46:27 [async_llm.py:261] Added request cmpl-3ab40b0e911c410ba6b4ef0298583b6a-0.
INFO 03-01 23:46:28 [loggers.py:116] Engine 000: Avg prompt throughput: 24.8 tokens/s, Avg generation throughput: 4.0 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.5%
INFO 03-01 23:46:28 [logger.py:42] Received request cmpl-b4a916c446f64bc486a7589e8b5a19d6-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:46:28 [async_llm.py:261] Added request cmpl-b4a916c446f64bc486a7589e8b5a19d6-0.
INFO 03-01 23:46:29 [logger.py:42] Received request cmpl-46c8adb4cbb24617b82137267c316095-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:46:29 [async_llm.py:261] Added request cmpl-46c8adb4cbb24617b82137267c316095-0.
INFO 03-01 23:46:30 [logger.py:42] Received request cmpl-999e9efff27343e698f737dd99f4c933-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:46:30 [async_llm.py:261] Added request cmpl-999e9efff27343e698f737dd99f4c933-0.
INFO 03-01 23:46:31 [logger.py:42] Received request cmpl-47c8d802886b457287c8b1b966210e7a-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:46:31 [async_llm.py:261] Added request cmpl-47c8d802886b457287c8b1b966210e7a-0.
INFO 03-01 23:46:32 [logger.py:42] Received request cmpl-e5905bf539b240cab90efaf90392d7f2-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:46:32 [async_llm.py:261] Added request cmpl-e5905bf539b240cab90efaf90392d7f2-0.
INFO 03-01 23:46:33 [logger.py:42] Received request cmpl-13af36be68174911b870fb623ff5206a-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:46:33 [async_llm.py:261] Added request cmpl-13af36be68174911b870fb623ff5206a-0.
INFO 03-01 23:46:35 [logger.py:42] Received request cmpl-201185269de341f38c9983f514a92f7c-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:46:35 [async_llm.py:261] Added request cmpl-201185269de341f38c9983f514a92f7c-0.
INFO 03-01 23:46:36 [logger.py:42] Received request cmpl-b0eb0c6790f44612b1196e9233c41bf0-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:46:36 [async_llm.py:261] Added request cmpl-b0eb0c6790f44612b1196e9233c41bf0-0.
INFO 03-01 23:46:37 [logger.py:42] Received request cmpl-c4c171a97b544274aaca99758866fa8c-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:46:37 [async_llm.py:261] Added request cmpl-c4c171a97b544274aaca99758866fa8c-0.
INFO 03-01 23:46:38 [loggers.py:116] Engine 000: Avg prompt throughput: 27.9 tokens/s, Avg generation throughput: 4.5 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.5%
INFO 03-01 23:46:38 [logger.py:42] Received request cmpl-69e16f28a9af4f19afe47e500a009ba3-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:46:38 [async_llm.py:261] Added request cmpl-69e16f28a9af4f19afe47e500a009ba3-0.
INFO 03-01 23:46:39 [logger.py:42] Received request cmpl-7eca0e90c1cc4c5eb5e4462d4cd1162d-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:46:39 [async_llm.py:261] Added request cmpl-7eca0e90c1cc4c5eb5e4462d4cd1162d-0.
INFO 03-01 23:46:40 [logger.py:42] Received request cmpl-706c0d27a89049b49490869a8f488f3a-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:46:40 [async_llm.py:261] Added request cmpl-706c0d27a89049b49490869a8f488f3a-0.
INFO 03-01 23:46:42 [logger.py:42] Received request cmpl-038bf3eb529640ac9635dd4f33dc7959-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:46:42 [async_llm.py:261] Added request cmpl-038bf3eb529640ac9635dd4f33dc7959-0.
INFO 03-01 23:46:43 [logger.py:42] Received request cmpl-37c47cce4e594528a780d5d1c6691cb9-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:46:43 [async_llm.py:261] Added request cmpl-37c47cce4e594528a780d5d1c6691cb9-0.
INFO 03-01 23:46:44 [logger.py:42] Received request cmpl-68371e7865bb428096d0e1e30ac81aac-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:46:44 [async_llm.py:261] Added request cmpl-68371e7865bb428096d0e1e30ac81aac-0.
INFO 03-01 23:46:45 [logger.py:42] Received request cmpl-e2f46418c400403dbd9a01ef8d36811d-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:46:45 [async_llm.py:261] Added request cmpl-e2f46418c400403dbd9a01ef8d36811d-0.
INFO 03-01 23:46:46 [logger.py:42] Received request cmpl-ab990eef0c6b47dfa2857fea51b02008-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:46:46 [async_llm.py:261] Added request cmpl-ab990eef0c6b47dfa2857fea51b02008-0.
INFO 03-01 23:46:47 [logger.py:42] Received request cmpl-71ca28bb3622483db0a61576ea61fbbc-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:46:47 [async_llm.py:261] Added request cmpl-71ca28bb3622483db0a61576ea61fbbc-0.
INFO 03-01 23:46:48 [loggers.py:116] Engine 000: Avg prompt throughput: 27.9 tokens/s, Avg generation throughput: 4.5 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.5%
INFO 03-01 23:46:48 [logger.py:42] Received request cmpl-21c67bbd9bc442c0b7188a29d12556d2-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:46:48 [async_llm.py:261] Added request cmpl-21c67bbd9bc442c0b7188a29d12556d2-0.
INFO 03-01 23:46:50 [logger.py:42] Received request cmpl-02ed63c81a6a4fa493765ba45129053a-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:46:50 [async_llm.py:261] Added request cmpl-02ed63c81a6a4fa493765ba45129053a-0.
INFO 03-01 23:46:51 [logger.py:42] Received request cmpl-dfcd2b4411c8451e80a416f50ab08cd9-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:46:51 [async_llm.py:261] Added request cmpl-dfcd2b4411c8451e80a416f50ab08cd9-0.
INFO 03-01 23:46:52 [logger.py:42] Received request cmpl-89977fb9367c496b852af2d2bb590dfb-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:46:52 [async_llm.py:261] Added request cmpl-89977fb9367c496b852af2d2bb590dfb-0.
INFO 03-01 23:46:53 [logger.py:42] Received request cmpl-ba5627af1de64f0f8e322a02fbc99f60-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:46:53 [async_llm.py:261] Added request cmpl-ba5627af1de64f0f8e322a02fbc99f60-0.
INFO 03-01 23:46:54 [logger.py:42] Received request cmpl-4ec0ec353d6743fb9c54ea10548b1846-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:46:54 [async_llm.py:261] Added request cmpl-4ec0ec353d6743fb9c54ea10548b1846-0.
INFO 03-01 23:46:55 [logger.py:42] Received request cmpl-9fa75d05072943b1a294ba479e76c0be-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:46:55 [async_llm.py:261] Added request cmpl-9fa75d05072943b1a294ba479e76c0be-0.
INFO 03-01 23:46:57 [logger.py:42] Received request cmpl-7dda227798ae45a29c11c370f1457ff8-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:46:57 [async_llm.py:261] Added request cmpl-7dda227798ae45a29c11c370f1457ff8-0.
INFO 03-01 23:46:58 [loggers.py:116] Engine 000: Avg prompt throughput: 24.8 tokens/s, Avg generation throughput: 4.0 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.5%
INFO 03-01 23:46:58 [logger.py:42] Received request cmpl-3cbc7273c2ae41438cabeb30f07695e4-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:46:58 [async_llm.py:261] Added request cmpl-3cbc7273c2ae41438cabeb30f07695e4-0.
INFO 03-01 23:46:59 [logger.py:42] Received request cmpl-16fb7bd3c87d49fe9c27e65a7a81fc83-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:46:59 [async_llm.py:261] Added request cmpl-16fb7bd3c87d49fe9c27e65a7a81fc83-0.
INFO 03-01 23:47:00 [logger.py:42] Received request cmpl-4ea16c6fdaec467c9b15cd51371e28ca-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:47:00 [async_llm.py:261] Added request cmpl-4ea16c6fdaec467c9b15cd51371e28ca-0.
INFO 03-01 23:47:01 [logger.py:42] Received request cmpl-abdb7382acf44edd83e9b94ebd22f7d3-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:47:01 [async_llm.py:261] Added request cmpl-abdb7382acf44edd83e9b94ebd22f7d3-0.
INFO 03-01 23:47:02 [logger.py:42] Received request cmpl-0e305b708f134f0aa3308bbfb839bc1a-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:47:02 [async_llm.py:261] Added request cmpl-0e305b708f134f0aa3308bbfb839bc1a-0.
INFO 03-01 23:47:03 [logger.py:42] Received request cmpl-4fe7f70a25ec4b839bdc9cb77963b90e-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:47:03 [async_llm.py:261] Added request cmpl-4fe7f70a25ec4b839bdc9cb77963b90e-0.
INFO 03-01 23:47:05 [logger.py:42] Received request cmpl-6c5e6651d72647fbad79067dae6d32e8-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:47:05 [async_llm.py:261] Added request cmpl-6c5e6651d72647fbad79067dae6d32e8-0.
INFO 03-01 23:47:06 [logger.py:42] Received request cmpl-945d7cc9183a40aab87a72bf35dc5dec-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:47:06 [async_llm.py:261] Added request cmpl-945d7cc9183a40aab87a72bf35dc5dec-0.
INFO 03-01 23:47:07 [logger.py:42] Received request cmpl-c239637638b54a28a47481a4ac67cc22-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:47:07 [async_llm.py:261] Added request cmpl-c239637638b54a28a47481a4ac67cc22-0.
INFO 03-01 23:47:08 [loggers.py:116] Engine 000: Avg prompt throughput: 27.9 tokens/s, Avg generation throughput: 4.5 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.5%
INFO 03-01 23:47:08 [logger.py:42] Received request cmpl-371eeb683a6f480db9c07034430effa4-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:47:08 [async_llm.py:261] Added request cmpl-371eeb683a6f480db9c07034430effa4-0.
INFO 03-01 23:47:09 [logger.py:42] Received request cmpl-d7a7e56ae5a64ad89a71feae22696f32-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:47:09 [async_llm.py:261] Added request cmpl-d7a7e56ae5a64ad89a71feae22696f32-0.
INFO 03-01 23:47:10 [logger.py:42] Received request cmpl-59758058ec0442f9814b5b5488676901-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:47:10 [async_llm.py:261] Added request cmpl-59758058ec0442f9814b5b5488676901-0.
INFO 03-01 23:47:12 [logger.py:42] Received request cmpl-a860626b3bb4479c9d5008bae168b62d-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:47:12 [async_llm.py:261] Added request cmpl-a860626b3bb4479c9d5008bae168b62d-0.
INFO 03-01 23:47:13 [logger.py:42] Received request cmpl-6f5fccb6a67e410294a857cdbaeb231f-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:47:13 [async_llm.py:261] Added request cmpl-6f5fccb6a67e410294a857cdbaeb231f-0.
INFO 03-01 23:47:14 [logger.py:42] Received request cmpl-5fa7276f9ca94f2bb3349d1228bc7ed3-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:47:14 [async_llm.py:261] Added request cmpl-5fa7276f9ca94f2bb3349d1228bc7ed3-0.
INFO 03-01 23:47:15 [logger.py:42] Received request cmpl-c2f5737280c64652b64996f4889d73d2-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:47:15 [async_llm.py:261] Added request cmpl-c2f5737280c64652b64996f4889d73d2-0.
INFO 03-01 23:47:16 [logger.py:42] Received request cmpl-1aaa3e3b428f47c58a4fc3cf0f1fae77-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:47:16 [async_llm.py:261] Added request cmpl-1aaa3e3b428f47c58a4fc3cf0f1fae77-0.
INFO 03-01 23:47:17 [logger.py:42] Received request cmpl-2fdc3bda02a44f358f1f584966420ae0-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:47:17 [async_llm.py:261] Added request cmpl-2fdc3bda02a44f358f1f584966420ae0-0.
INFO 03-01 23:47:18 [loggers.py:116] Engine 000: Avg prompt throughput: 27.9 tokens/s, Avg generation throughput: 4.5 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.5%
INFO 03-01 23:47:18 [logger.py:42] Received request cmpl-6a91e56d7e4d4b91aff6f040a2cfa791-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:47:18 [async_llm.py:261] Added request cmpl-6a91e56d7e4d4b91aff6f040a2cfa791-0.
INFO 03-01 23:47:20 [logger.py:42] Received request cmpl-70e1ea00618d416b8751c3f144a09044-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:47:20 [async_llm.py:261] Added request cmpl-70e1ea00618d416b8751c3f144a09044-0.
INFO 03-01 23:47:21 [logger.py:42] Received request cmpl-f826afa30e454e75be8c0e5d16a8d641-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:47:21 [async_llm.py:261] Added request cmpl-f826afa30e454e75be8c0e5d16a8d641-0.
INFO 03-01 23:47:22 [logger.py:42] Received request cmpl-d76ce9e1d9be441aa1a5cb88e35bac39-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:47:22 [async_llm.py:261] Added request cmpl-d76ce9e1d9be441aa1a5cb88e35bac39-0.
INFO 03-01 23:47:23 [logger.py:42] Received request cmpl-176e5414be744497aa0d89dd4ecd00f0-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:47:23 [async_llm.py:261] Added request cmpl-176e5414be744497aa0d89dd4ecd00f0-0.
INFO 03-01 23:47:24 [logger.py:42] Received request cmpl-b98560931be74cbc9d41197046e4f98c-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:47:24 [async_llm.py:261] Added request cmpl-b98560931be74cbc9d41197046e4f98c-0.
INFO 03-01 23:47:25 [logger.py:42] Received request cmpl-451738db46df4f3fb304acb15ff9460b-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:47:25 [async_llm.py:261] Added request cmpl-451738db46df4f3fb304acb15ff9460b-0.
INFO 03-01 23:47:27 [logger.py:42] Received request cmpl-edd1a466850147afacbf79fc54ebc4a0-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:47:27 [async_llm.py:261] Added request cmpl-edd1a466850147afacbf79fc54ebc4a0-0.
INFO 03-01 23:47:28 [loggers.py:116] Engine 000: Avg prompt throughput: 24.8 tokens/s, Avg generation throughput: 4.0 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.5%
INFO 03-01 23:47:28 [logger.py:42] Received request cmpl-90625afe4e40443187fb0c8077658429-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:47:28 [async_llm.py:261] Added request cmpl-90625afe4e40443187fb0c8077658429-0.
INFO 03-01 23:47:29 [logger.py:42] Received request cmpl-b51039a24e234c2aa791b0940491b7d5-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:47:29 [async_llm.py:261] Added request cmpl-b51039a24e234c2aa791b0940491b7d5-0.
INFO 03-01 23:47:30 [logger.py:42] Received request cmpl-5968ca262f054681adb22dfaeb4f8760-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:47:30 [async_llm.py:261] Added request cmpl-5968ca262f054681adb22dfaeb4f8760-0.
INFO 03-01 23:47:31 [logger.py:42] Received request cmpl-75f5e3bab96343daaffc645070d1bf0f-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:47:31 [async_llm.py:261] Added request cmpl-75f5e3bab96343daaffc645070d1bf0f-0.
INFO 03-01 23:47:32 [logger.py:42] Received request cmpl-323197c05b4a48ada7262d1b06ce187e-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:47:32 [async_llm.py:261] Added request cmpl-323197c05b4a48ada7262d1b06ce187e-0.
INFO 03-01 23:47:34 [logger.py:42] Received request cmpl-a211ea476d3741e5bb6a5271b24f931d-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:47:34 [async_llm.py:261] Added request cmpl-a211ea476d3741e5bb6a5271b24f931d-0.
INFO 03-01 23:47:35 [logger.py:42] Received request cmpl-2bbd370f96a74b1b97b5b1d806896934-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:47:35 [async_llm.py:261] Added request cmpl-2bbd370f96a74b1b97b5b1d806896934-0.
INFO 03-01 23:47:36 [logger.py:42] Received request cmpl-f5f691a30a90449b854765f058c26915-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:47:36 [async_llm.py:261] Added request cmpl-f5f691a30a90449b854765f058c26915-0.
INFO 03-01 23:47:37 [logger.py:42] Received request cmpl-505eaf64cded4ce7a760644fd166c9b1-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:47:37 [async_llm.py:261] Added request cmpl-505eaf64cded4ce7a760644fd166c9b1-0.
INFO 03-01 23:47:38 [loggers.py:116] Engine 000: Avg prompt throughput: 27.9 tokens/s, Avg generation throughput: 4.5 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.5%
INFO 03-01 23:47:38 [logger.py:42] Received request cmpl-6861e88462ef439e855053e636a0c5bd-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:47:38 [async_llm.py:261] Added request cmpl-6861e88462ef439e855053e636a0c5bd-0.
INFO 03-01 23:47:39 [logger.py:42] Received request cmpl-e67d12acf1b447b2b4d21a0413990c96-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:47:39 [async_llm.py:261] Added request cmpl-e67d12acf1b447b2b4d21a0413990c96-0.
INFO 03-01 23:47:40 [logger.py:42] Received request cmpl-24840880b06144989c3dae8248e13966-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:47:40 [async_llm.py:261] Added request cmpl-24840880b06144989c3dae8248e13966-0.
INFO 03-01 23:47:42 [logger.py:42] Received request cmpl-e86d82b4e88e4427be78327c15b37213-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:47:42 [async_llm.py:261] Added request cmpl-e86d82b4e88e4427be78327c15b37213-0.
INFO 03-01 23:47:43 [logger.py:42] Received request cmpl-f9e5b37bf5e441b1aadc695d99aadc53-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:47:43 [async_llm.py:261] Added request cmpl-f9e5b37bf5e441b1aadc695d99aadc53-0.
INFO 03-01 23:47:44 [logger.py:42] Received request cmpl-ae22c4838f164c41b1e533ba0ad751d1-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:47:44 [async_llm.py:261] Added request cmpl-ae22c4838f164c41b1e533ba0ad751d1-0.
INFO 03-01 23:47:45 [logger.py:42] Received request cmpl-0453095f52da4dae919c83235d4c301c-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:47:45 [async_llm.py:261] Added request cmpl-0453095f52da4dae919c83235d4c301c-0.
INFO 03-01 23:47:46 [logger.py:42] Received request cmpl-d5c81043eaf546e8bef6b9867a685201-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:47:46 [async_llm.py:261] Added request cmpl-d5c81043eaf546e8bef6b9867a685201-0.
INFO 03-01 23:47:47 [logger.py:42] Received request cmpl-98e79c8cc22d467881e9672606571a9f-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:47:47 [async_llm.py:261] Added request cmpl-98e79c8cc22d467881e9672606571a9f-0.
INFO 03-01 23:47:48 [loggers.py:116] Engine 000: Avg prompt throughput: 27.9 tokens/s, Avg generation throughput: 4.5 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.5%
INFO 03-01 23:47:49 [logger.py:42] Received request cmpl-511a18dafeb74c6085394c5bf5ff45d4-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:47:49 [async_llm.py:261] Added request cmpl-511a18dafeb74c6085394c5bf5ff45d4-0.
INFO 03-01 23:47:50 [logger.py:42] Received request cmpl-dd251505061445c6a10c2785ca661e9d-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:47:50 [async_llm.py:261] Added request cmpl-dd251505061445c6a10c2785ca661e9d-0.
INFO 03-01 23:47:51 [logger.py:42] Received request cmpl-e83afc3408a845ca9be8dd4f13805352-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:47:51 [async_llm.py:261] Added request cmpl-e83afc3408a845ca9be8dd4f13805352-0.
INFO 03-01 23:47:52 [logger.py:42] Received request cmpl-ea06ed33aea44f86b344f3b931a061e4-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:47:52 [async_llm.py:261] Added request cmpl-ea06ed33aea44f86b344f3b931a061e4-0.
INFO 03-01 23:47:53 [logger.py:42] Received request cmpl-7b176d73deb34914969ec459da4790a4-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:47:53 [async_llm.py:261] Added request cmpl-7b176d73deb34914969ec459da4790a4-0.
INFO 03-01 23:47:54 [logger.py:42] Received request cmpl-761300252e8549839339b112b02924a0-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:47:54 [async_llm.py:261] Added request cmpl-761300252e8549839339b112b02924a0-0.
INFO 03-01 23:47:55 [logger.py:42] Received request cmpl-f778f3bf8a0b4f968b55521e66dc75cf-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:47:55 [async_llm.py:261] Added request cmpl-f778f3bf8a0b4f968b55521e66dc75cf-0.
INFO 03-01 23:47:57 [logger.py:42] Received request cmpl-ffa53cc95d9f4e75974698b7f99f796c-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:47:57 [async_llm.py:261] Added request cmpl-ffa53cc95d9f4e75974698b7f99f796c-0.
INFO 03-01 23:47:58 [loggers.py:116] Engine 000: Avg prompt throughput: 24.8 tokens/s, Avg generation throughput: 4.0 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.5%
INFO 03-01 23:47:58 [logger.py:42] Received request cmpl-f9157691a8054b38987382ad859a8352-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:47:58 [async_llm.py:261] Added request cmpl-f9157691a8054b38987382ad859a8352-0.
INFO 03-01 23:47:59 [logger.py:42] Received request cmpl-6164cf81f523481bb8e7614cbdcb29bc-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:47:59 [async_llm.py:261] Added request cmpl-6164cf81f523481bb8e7614cbdcb29bc-0.
INFO 03-01 23:48:00 [logger.py:42] Received request cmpl-465b26f42bd248d3a3bc253b4b0cac80-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:48:00 [async_llm.py:261] Added request cmpl-465b26f42bd248d3a3bc253b4b0cac80-0.
INFO 03-01 23:48:01 [logger.py:42] Received request cmpl-0c9b75435135447398908511eb45dcf1-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:48:01 [async_llm.py:261] Added request cmpl-0c9b75435135447398908511eb45dcf1-0.
INFO 03-01 23:48:02 [logger.py:42] Received request cmpl-ae4dba11a3834d51a234e0942465f580-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:48:02 [async_llm.py:261] Added request cmpl-ae4dba11a3834d51a234e0942465f580-0.
INFO 03-01 23:48:04 [logger.py:42] Received request cmpl-9ad7eee79c784945a27003de14a5a4aa-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:48:04 [async_llm.py:261] Added request cmpl-9ad7eee79c784945a27003de14a5a4aa-0.
INFO 03-01 23:48:05 [logger.py:42] Received request cmpl-a0ee9650367640a99e891bb0038d17b7-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:48:05 [async_llm.py:261] Added request cmpl-a0ee9650367640a99e891bb0038d17b7-0.
INFO 03-01 23:48:06 [logger.py:42] Received request cmpl-a5ad70df3d1348089e12fdf053f75522-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:48:06 [async_llm.py:261] Added request cmpl-a5ad70df3d1348089e12fdf053f75522-0.
INFO 03-01 23:48:07 [logger.py:42] Received request cmpl-232a0285e0a242c38c352bbd8fcfd2d2-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:48:07 [async_llm.py:261] Added request cmpl-232a0285e0a242c38c352bbd8fcfd2d2-0.
INFO 03-01 23:48:08 [loggers.py:116] Engine 000: Avg prompt throughput: 27.9 tokens/s, Avg generation throughput: 4.5 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.5%
INFO 03-01 23:48:08 [logger.py:42] Received request cmpl-5225e2a72e764c80ab5bcd6e2f63ff22-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:48:08 [async_llm.py:261] Added request cmpl-5225e2a72e764c80ab5bcd6e2f63ff22-0.
INFO 03-01 23:48:09 [logger.py:42] Received request cmpl-f4cd2bdd5d4143a19cd7357c7c1ec20e-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:48:09 [async_llm.py:261] Added request cmpl-f4cd2bdd5d4143a19cd7357c7c1ec20e-0.
INFO 03-01 23:48:10 [logger.py:42] Received request cmpl-5da0c036f2bf48f9b478bbc5b52be002-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:48:10 [async_llm.py:261] Added request cmpl-5da0c036f2bf48f9b478bbc5b52be002-0.
INFO 03-01 23:48:12 [logger.py:42] Received request cmpl-3b68f922ae0940c7808b51f4dd99a652-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:48:12 [async_llm.py:261] Added request cmpl-3b68f922ae0940c7808b51f4dd99a652-0.
INFO 03-01 23:48:13 [logger.py:42] Received request cmpl-fb273e49f168449ca5fc17d9bfb7a597-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:48:13 [async_llm.py:261] Added request cmpl-fb273e49f168449ca5fc17d9bfb7a597-0.
INFO 03-01 23:48:14 [logger.py:42] Received request cmpl-ef4fe017df014ed0bb7e5bfbed284b77-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:48:14 [async_llm.py:261] Added request cmpl-ef4fe017df014ed0bb7e5bfbed284b77-0.
INFO 03-01 23:48:15 [logger.py:42] Received request cmpl-6dbc785c00a4499c9a667d16ac034ea5-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:48:15 [async_llm.py:261] Added request cmpl-6dbc785c00a4499c9a667d16ac034ea5-0.
INFO 03-01 23:48:16 [logger.py:42] Received request cmpl-fd2bd190fa944694adf56d66fa125b7f-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:48:16 [async_llm.py:261] Added request cmpl-fd2bd190fa944694adf56d66fa125b7f-0.
INFO 03-01 23:48:17 [logger.py:42] Received request cmpl-0ee67fc354994e01a7a3f8d1b3dfb1e9-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:48:17 [async_llm.py:261] Added request cmpl-0ee67fc354994e01a7a3f8d1b3dfb1e9-0.
INFO 03-01 23:48:18 [loggers.py:116] Engine 000: Avg prompt throughput: 27.9 tokens/s, Avg generation throughput: 4.5 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.5%
INFO 03-01 23:48:18 [logger.py:42] Received request cmpl-669a041664c640aaa3176537c4313483-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:48:18 [async_llm.py:261] Added request cmpl-669a041664c640aaa3176537c4313483-0.
INFO 03-01 23:48:20 [logger.py:42] Received request cmpl-99aaee89d6ee4677ac0e0c5a68042197-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:48:20 [async_llm.py:261] Added request cmpl-99aaee89d6ee4677ac0e0c5a68042197-0.
INFO 03-01 23:48:21 [logger.py:42] Received request cmpl-d7161a0eb66e4191b24368723bdcdb66-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:48:21 [async_llm.py:261] Added request cmpl-d7161a0eb66e4191b24368723bdcdb66-0.
INFO 03-01 23:48:22 [logger.py:42] Received request cmpl-eb0c4d2f302a469099cbe895f4532b93-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:48:22 [async_llm.py:261] Added request cmpl-eb0c4d2f302a469099cbe895f4532b93-0.
INFO 03-01 23:48:23 [logger.py:42] Received request cmpl-da1b571212804f2b87032bc4a4e97b5f-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:48:23 [async_llm.py:261] Added request cmpl-da1b571212804f2b87032bc4a4e97b5f-0.
INFO 03-01 23:48:24 [logger.py:42] Received request cmpl-bdc1e0ec26334fbe9ddfc229a812e734-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:48:24 [async_llm.py:261] Added request cmpl-bdc1e0ec26334fbe9ddfc229a812e734-0.
INFO 03-01 23:48:25 [logger.py:42] Received request cmpl-343d1cff0c46447ba92ba9e740201019-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:48:25 [async_llm.py:261] Added request cmpl-343d1cff0c46447ba92ba9e740201019-0.
INFO 03-01 23:48:27 [logger.py:42] Received request cmpl-c2b59d6ef1cb49f58d63e3e5be1ddf53-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:48:27 [async_llm.py:261] Added request cmpl-c2b59d6ef1cb49f58d63e3e5be1ddf53-0.
INFO 03-01 23:48:28 [loggers.py:116] Engine 000: Avg prompt throughput: 24.8 tokens/s, Avg generation throughput: 4.0 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.5%
INFO 03-01 23:48:28 [logger.py:42] Received request cmpl-9d9b42fdcb674436870c45cf27534800-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:48:28 [async_llm.py:261] Added request cmpl-9d9b42fdcb674436870c45cf27534800-0.
INFO 03-01 23:48:29 [logger.py:42] Received request cmpl-5636fda66de14a5abbb78e51661d7e06-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:48:29 [async_llm.py:261] Added request cmpl-5636fda66de14a5abbb78e51661d7e06-0.
INFO 03-01 23:48:30 [logger.py:42] Received request cmpl-afe9a99254b5441d866ad1691753f470-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:48:30 [async_llm.py:261] Added request cmpl-afe9a99254b5441d866ad1691753f470-0.
INFO 03-01 23:48:31 [logger.py:42] Received request cmpl-18b34c09f9c945f6a34e763bfb3eb742-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:48:31 [async_llm.py:261] Added request cmpl-18b34c09f9c945f6a34e763bfb3eb742-0.
INFO 03-01 23:48:32 [logger.py:42] Received request cmpl-fc9b5c5a0ae643d493eccd29ee5f7c11-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:48:32 [async_llm.py:261] Added request cmpl-fc9b5c5a0ae643d493eccd29ee5f7c11-0.
INFO 03-01 23:48:33 [logger.py:42] Received request cmpl-21e46971af4641708e42512aee2be66d-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:48:33 [async_llm.py:261] Added request cmpl-21e46971af4641708e42512aee2be66d-0.
INFO 03-01 23:48:35 [logger.py:42] Received request cmpl-3699024af8534647ac6f57f85cc82d8f-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:48:35 [async_llm.py:261] Added request cmpl-3699024af8534647ac6f57f85cc82d8f-0.
INFO 03-01 23:48:36 [logger.py:42] Received request cmpl-8e4facf76d55493c89c15cae66032103-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:48:36 [async_llm.py:261] Added request cmpl-8e4facf76d55493c89c15cae66032103-0.
INFO 03-01 23:48:37 [logger.py:42] Received request cmpl-3029499b6930428891c515e7c8511164-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:48:37 [async_llm.py:261] Added request cmpl-3029499b6930428891c515e7c8511164-0.
INFO 03-01 23:48:38 [loggers.py:116] Engine 000: Avg prompt throughput: 27.9 tokens/s, Avg generation throughput: 4.5 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.5%
INFO 03-01 23:48:38 [logger.py:42] Received request cmpl-85a1e868c2c2494f8d9e6e29da8562b3-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:48:38 [async_llm.py:261] Added request cmpl-85a1e868c2c2494f8d9e6e29da8562b3-0.
INFO 03-01 23:48:39 [logger.py:42] Received request cmpl-7332a42899e54ae1a8aea1ebef5986db-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:48:39 [async_llm.py:261] Added request cmpl-7332a42899e54ae1a8aea1ebef5986db-0.
INFO 03-01 23:48:40 [logger.py:42] Received request cmpl-6c3222d2bc6f435080c24a2de87ba4b0-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:48:40 [async_llm.py:261] Added request cmpl-6c3222d2bc6f435080c24a2de87ba4b0-0.
INFO 03-01 23:48:42 [logger.py:42] Received request cmpl-cabe499cdd13461980ba3873f33ca6e8-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:48:42 [async_llm.py:261] Added request cmpl-cabe499cdd13461980ba3873f33ca6e8-0.
INFO 03-01 23:48:43 [logger.py:42] Received request cmpl-6ccbe4b8498b4901823976d2c85570cf-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:48:43 [async_llm.py:261] Added request cmpl-6ccbe4b8498b4901823976d2c85570cf-0.
INFO 03-01 23:48:44 [logger.py:42] Received request cmpl-b19745410cd64940801dc1ceceb8d4a1-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:48:44 [async_llm.py:261] Added request cmpl-b19745410cd64940801dc1ceceb8d4a1-0.
INFO 03-01 23:48:45 [logger.py:42] Received request cmpl-8235ca7f2ba8479e9cfa8f164e6c5e28-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:48:45 [async_llm.py:261] Added request cmpl-8235ca7f2ba8479e9cfa8f164e6c5e28-0.
INFO 03-01 23:48:46 [logger.py:42] Received request cmpl-8de512c8c28d4632bf81392308adedea-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:48:46 [async_llm.py:261] Added request cmpl-8de512c8c28d4632bf81392308adedea-0.
INFO 03-01 23:48:47 [logger.py:42] Received request cmpl-2cf3ccf38ecd431aa22ba31954958a12-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:48:47 [async_llm.py:261] Added request cmpl-2cf3ccf38ecd431aa22ba31954958a12-0.
INFO 03-01 23:48:48 [loggers.py:116] Engine 000: Avg prompt throughput: 27.9 tokens/s, Avg generation throughput: 4.5 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.5%
INFO 03-01 23:48:48 [logger.py:42] Received request cmpl-6d551bb4fe0c4dcc890e7d5c592f030f-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:48:48 [async_llm.py:261] Added request cmpl-6d551bb4fe0c4dcc890e7d5c592f030f-0.
INFO 03-01 23:48:50 [logger.py:42] Received request cmpl-7a606b745c5f4f4181ced0a7b2528c8d-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:48:50 [async_llm.py:261] Added request cmpl-7a606b745c5f4f4181ced0a7b2528c8d-0.
INFO 03-01 23:48:51 [logger.py:42] Received request cmpl-59386e654cdb44a28dcc9763b1cda270-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:48:51 [async_llm.py:261] Added request cmpl-59386e654cdb44a28dcc9763b1cda270-0.
INFO 03-01 23:48:52 [logger.py:42] Received request cmpl-58a1c102328a4f059fdac64d1b10a618-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:48:52 [async_llm.py:261] Added request cmpl-58a1c102328a4f059fdac64d1b10a618-0.
INFO 03-01 23:48:53 [logger.py:42] Received request cmpl-ba34993aebbe4db9b3ca6751a5a08092-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:48:53 [async_llm.py:261] Added request cmpl-ba34993aebbe4db9b3ca6751a5a08092-0.
INFO 03-01 23:48:54 [logger.py:42] Received request cmpl-dc805646f392426ca9bff40dae904878-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:48:54 [async_llm.py:261] Added request cmpl-dc805646f392426ca9bff40dae904878-0.
INFO 03-01 23:48:55 [logger.py:42] Received request cmpl-0707280e86c9401099d0fc13d53da09a-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:48:55 [async_llm.py:261] Added request cmpl-0707280e86c9401099d0fc13d53da09a-0.
INFO 03-01 23:48:57 [logger.py:42] Received request cmpl-9940dddf0107488c91c82fd968266e11-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:48:57 [async_llm.py:261] Added request cmpl-9940dddf0107488c91c82fd968266e11-0.
INFO 03-01 23:48:58 [loggers.py:116] Engine 000: Avg prompt throughput: 24.8 tokens/s, Avg generation throughput: 4.0 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.5%
INFO 03-01 23:48:58 [logger.py:42] Received request cmpl-afb313fd8dd7422f83fd4d74c665af4e-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:48:58 [async_llm.py:261] Added request cmpl-afb313fd8dd7422f83fd4d74c665af4e-0.
INFO 03-01 23:48:59 [logger.py:42] Received request cmpl-e90f1d65ccd0438080426ce2a90b03a7-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:48:59 [async_llm.py:261] Added request cmpl-e90f1d65ccd0438080426ce2a90b03a7-0.
INFO 03-01 23:49:00 [logger.py:42] Received request cmpl-caef588d5b1f4f1884f94496c83c7a90-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:49:00 [async_llm.py:261] Added request cmpl-caef588d5b1f4f1884f94496c83c7a90-0.
INFO 03-01 23:49:01 [logger.py:42] Received request cmpl-ee2cdceb15f84c24ba532c6e04c03f63-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:49:01 [async_llm.py:261] Added request cmpl-ee2cdceb15f84c24ba532c6e04c03f63-0.
INFO 03-01 23:49:02 [logger.py:42] Received request cmpl-d8a56dcc728648eea899f75b1a3f3df8-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:49:02 [async_llm.py:261] Added request cmpl-d8a56dcc728648eea899f75b1a3f3df8-0.
INFO 03-01 23:49:03 [logger.py:42] Received request cmpl-1e9424e8483b4f45a24af28ccdee26d5-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:49:03 [async_llm.py:261] Added request cmpl-1e9424e8483b4f45a24af28ccdee26d5-0.
INFO 03-01 23:49:05 [logger.py:42] Received request cmpl-4d0a7eabeb0746bfaa4af9bc412a9d6f-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:49:05 [async_llm.py:261] Added request cmpl-4d0a7eabeb0746bfaa4af9bc412a9d6f-0.
INFO 03-01 23:49:06 [logger.py:42] Received request cmpl-ce5b21c062324870adcd1325c55584cd-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:49:06 [async_llm.py:261] Added request cmpl-ce5b21c062324870adcd1325c55584cd-0.
INFO 03-01 23:49:07 [logger.py:42] Received request cmpl-c48958568a2340d9b41859f56a28567c-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:49:07 [async_llm.py:261] Added request cmpl-c48958568a2340d9b41859f56a28567c-0.
INFO 03-01 23:49:08 [loggers.py:116] Engine 000: Avg prompt throughput: 27.9 tokens/s, Avg generation throughput: 4.5 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.5%
INFO 03-01 23:49:08 [logger.py:42] Received request cmpl-b0e2a531bd4d474d8ee8cfdb6f3ab018-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:49:08 [async_llm.py:261] Added request cmpl-b0e2a531bd4d474d8ee8cfdb6f3ab018-0.
INFO 03-01 23:49:09 [logger.py:42] Received request cmpl-a96020adc182471b90efd06c8a4eaef0-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:49:09 [async_llm.py:261] Added request cmpl-a96020adc182471b90efd06c8a4eaef0-0.
INFO 03-01 23:49:10 [logger.py:42] Received request cmpl-2bb1bc9e2a0447358990062d49a05cbc-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:49:10 [async_llm.py:261] Added request cmpl-2bb1bc9e2a0447358990062d49a05cbc-0.
INFO 03-01 23:49:11 [logger.py:42] Received request cmpl-f97070cd1285459da92392f909e5db6e-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:49:12 [async_llm.py:261] Added request cmpl-f97070cd1285459da92392f909e5db6e-0.
INFO 03-01 23:49:13 [logger.py:42] Received request cmpl-db151b9e9030432c9948ac1768df331e-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:49:13 [async_llm.py:261] Added request cmpl-db151b9e9030432c9948ac1768df331e-0.
INFO 03-01 23:49:14 [logger.py:42] Received request cmpl-04a527778ff243f8a8220549222a0929-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:49:14 [async_llm.py:261] Added request cmpl-04a527778ff243f8a8220549222a0929-0.
INFO 03-01 23:49:15 [logger.py:42] Received request cmpl-6ded34bf0871458bafe2acf1eff3f085-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:49:15 [async_llm.py:261] Added request cmpl-6ded34bf0871458bafe2acf1eff3f085-0.
INFO 03-01 23:49:16 [logger.py:42] Received request cmpl-872ad64cbdbf4b5eb740f44ec1e438b2-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:49:16 [async_llm.py:261] Added request cmpl-872ad64cbdbf4b5eb740f44ec1e438b2-0.
INFO 03-01 23:49:17 [logger.py:42] Received request cmpl-af8ede82ab9c4966bb44b9f5376ab9fe-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:49:17 [async_llm.py:261] Added request cmpl-af8ede82ab9c4966bb44b9f5376ab9fe-0.
INFO 03-01 23:49:18 [loggers.py:116] Engine 000: Avg prompt throughput: 27.9 tokens/s, Avg generation throughput: 4.5 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.5%
INFO 03-01 23:49:18 [logger.py:42] Received request cmpl-d2e556f0463842b19245ecdd6a65ce8a-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:49:18 [async_llm.py:261] Added request cmpl-d2e556f0463842b19245ecdd6a65ce8a-0.
INFO 03-01 23:49:20 [logger.py:42] Received request cmpl-d78a0e5e41c4427a85e46f91cad5bbd3-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:49:20 [async_llm.py:261] Added request cmpl-d78a0e5e41c4427a85e46f91cad5bbd3-0.
INFO 03-01 23:49:21 [logger.py:42] Received request cmpl-bdfc41c8a4184a38b56ebf3ba57f4c6e-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:49:21 [async_llm.py:261] Added request cmpl-bdfc41c8a4184a38b56ebf3ba57f4c6e-0.
INFO 03-01 23:49:22 [logger.py:42] Received request cmpl-7f433c8327144fe1913b536307cab35a-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:49:22 [async_llm.py:261] Added request cmpl-7f433c8327144fe1913b536307cab35a-0.
INFO 03-01 23:49:23 [logger.py:42] Received request cmpl-95e98a65aa32431a9183d0d93233ef73-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:49:23 [async_llm.py:261] Added request cmpl-95e98a65aa32431a9183d0d93233ef73-0.
INFO 03-01 23:49:24 [logger.py:42] Received request cmpl-b467b743cdf14e9382960e32e9405985-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:49:24 [async_llm.py:261] Added request cmpl-b467b743cdf14e9382960e32e9405985-0.
INFO 03-01 23:49:25 [logger.py:42] Received request cmpl-c58357e2a0544b06b829a5bb91a88913-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:49:25 [async_llm.py:261] Added request cmpl-c58357e2a0544b06b829a5bb91a88913-0.
INFO 03-01 23:49:26 [logger.py:42] Received request cmpl-d2a15dbb436843059dd21def850382fc-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:49:26 [async_llm.py:261] Added request cmpl-d2a15dbb436843059dd21def850382fc-0.
INFO 03-01 23:49:28 [loggers.py:116] Engine 000: Avg prompt throughput: 24.8 tokens/s, Avg generation throughput: 4.0 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.5%
INFO 03-01 23:49:28 [logger.py:42] Received request cmpl-d7351c11148e44a98a5abbd72288cab7-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:49:28 [async_llm.py:261] Added request cmpl-d7351c11148e44a98a5abbd72288cab7-0.
INFO 03-01 23:49:29 [logger.py:42] Received request cmpl-d891213129d5412eb04b52b3d78a4d36-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:49:29 [async_llm.py:261] Added request cmpl-d891213129d5412eb04b52b3d78a4d36-0.
INFO 03-01 23:49:30 [logger.py:42] Received request cmpl-436e78995d964920a01d0e9cda9e7aec-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:49:30 [async_llm.py:261] Added request cmpl-436e78995d964920a01d0e9cda9e7aec-0.
INFO 03-01 23:49:31 [logger.py:42] Received request cmpl-31a823ba757c4e6784323a399aec7c77-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:49:31 [async_llm.py:261] Added request cmpl-31a823ba757c4e6784323a399aec7c77-0.
INFO 03-01 23:49:32 [logger.py:42] Received request cmpl-16968fe6a6ef4979baf49e996b81b12c-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:49:32 [async_llm.py:261] Added request cmpl-16968fe6a6ef4979baf49e996b81b12c-0.
INFO 03-01 23:49:33 [logger.py:42] Received request cmpl-c72bfd120d44419eb01ade73b960c245-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:49:33 [async_llm.py:261] Added request cmpl-c72bfd120d44419eb01ade73b960c245-0.
INFO 03-01 23:49:35 [logger.py:42] Received request cmpl-b335163be968444194f0d630429dd255-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:49:35 [async_llm.py:261] Added request cmpl-b335163be968444194f0d630429dd255-0.
INFO 03-01 23:49:36 [logger.py:42] Received request cmpl-1a1ef269ba224cbb843897ef50b23cf4-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:49:36 [async_llm.py:261] Added request cmpl-1a1ef269ba224cbb843897ef50b23cf4-0.
INFO 03-01 23:49:37 [logger.py:42] Received request cmpl-61f7263dc4904433a63778c2de534dee-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:49:37 [async_llm.py:261] Added request cmpl-61f7263dc4904433a63778c2de534dee-0.
INFO 03-01 23:49:38 [loggers.py:116] Engine 000: Avg prompt throughput: 27.9 tokens/s, Avg generation throughput: 4.5 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.5%
INFO 03-01 23:49:38 [logger.py:42] Received request cmpl-159994e39b0544099a64cdc5bd8380a6-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:49:38 [async_llm.py:261] Added request cmpl-159994e39b0544099a64cdc5bd8380a6-0.
INFO 03-01 23:49:39 [logger.py:42] Received request cmpl-b64d6a5ac8ef4adbab41197bb7a0aad5-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:49:39 [async_llm.py:261] Added request cmpl-b64d6a5ac8ef4adbab41197bb7a0aad5-0.
INFO 03-01 23:49:40 [logger.py:42] Received request cmpl-880406d41c494cec9e87747840ee897d-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:49:40 [async_llm.py:261] Added request cmpl-880406d41c494cec9e87747840ee897d-0.
INFO 03-01 23:49:41 [logger.py:42] Received request cmpl-9fb573c53c3740f58156555ec8f223f6-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:49:41 [async_llm.py:261] Added request cmpl-9fb573c53c3740f58156555ec8f223f6-0.
INFO 03-01 23:49:43 [logger.py:42] Received request cmpl-1a3e5b00af4245eb800a3ed0b88f5ce6-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:49:43 [async_llm.py:261] Added request cmpl-1a3e5b00af4245eb800a3ed0b88f5ce6-0.
INFO 03-01 23:49:44 [logger.py:42] Received request cmpl-9416ed55facd4d7089c1f5288e25bf99-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:49:44 [async_llm.py:261] Added request cmpl-9416ed55facd4d7089c1f5288e25bf99-0.
INFO 03-01 23:49:45 [logger.py:42] Received request cmpl-af36f97f37384817807ea7dc02612685-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:49:45 [async_llm.py:261] Added request cmpl-af36f97f37384817807ea7dc02612685-0.
INFO 03-01 23:49:46 [logger.py:42] Received request cmpl-93e2b6d11aaa46fb8e57ef51fc5098d2-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:49:46 [async_llm.py:261] Added request cmpl-93e2b6d11aaa46fb8e57ef51fc5098d2-0.
INFO 03-01 23:49:47 [logger.py:42] Received request cmpl-57efec6ef1594b729dcd7377272a0b9d-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:49:47 [async_llm.py:261] Added request cmpl-57efec6ef1594b729dcd7377272a0b9d-0.
INFO 03-01 23:49:48 [loggers.py:116] Engine 000: Avg prompt throughput: 27.9 tokens/s, Avg generation throughput: 4.5 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.5%
INFO 03-01 23:49:48 [logger.py:42] Received request cmpl-f04a9610eb7e41a08df3ef32bc40853b-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:49:48 [async_llm.py:261] Added request cmpl-f04a9610eb7e41a08df3ef32bc40853b-0.
INFO 03-01 23:49:50 [logger.py:42] Received request cmpl-b13fa40aed084f91a5502e9fd21e30e0-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:49:50 [async_llm.py:261] Added request cmpl-b13fa40aed084f91a5502e9fd21e30e0-0.
INFO 03-01 23:49:51 [logger.py:42] Received request cmpl-d849cb87c1eb48048e8668a1671e9a16-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:49:51 [async_llm.py:261] Added request cmpl-d849cb87c1eb48048e8668a1671e9a16-0.
INFO 03-01 23:49:52 [logger.py:42] Received request cmpl-55929adcb543475284ef6b5bd758f7b2-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:49:52 [async_llm.py:261] Added request cmpl-55929adcb543475284ef6b5bd758f7b2-0.
INFO 03-01 23:49:53 [logger.py:42] Received request cmpl-88d043b5815f4c38bf37dbd839e6ed86-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:49:53 [async_llm.py:261] Added request cmpl-88d043b5815f4c38bf37dbd839e6ed86-0.
INFO 03-01 23:49:54 [logger.py:42] Received request cmpl-00d09b28bc9b4fb5ad39cf9391869d10-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:49:54 [async_llm.py:261] Added request cmpl-00d09b28bc9b4fb5ad39cf9391869d10-0.
INFO 03-01 23:49:55 [logger.py:42] Received request cmpl-0d9c298fcf884122800f9b8d80252d45-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:49:55 [async_llm.py:261] Added request cmpl-0d9c298fcf884122800f9b8d80252d45-0.
INFO 03-01 23:49:56 [logger.py:42] Received request cmpl-62c9c8bc7140487f92939024c9390bb6-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:49:56 [async_llm.py:261] Added request cmpl-62c9c8bc7140487f92939024c9390bb6-0.
INFO 03-01 23:49:58 [loggers.py:116] Engine 000: Avg prompt throughput: 24.8 tokens/s, Avg generation throughput: 4.0 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.5%
INFO 03-01 23:49:58 [logger.py:42] Received request cmpl-579ad2dbf5874f318e38e85ff08c2a45-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:49:58 [async_llm.py:261] Added request cmpl-579ad2dbf5874f318e38e85ff08c2a45-0.
INFO 03-01 23:49:59 [logger.py:42] Received request cmpl-5d08c0e199d24fd787cd1f3c22a81943-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:49:59 [async_llm.py:261] Added request cmpl-5d08c0e199d24fd787cd1f3c22a81943-0.
INFO 03-01 23:50:00 [logger.py:42] Received request cmpl-f8f4ab353eaa4634ae68ac31897bc0b1-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:50:00 [async_llm.py:261] Added request cmpl-f8f4ab353eaa4634ae68ac31897bc0b1-0.
INFO 03-01 23:50:01 [logger.py:42] Received request cmpl-977dd4047558407883eede37ff580610-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:50:01 [async_llm.py:261] Added request cmpl-977dd4047558407883eede37ff580610-0.
INFO 03-01 23:50:02 [logger.py:42] Received request cmpl-25951e8672714ef7930ebeca9d487655-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:50:02 [async_llm.py:261] Added request cmpl-25951e8672714ef7930ebeca9d487655-0.
INFO 03-01 23:50:03 [logger.py:42] Received request cmpl-51246891c944448ebb778230ac475c75-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:50:03 [async_llm.py:261] Added request cmpl-51246891c944448ebb778230ac475c75-0.
INFO 03-01 23:50:05 [logger.py:42] Received request cmpl-8c42386200884058baa5d0b64968abfa-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:50:05 [async_llm.py:261] Added request cmpl-8c42386200884058baa5d0b64968abfa-0.
INFO 03-01 23:50:06 [logger.py:42] Received request cmpl-90bb6adc4522428ea451f62bbb1fe5b4-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:50:06 [async_llm.py:261] Added request cmpl-90bb6adc4522428ea451f62bbb1fe5b4-0.
INFO 03-01 23:50:07 [logger.py:42] Received request cmpl-aeb3a0808ff849ae97645c2c514ff391-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:50:07 [async_llm.py:261] Added request cmpl-aeb3a0808ff849ae97645c2c514ff391-0.
INFO 03-01 23:50:08 [loggers.py:116] Engine 000: Avg prompt throughput: 27.9 tokens/s, Avg generation throughput: 4.5 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.5%
INFO 03-01 23:50:08 [logger.py:42] Received request cmpl-334dd88fc0004ee8bdd0df2094e615a3-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:50:08 [async_llm.py:261] Added request cmpl-334dd88fc0004ee8bdd0df2094e615a3-0.
INFO 03-01 23:50:09 [logger.py:42] Received request cmpl-a073f08ef3b74ca09f1325fcdf72971b-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:50:09 [async_llm.py:261] Added request cmpl-a073f08ef3b74ca09f1325fcdf72971b-0.
INFO 03-01 23:50:10 [logger.py:42] Received request cmpl-2c121c3880e44ea79ac00bca49f6e7a9-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:50:10 [async_llm.py:261] Added request cmpl-2c121c3880e44ea79ac00bca49f6e7a9-0.
INFO 03-01 23:50:11 [logger.py:42] Received request cmpl-602eeef14ab04a8f9c84497681ec00c2-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:50:11 [async_llm.py:261] Added request cmpl-602eeef14ab04a8f9c84497681ec00c2-0.
INFO 03-01 23:50:13 [logger.py:42] Received request cmpl-9408d971d6494e6f904ee1e32c79100f-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:50:13 [async_llm.py:261] Added request cmpl-9408d971d6494e6f904ee1e32c79100f-0.
INFO 03-01 23:50:14 [logger.py:42] Received request cmpl-a6d7311400714a419d136df2e415d0c0-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:50:14 [async_llm.py:261] Added request cmpl-a6d7311400714a419d136df2e415d0c0-0.
INFO 03-01 23:50:15 [logger.py:42] Received request cmpl-8c9087a688e04caba833419a0ae2114b-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:50:15 [async_llm.py:261] Added request cmpl-8c9087a688e04caba833419a0ae2114b-0.
INFO 03-01 23:50:16 [logger.py:42] Received request cmpl-64918837de7340278b4db64555fd664a-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:50:16 [async_llm.py:261] Added request cmpl-64918837de7340278b4db64555fd664a-0.
INFO 03-01 23:50:17 [logger.py:42] Received request cmpl-31eacb1b4b314a0c8ff4c71e469d260e-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:50:17 [async_llm.py:261] Added request cmpl-31eacb1b4b314a0c8ff4c71e469d260e-0.
INFO 03-01 23:50:18 [loggers.py:116] Engine 000: Avg prompt throughput: 27.9 tokens/s, Avg generation throughput: 4.5 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.5%
INFO 03-01 23:50:18 [logger.py:42] Received request cmpl-341dbcbe42ff4319b675afc5dab04b4c-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:50:18 [async_llm.py:261] Added request cmpl-341dbcbe42ff4319b675afc5dab04b4c-0.
INFO 03-01 23:50:20 [logger.py:42] Received request cmpl-6185234ff72547e595f582796e12bd76-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:50:20 [async_llm.py:261] Added request cmpl-6185234ff72547e595f582796e12bd76-0.
INFO 03-01 23:50:21 [logger.py:42] Received request cmpl-7acd16b6c3114c3fbba1895742c6b356-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:50:21 [async_llm.py:261] Added request cmpl-7acd16b6c3114c3fbba1895742c6b356-0.
INFO 03-01 23:50:22 [logger.py:42] Received request cmpl-7c44ba9b0c454986bfb30a9af90dc658-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:50:22 [async_llm.py:261] Added request cmpl-7c44ba9b0c454986bfb30a9af90dc658-0.
INFO 03-01 23:50:23 [logger.py:42] Received request cmpl-ba876d5c49fc4effba112be2ad5c4996-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:50:23 [async_llm.py:261] Added request cmpl-ba876d5c49fc4effba112be2ad5c4996-0.
INFO 03-01 23:50:24 [logger.py:42] Received request cmpl-835a5427fa3a4a0a93a2fb106e239cf1-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:50:24 [async_llm.py:261] Added request cmpl-835a5427fa3a4a0a93a2fb106e239cf1-0.
INFO 03-01 23:50:25 [logger.py:42] Received request cmpl-8b47b4f297a5495e9dde48fb4c3f14b9-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:50:25 [async_llm.py:261] Added request cmpl-8b47b4f297a5495e9dde48fb4c3f14b9-0.
INFO 03-01 23:50:26 [logger.py:42] Received request cmpl-5027e1d17da9428c8d6dc5a5f636ca86-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:50:26 [async_llm.py:261] Added request cmpl-5027e1d17da9428c8d6dc5a5f636ca86-0.
INFO 03-01 23:50:28 [loggers.py:116] Engine 000: Avg prompt throughput: 24.8 tokens/s, Avg generation throughput: 4.0 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.5%
INFO 03-01 23:50:28 [logger.py:42] Received request cmpl-7144b044c91d4e1296b0e764fd58a542-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:50:28 [async_llm.py:261] Added request cmpl-7144b044c91d4e1296b0e764fd58a542-0.
INFO 03-01 23:50:29 [logger.py:42] Received request cmpl-a7a9effcaa1d478f9f7801f551cded95-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:50:29 [async_llm.py:261] Added request cmpl-a7a9effcaa1d478f9f7801f551cded95-0.
INFO 03-01 23:50:30 [logger.py:42] Received request cmpl-bfa6b149951c4212b8021d0602f3bf57-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:50:30 [async_llm.py:261] Added request cmpl-bfa6b149951c4212b8021d0602f3bf57-0.
INFO 03-01 23:50:31 [logger.py:42] Received request cmpl-b79cf4e84df44f13bede27d519fac74f-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:50:31 [async_llm.py:261] Added request cmpl-b79cf4e84df44f13bede27d519fac74f-0.
INFO 03-01 23:50:32 [logger.py:42] Received request cmpl-c8a9c57016624b8fbe046626cb9b4b3c-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:50:32 [async_llm.py:261] Added request cmpl-c8a9c57016624b8fbe046626cb9b4b3c-0.
INFO 03-01 23:50:33 [logger.py:42] Received request cmpl-b8e9746deefd4c2f85b798f372bc1659-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:50:33 [async_llm.py:261] Added request cmpl-b8e9746deefd4c2f85b798f372bc1659-0.
INFO 03-01 23:50:35 [logger.py:42] Received request cmpl-379d265f55f447278da93e8ba1996474-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:50:35 [async_llm.py:261] Added request cmpl-379d265f55f447278da93e8ba1996474-0.
INFO 03-01 23:50:36 [logger.py:42] Received request cmpl-2423821be9bf4512abbca5c6b5e89c71-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:50:36 [async_llm.py:261] Added request cmpl-2423821be9bf4512abbca5c6b5e89c71-0.
INFO 03-01 23:50:37 [logger.py:42] Received request cmpl-e4190e52997b4cd4868c27dcf8b3dfba-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:50:37 [async_llm.py:261] Added request cmpl-e4190e52997b4cd4868c27dcf8b3dfba-0.
INFO 03-01 23:50:38 [loggers.py:116] Engine 000: Avg prompt throughput: 27.9 tokens/s, Avg generation throughput: 4.5 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.5%
INFO 03-01 23:50:38 [logger.py:42] Received request cmpl-df8897cd155d47d49870fba3b0745f5e-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:50:38 [async_llm.py:261] Added request cmpl-df8897cd155d47d49870fba3b0745f5e-0.
INFO 03-01 23:50:39 [logger.py:42] Received request cmpl-e674ac8bcf3f409f8afa8269042421e0-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:50:39 [async_llm.py:261] Added request cmpl-e674ac8bcf3f409f8afa8269042421e0-0.
INFO 03-01 23:50:40 [logger.py:42] Received request cmpl-8b9c5f3d8013404289c0fbcdb432db53-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:50:40 [async_llm.py:261] Added request cmpl-8b9c5f3d8013404289c0fbcdb432db53-0.
INFO 03-01 23:50:41 [logger.py:42] Received request cmpl-40f792ae142d4362b63dcd33272e9610-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:50:41 [async_llm.py:261] Added request cmpl-40f792ae142d4362b63dcd33272e9610-0.
INFO 03-01 23:50:43 [logger.py:42] Received request cmpl-6f79c12e44c540dcab2989a9d1059118-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:50:43 [async_llm.py:261] Added request cmpl-6f79c12e44c540dcab2989a9d1059118-0.
INFO 03-01 23:50:44 [logger.py:42] Received request cmpl-bd77c0ecae104700936b906ce30e60bf-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:50:44 [async_llm.py:261] Added request cmpl-bd77c0ecae104700936b906ce30e60bf-0.
INFO 03-01 23:50:45 [logger.py:42] Received request cmpl-7119a79450e2436098c38c47afbea7a1-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:50:45 [async_llm.py:261] Added request cmpl-7119a79450e2436098c38c47afbea7a1-0.
INFO 03-01 23:50:46 [logger.py:42] Received request cmpl-e691c647d90c443f9767477e7896d7b3-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:50:46 [async_llm.py:261] Added request cmpl-e691c647d90c443f9767477e7896d7b3-0.
INFO 03-01 23:50:47 [logger.py:42] Received request cmpl-4e3fd1acfc8849b8b4591b7fbb441566-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:50:47 [async_llm.py:261] Added request cmpl-4e3fd1acfc8849b8b4591b7fbb441566-0.
INFO 03-01 23:50:48 [loggers.py:116] Engine 000: Avg prompt throughput: 27.9 tokens/s, Avg generation throughput: 4.5 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.6%
INFO 03-01 23:50:48 [logger.py:42] Received request cmpl-085ce43c6f1d49fa80d3e711ecfe87b6-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:50:48 [async_llm.py:261] Added request cmpl-085ce43c6f1d49fa80d3e711ecfe87b6-0.
INFO 03-01 23:50:50 [logger.py:42] Received request cmpl-0319a8b570fd4eb58067f4b5ae13463d-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:50:50 [async_llm.py:261] Added request cmpl-0319a8b570fd4eb58067f4b5ae13463d-0.
INFO 03-01 23:50:51 [logger.py:42] Received request cmpl-afb24cc440a84f7f982fe86798c50661-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:50:51 [async_llm.py:261] Added request cmpl-afb24cc440a84f7f982fe86798c50661-0.
INFO 03-01 23:50:52 [logger.py:42] Received request cmpl-22d4841585a746fd98172e653741e4d0-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:50:52 [async_llm.py:261] Added request cmpl-22d4841585a746fd98172e653741e4d0-0.
INFO 03-01 23:50:53 [logger.py:42] Received request cmpl-f3cb7482896a48d3850d7bc552d10c59-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:50:53 [async_llm.py:261] Added request cmpl-f3cb7482896a48d3850d7bc552d10c59-0.
INFO 03-01 23:50:54 [logger.py:42] Received request cmpl-dc6a3281d7484c0ea187cc3b2b20311e-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:50:54 [async_llm.py:261] Added request cmpl-dc6a3281d7484c0ea187cc3b2b20311e-0.
INFO 03-01 23:50:55 [logger.py:42] Received request cmpl-59481c47a0814121b945f1731bccec73-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:50:55 [async_llm.py:261] Added request cmpl-59481c47a0814121b945f1731bccec73-0.
INFO 03-01 23:50:56 [logger.py:42] Received request cmpl-cbf5eccbafc34fcd9cf2967357a0b7fb-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:50:56 [async_llm.py:261] Added request cmpl-cbf5eccbafc34fcd9cf2967357a0b7fb-0.
INFO 03-01 23:50:58 [loggers.py:116] Engine 000: Avg prompt throughput: 24.8 tokens/s, Avg generation throughput: 4.0 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.6%
INFO 03-01 23:50:58 [logger.py:42] Received request cmpl-71b7eb9a359348de8376cd4e8019b4f9-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:50:58 [async_llm.py:261] Added request cmpl-71b7eb9a359348de8376cd4e8019b4f9-0.
INFO 03-01 23:50:59 [logger.py:42] Received request cmpl-7ab1308f4f2449f183f3e36af7180d82-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:50:59 [async_llm.py:261] Added request cmpl-7ab1308f4f2449f183f3e36af7180d82-0.
INFO 03-01 23:51:00 [logger.py:42] Received request cmpl-bf93f4b5768744cd9c862c6497c1fa83-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:51:00 [async_llm.py:261] Added request cmpl-bf93f4b5768744cd9c862c6497c1fa83-0.
INFO 03-01 23:51:01 [logger.py:42] Received request cmpl-ff02474bcdfd47f2a133d2694cc38d37-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:51:01 [async_llm.py:261] Added request cmpl-ff02474bcdfd47f2a133d2694cc38d37-0.
INFO 03-01 23:51:02 [logger.py:42] Received request cmpl-eb61d5ea6c8f47ea962e6ca278dd8732-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:51:02 [async_llm.py:261] Added request cmpl-eb61d5ea6c8f47ea962e6ca278dd8732-0.
INFO 03-01 23:51:03 [logger.py:42] Received request cmpl-004344dbfd744f0888967afde4a9ee73-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:51:03 [async_llm.py:261] Added request cmpl-004344dbfd744f0888967afde4a9ee73-0.
INFO 03-01 23:51:05 [logger.py:42] Received request cmpl-98e3d1e9a55c408a9944e200fb8ebde0-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:51:05 [async_llm.py:261] Added request cmpl-98e3d1e9a55c408a9944e200fb8ebde0-0.
INFO 03-01 23:51:06 [logger.py:42] Received request cmpl-956f827666ac4a0a88cc122aecaecd4c-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:51:06 [async_llm.py:261] Added request cmpl-956f827666ac4a0a88cc122aecaecd4c-0.
INFO 03-01 23:51:07 [logger.py:42] Received request cmpl-bf4e5d53e9164bdf9e3f8589eb6e37df-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:51:07 [async_llm.py:261] Added request cmpl-bf4e5d53e9164bdf9e3f8589eb6e37df-0.
INFO 03-01 23:51:08 [loggers.py:116] Engine 000: Avg prompt throughput: 27.9 tokens/s, Avg generation throughput: 4.5 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.6%
INFO 03-01 23:51:08 [logger.py:42] Received request cmpl-b2f9cb6cd3924da88b8b2585e0831a3b-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:51:08 [async_llm.py:261] Added request cmpl-b2f9cb6cd3924da88b8b2585e0831a3b-0.
INFO 03-01 23:51:09 [logger.py:42] Received request cmpl-b97db173b32b4986b945dd8c6a83d6e4-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:51:09 [async_llm.py:261] Added request cmpl-b97db173b32b4986b945dd8c6a83d6e4-0.
INFO 03-01 23:51:10 [logger.py:42] Received request cmpl-b786403b80654e848b057c727207b04f-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:51:10 [async_llm.py:261] Added request cmpl-b786403b80654e848b057c727207b04f-0.
INFO 03-01 23:51:11 [logger.py:42] Received request cmpl-bb3d9b2c0f204aa89b4315dfbbe1c791-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:51:11 [async_llm.py:261] Added request cmpl-bb3d9b2c0f204aa89b4315dfbbe1c791-0.
INFO 03-01 23:51:13 [logger.py:42] Received request cmpl-8cb87bd7b63e4d87a1178d3e43b2d32c-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:51:13 [async_llm.py:261] Added request cmpl-8cb87bd7b63e4d87a1178d3e43b2d32c-0.
INFO 03-01 23:51:14 [logger.py:42] Received request cmpl-0182a7c53d6e419481a11745e3b53a79-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:51:14 [async_llm.py:261] Added request cmpl-0182a7c53d6e419481a11745e3b53a79-0.
INFO 03-01 23:51:15 [logger.py:42] Received request cmpl-874130e636f54dce8e7733a5f19d9ebd-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:51:15 [async_llm.py:261] Added request cmpl-874130e636f54dce8e7733a5f19d9ebd-0.
INFO 03-01 23:51:16 [logger.py:42] Received request cmpl-91dc9835498046b79e5146674da36efd-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:51:16 [async_llm.py:261] Added request cmpl-91dc9835498046b79e5146674da36efd-0.
INFO 03-01 23:51:17 [logger.py:42] Received request cmpl-2019784f65db43eab4750de774ed16e7-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:51:17 [async_llm.py:261] Added request cmpl-2019784f65db43eab4750de774ed16e7-0.
INFO 03-01 23:51:18 [loggers.py:116] Engine 000: Avg prompt throughput: 27.9 tokens/s, Avg generation throughput: 4.5 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.6%
INFO 03-01 23:51:18 [logger.py:42] Received request cmpl-2b3e2a6cd46c4ce3b47daa67b565606b-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:51:18 [async_llm.py:261] Added request cmpl-2b3e2a6cd46c4ce3b47daa67b565606b-0.
INFO 03-01 23:51:20 [logger.py:42] Received request cmpl-9947c81c3ea44776a9f68043c4c93b82-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:51:20 [async_llm.py:261] Added request cmpl-9947c81c3ea44776a9f68043c4c93b82-0.
INFO 03-01 23:51:21 [logger.py:42] Received request cmpl-c26af002215f415799469a6808c7e17a-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:51:21 [async_llm.py:261] Added request cmpl-c26af002215f415799469a6808c7e17a-0.
INFO 03-01 23:51:22 [logger.py:42] Received request cmpl-eabf56599a2b46f1a65f7b4356dae0f6-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:51:22 [async_llm.py:261] Added request cmpl-eabf56599a2b46f1a65f7b4356dae0f6-0.
INFO 03-01 23:51:23 [logger.py:42] Received request cmpl-7dcff9451e1047928ff645d39e565ef0-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:51:23 [async_llm.py:261] Added request cmpl-7dcff9451e1047928ff645d39e565ef0-0.
INFO 03-01 23:51:24 [logger.py:42] Received request cmpl-ed9ce433d002417db11d9d2d4c2bd8f0-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:51:24 [async_llm.py:261] Added request cmpl-ed9ce433d002417db11d9d2d4c2bd8f0-0.
INFO 03-01 23:51:25 [logger.py:42] Received request cmpl-0c319a948b304fb089e24dee7f4f7b3e-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:51:25 [async_llm.py:261] Added request cmpl-0c319a948b304fb089e24dee7f4f7b3e-0.
INFO 03-01 23:51:26 [logger.py:42] Received request cmpl-68da97d5b05544419a3023c9870db236-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:51:26 [async_llm.py:261] Added request cmpl-68da97d5b05544419a3023c9870db236-0.
INFO 03-01 23:51:28 [loggers.py:116] Engine 000: Avg prompt throughput: 24.8 tokens/s, Avg generation throughput: 4.0 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.6%
INFO 03-01 23:51:28 [logger.py:42] Received request cmpl-76dd9b10faa742d2afa38ba3c7d16043-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:51:28 [async_llm.py:261] Added request cmpl-76dd9b10faa742d2afa38ba3c7d16043-0.
INFO 03-01 23:51:29 [logger.py:42] Received request cmpl-8d4bcfb4af2a4b2b9137d34e428a47cf-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:51:29 [async_llm.py:261] Added request cmpl-8d4bcfb4af2a4b2b9137d34e428a47cf-0.
INFO 03-01 23:51:30 [logger.py:42] Received request cmpl-d4f299baaa2248d7abe716c8b0c6d956-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:51:30 [async_llm.py:261] Added request cmpl-d4f299baaa2248d7abe716c8b0c6d956-0.
INFO 03-01 23:51:31 [logger.py:42] Received request cmpl-e62b19cccee849eea1eadd806b6be5eb-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:51:31 [async_llm.py:261] Added request cmpl-e62b19cccee849eea1eadd806b6be5eb-0.
INFO 03-01 23:51:32 [logger.py:42] Received request cmpl-f62112d8ca70422da4faffe2bd6f3bf8-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:51:32 [async_llm.py:261] Added request cmpl-f62112d8ca70422da4faffe2bd6f3bf8-0.
INFO 03-01 23:51:33 [logger.py:42] Received request cmpl-915f31ae0689441ab8643d0b63da3788-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:51:33 [async_llm.py:261] Added request cmpl-915f31ae0689441ab8643d0b63da3788-0.
INFO 03-01 23:51:35 [logger.py:42] Received request cmpl-b5f1c4b41f7f4ec8a7969dead3c8266e-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:51:35 [async_llm.py:261] Added request cmpl-b5f1c4b41f7f4ec8a7969dead3c8266e-0.
INFO 03-01 23:51:36 [logger.py:42] Received request cmpl-0e38dd06041347dfa4aa6718111dbe2d-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:51:36 [async_llm.py:261] Added request cmpl-0e38dd06041347dfa4aa6718111dbe2d-0.
INFO 03-01 23:51:37 [logger.py:42] Received request cmpl-1efcdd4a6d66493d9a3ea57c5a54f481-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:51:37 [async_llm.py:261] Added request cmpl-1efcdd4a6d66493d9a3ea57c5a54f481-0.
INFO 03-01 23:51:38 [loggers.py:116] Engine 000: Avg prompt throughput: 27.9 tokens/s, Avg generation throughput: 4.5 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.6%
INFO 03-01 23:51:38 [logger.py:42] Received request cmpl-d04b140019f340e299d6d9c6211df100-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:51:38 [async_llm.py:261] Added request cmpl-d04b140019f340e299d6d9c6211df100-0.
INFO 03-01 23:51:39 [logger.py:42] Received request cmpl-b5c254a65a054a82a4939d89e5f62c30-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:51:39 [async_llm.py:261] Added request cmpl-b5c254a65a054a82a4939d89e5f62c30-0.
INFO 03-01 23:51:40 [logger.py:42] Received request cmpl-2bb30df6f6de4353bdc03f61b00d546b-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:51:40 [async_llm.py:261] Added request cmpl-2bb30df6f6de4353bdc03f61b00d546b-0.
INFO 03-01 23:51:41 [logger.py:42] Received request cmpl-78b6661538374d1da3888f220a81905d-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:51:41 [async_llm.py:261] Added request cmpl-78b6661538374d1da3888f220a81905d-0.
INFO 03-01 23:51:43 [logger.py:42] Received request cmpl-a76d0f8c610b4dd093e3dfe7cfb332de-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:51:43 [async_llm.py:261] Added request cmpl-a76d0f8c610b4dd093e3dfe7cfb332de-0.
INFO 03-01 23:51:44 [logger.py:42] Received request cmpl-ad3785791cdb4c2492c2802ff40ac85e-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:51:44 [async_llm.py:261] Added request cmpl-ad3785791cdb4c2492c2802ff40ac85e-0.
INFO 03-01 23:51:45 [logger.py:42] Received request cmpl-1bb380187381410d9119ef5e3623df85-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:51:45 [async_llm.py:261] Added request cmpl-1bb380187381410d9119ef5e3623df85-0.
INFO 03-01 23:51:46 [logger.py:42] Received request cmpl-0ca9e524537d4630b0c4cdc918ea76fc-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:51:46 [async_llm.py:261] Added request cmpl-0ca9e524537d4630b0c4cdc918ea76fc-0.
INFO 03-01 23:51:47 [logger.py:42] Received request cmpl-43b4e655a9cc4a5f9b9bb0696e3445e6-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:51:47 [async_llm.py:261] Added request cmpl-43b4e655a9cc4a5f9b9bb0696e3445e6-0.
INFO 03-01 23:51:48 [loggers.py:116] Engine 000: Avg prompt throughput: 27.9 tokens/s, Avg generation throughput: 4.5 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.6%
INFO 03-01 23:51:48 [logger.py:42] Received request cmpl-da1e4e9c9add4a4fbdd646c3d6ea4e48-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:51:48 [async_llm.py:261] Added request cmpl-da1e4e9c9add4a4fbdd646c3d6ea4e48-0.
INFO 03-01 23:51:50 [logger.py:42] Received request cmpl-ec1f5a8715f54395aa93583e9992b661-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:51:50 [async_llm.py:261] Added request cmpl-ec1f5a8715f54395aa93583e9992b661-0.
INFO 03-01 23:51:51 [logger.py:42] Received request cmpl-d5c540d11a1746d9b2e4c73da1a2525f-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:51:51 [async_llm.py:261] Added request cmpl-d5c540d11a1746d9b2e4c73da1a2525f-0.
INFO 03-01 23:51:52 [logger.py:42] Received request cmpl-bfd96c644a2b420e9454c588f2ac7450-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:51:52 [async_llm.py:261] Added request cmpl-bfd96c644a2b420e9454c588f2ac7450-0.
INFO 03-01 23:51:53 [logger.py:42] Received request cmpl-09a2a40060b440c7981d948d8a4854cf-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:51:53 [async_llm.py:261] Added request cmpl-09a2a40060b440c7981d948d8a4854cf-0.
INFO 03-01 23:51:54 [logger.py:42] Received request cmpl-7e8b18019ba7487ba6d3c1fffa0475f3-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:51:54 [async_llm.py:261] Added request cmpl-7e8b18019ba7487ba6d3c1fffa0475f3-0.
INFO 03-01 23:51:55 [logger.py:42] Received request cmpl-5c839df4e4a342739078cd9d13e30c89-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:51:55 [async_llm.py:261] Added request cmpl-5c839df4e4a342739078cd9d13e30c89-0.
INFO 03-01 23:51:57 [logger.py:42] Received request cmpl-d1ac73b36ef9417e9d5dfb16d3822222-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:51:57 [async_llm.py:261] Added request cmpl-d1ac73b36ef9417e9d5dfb16d3822222-0.
INFO 03-01 23:51:58 [loggers.py:116] Engine 000: Avg prompt throughput: 24.8 tokens/s, Avg generation throughput: 4.0 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.6%
INFO 03-01 23:51:58 [logger.py:42] Received request cmpl-9adf0577cc0a48a28c4319a98b7d4611-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:51:58 [async_llm.py:261] Added request cmpl-9adf0577cc0a48a28c4319a98b7d4611-0.
INFO 03-01 23:51:59 [logger.py:42] Received request cmpl-df1de0802ae04f0495db2f7b3e0d7c0c-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:51:59 [async_llm.py:261] Added request cmpl-df1de0802ae04f0495db2f7b3e0d7c0c-0.
INFO 03-01 23:52:00 [logger.py:42] Received request cmpl-37791a7616be464e879799ff6d2b2b7d-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:52:00 [async_llm.py:261] Added request cmpl-37791a7616be464e879799ff6d2b2b7d-0.
INFO 03-01 23:52:01 [logger.py:42] Received request cmpl-6f869e73aa764cc7aa0f3042b4934dcd-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:52:01 [async_llm.py:261] Added request cmpl-6f869e73aa764cc7aa0f3042b4934dcd-0.
INFO 03-01 23:52:02 [logger.py:42] Received request cmpl-4557a0f4fd3a428a89d0a70621259508-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:52:02 [async_llm.py:261] Added request cmpl-4557a0f4fd3a428a89d0a70621259508-0.
INFO 03-01 23:52:03 [logger.py:42] Received request cmpl-7eb5683d7b80477db71a4475dd2416da-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:52:03 [async_llm.py:261] Added request cmpl-7eb5683d7b80477db71a4475dd2416da-0.
INFO 03-01 23:52:05 [logger.py:42] Received request cmpl-d7bad9a8be1c487ca2a5d8de757f98b6-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:52:05 [async_llm.py:261] Added request cmpl-d7bad9a8be1c487ca2a5d8de757f98b6-0.
INFO 03-01 23:52:06 [logger.py:42] Received request cmpl-9d319f3c2e574ba1968d0288fe90ac59-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:52:06 [async_llm.py:261] Added request cmpl-9d319f3c2e574ba1968d0288fe90ac59-0.
INFO 03-01 23:52:07 [logger.py:42] Received request cmpl-df8d3515c1c54c57968b61a4aa0bdb4d-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:52:07 [async_llm.py:261] Added request cmpl-df8d3515c1c54c57968b61a4aa0bdb4d-0.
INFO 03-01 23:52:08 [loggers.py:116] Engine 000: Avg prompt throughput: 27.9 tokens/s, Avg generation throughput: 4.5 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.6%
INFO 03-01 23:52:08 [logger.py:42] Received request cmpl-06960d4cfb1546abbe867fbb58ff15e3-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:52:08 [async_llm.py:261] Added request cmpl-06960d4cfb1546abbe867fbb58ff15e3-0.
INFO 03-01 23:52:09 [logger.py:42] Received request cmpl-27b78c0720ff45e8b4f3f6648696afed-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:52:09 [async_llm.py:261] Added request cmpl-27b78c0720ff45e8b4f3f6648696afed-0.
INFO 03-01 23:52:10 [logger.py:42] Received request cmpl-73917db067684c1fa923adae339ee2ec-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:52:10 [async_llm.py:261] Added request cmpl-73917db067684c1fa923adae339ee2ec-0.
INFO 03-01 23:52:11 [logger.py:42] Received request cmpl-3ff4f4f4b3f246cda6f3c8cca00cd500-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:52:11 [async_llm.py:261] Added request cmpl-3ff4f4f4b3f246cda6f3c8cca00cd500-0.
INFO 03-01 23:52:13 [logger.py:42] Received request cmpl-47130637b14e456a99e6569da7ab32fd-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:52:13 [async_llm.py:261] Added request cmpl-47130637b14e456a99e6569da7ab32fd-0.
INFO 03-01 23:52:14 [logger.py:42] Received request cmpl-f584a9cf33e4473e852e01f0f3c55e76-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:52:14 [async_llm.py:261] Added request cmpl-f584a9cf33e4473e852e01f0f3c55e76-0.
INFO 03-01 23:52:15 [logger.py:42] Received request cmpl-83d7641c3dd749a9b16512ecca3a1420-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:52:15 [async_llm.py:261] Added request cmpl-83d7641c3dd749a9b16512ecca3a1420-0.
INFO 03-01 23:52:16 [logger.py:42] Received request cmpl-eb067898f6e8493180c09d86d5b9d5a0-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:52:16 [async_llm.py:261] Added request cmpl-eb067898f6e8493180c09d86d5b9d5a0-0.
INFO 03-01 23:52:17 [logger.py:42] Received request cmpl-3943d74a909b4050a3e31954dbeeafb2-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:52:17 [async_llm.py:261] Added request cmpl-3943d74a909b4050a3e31954dbeeafb2-0.
INFO 03-01 23:52:18 [loggers.py:116] Engine 000: Avg prompt throughput: 27.9 tokens/s, Avg generation throughput: 4.5 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.6%
INFO 03-01 23:52:18 [logger.py:42] Received request cmpl-66b18a6a19894e8999db56acda76fa16-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:52:18 [async_llm.py:261] Added request cmpl-66b18a6a19894e8999db56acda76fa16-0.
INFO 03-01 23:52:20 [logger.py:42] Received request cmpl-26bc92610dee475bbda9dcf390f21ff3-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:52:20 [async_llm.py:261] Added request cmpl-26bc92610dee475bbda9dcf390f21ff3-0.
INFO 03-01 23:52:21 [logger.py:42] Received request cmpl-fe644752928f49f79cd52bd4af6205fd-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:52:21 [async_llm.py:261] Added request cmpl-fe644752928f49f79cd52bd4af6205fd-0.
INFO 03-01 23:52:22 [logger.py:42] Received request cmpl-73418def510743a198b9cb0142558bd7-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:52:22 [async_llm.py:261] Added request cmpl-73418def510743a198b9cb0142558bd7-0.
INFO 03-01 23:52:23 [logger.py:42] Received request cmpl-a4be1e157c164c71bccc61f5f2628e90-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:52:23 [async_llm.py:261] Added request cmpl-a4be1e157c164c71bccc61f5f2628e90-0.
INFO 03-01 23:52:24 [logger.py:42] Received request cmpl-d7c8f4a5c880421da5956051846ef5fc-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:52:24 [async_llm.py:261] Added request cmpl-d7c8f4a5c880421da5956051846ef5fc-0.
INFO 03-01 23:52:25 [logger.py:42] Received request cmpl-ad2e10cf70134f0dba8c382b5d605517-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:52:25 [async_llm.py:261] Added request cmpl-ad2e10cf70134f0dba8c382b5d605517-0.
INFO 03-01 23:52:26 [logger.py:42] Received request cmpl-02bd2f0a88834612930fc0fee76d44f3-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:52:26 [async_llm.py:261] Added request cmpl-02bd2f0a88834612930fc0fee76d44f3-0.
INFO 03-01 23:52:28 [loggers.py:116] Engine 000: Avg prompt throughput: 24.8 tokens/s, Avg generation throughput: 4.0 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.6%
INFO 03-01 23:52:28 [logger.py:42] Received request cmpl-b6a7e0a463534482a4646990c001afaf-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:52:28 [async_llm.py:261] Added request cmpl-b6a7e0a463534482a4646990c001afaf-0.
INFO 03-01 23:52:29 [logger.py:42] Received request cmpl-07c2c6fe895944f9b6af5d0fd9a4ab25-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:52:29 [async_llm.py:261] Added request cmpl-07c2c6fe895944f9b6af5d0fd9a4ab25-0.
INFO 03-01 23:52:30 [logger.py:42] Received request cmpl-7212b3f1921b4cd4ae7721d3220d77e4-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:52:30 [async_llm.py:261] Added request cmpl-7212b3f1921b4cd4ae7721d3220d77e4-0.
INFO 03-01 23:52:31 [logger.py:42] Received request cmpl-cf7b7b79cec444b7bef45af72de9be10-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:52:31 [async_llm.py:261] Added request cmpl-cf7b7b79cec444b7bef45af72de9be10-0.
INFO 03-01 23:52:32 [logger.py:42] Received request cmpl-f216cfa94893443e9b4dce9c0d65b3e2-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:52:32 [async_llm.py:261] Added request cmpl-f216cfa94893443e9b4dce9c0d65b3e2-0.
INFO 03-01 23:52:33 [logger.py:42] Received request cmpl-3980bf1c8c61402b8d81ee56cc930801-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:52:33 [async_llm.py:261] Added request cmpl-3980bf1c8c61402b8d81ee56cc930801-0.
INFO 03-01 23:52:35 [logger.py:42] Received request cmpl-0877477c908b4d7d9157a98cdb4ea352-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:52:35 [async_llm.py:261] Added request cmpl-0877477c908b4d7d9157a98cdb4ea352-0.
INFO 03-01 23:52:36 [logger.py:42] Received request cmpl-db258ec7459d4c5c98149531aac2b3b7-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:52:36 [async_llm.py:261] Added request cmpl-db258ec7459d4c5c98149531aac2b3b7-0.
INFO 03-01 23:52:37 [logger.py:42] Received request cmpl-d12ce47512b24f3f96bb5ce2a84a208c-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:52:37 [async_llm.py:261] Added request cmpl-d12ce47512b24f3f96bb5ce2a84a208c-0.
INFO 03-01 23:52:38 [loggers.py:116] Engine 000: Avg prompt throughput: 27.9 tokens/s, Avg generation throughput: 4.5 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.6%
INFO 03-01 23:52:38 [logger.py:42] Received request cmpl-fd99439c5b1c43b68a4e7d155f810081-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:52:38 [async_llm.py:261] Added request cmpl-fd99439c5b1c43b68a4e7d155f810081-0.
INFO 03-01 23:52:39 [logger.py:42] Received request cmpl-06a72ac1891748a990c39d60d8a7b468-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:52:39 [async_llm.py:261] Added request cmpl-06a72ac1891748a990c39d60d8a7b468-0.
INFO 03-01 23:52:40 [logger.py:42] Received request cmpl-c3ba78cdb4a74bdd8bb8dcd67409ad74-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:52:40 [async_llm.py:261] Added request cmpl-c3ba78cdb4a74bdd8bb8dcd67409ad74-0.
INFO 03-01 23:52:41 [logger.py:42] Received request cmpl-a9430dc65142436893fc29cfaa368ae8-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:52:41 [async_llm.py:261] Added request cmpl-a9430dc65142436893fc29cfaa368ae8-0.
INFO 03-01 23:52:43 [logger.py:42] Received request cmpl-eb0b98e31dab4a859e67d3fa09353e16-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:52:43 [async_llm.py:261] Added request cmpl-eb0b98e31dab4a859e67d3fa09353e16-0.
INFO 03-01 23:52:44 [logger.py:42] Received request cmpl-735fbcad14c14761875b6bd2b6fe36f3-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:52:44 [async_llm.py:261] Added request cmpl-735fbcad14c14761875b6bd2b6fe36f3-0.
INFO 03-01 23:52:45 [logger.py:42] Received request cmpl-44b9b10707404d42bf0373e899ccd79c-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:52:45 [async_llm.py:261] Added request cmpl-44b9b10707404d42bf0373e899ccd79c-0.
INFO 03-01 23:52:46 [logger.py:42] Received request cmpl-778fbcdb4c9d40d89c6c740a777feee8-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:52:46 [async_llm.py:261] Added request cmpl-778fbcdb4c9d40d89c6c740a777feee8-0.
INFO 03-01 23:52:47 [logger.py:42] Received request cmpl-47119eb92c3745a99bcbdd558e1eac21-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:52:47 [async_llm.py:261] Added request cmpl-47119eb92c3745a99bcbdd558e1eac21-0.
INFO 03-01 23:52:48 [loggers.py:116] Engine 000: Avg prompt throughput: 27.9 tokens/s, Avg generation throughput: 4.5 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.6%
INFO 03-01 23:52:48 [logger.py:42] Received request cmpl-4b0dff02dd384ce4a0bbc2c8a6bea5a3-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:52:48 [async_llm.py:261] Added request cmpl-4b0dff02dd384ce4a0bbc2c8a6bea5a3-0.
INFO 03-01 23:52:50 [logger.py:42] Received request cmpl-9807d1f20f904c97bd81e057d406d1c4-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:52:50 [async_llm.py:261] Added request cmpl-9807d1f20f904c97bd81e057d406d1c4-0.
INFO 03-01 23:52:51 [logger.py:42] Received request cmpl-648e1298873d48939a7032168c5e9ac0-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:52:51 [async_llm.py:261] Added request cmpl-648e1298873d48939a7032168c5e9ac0-0.
INFO 03-01 23:52:52 [logger.py:42] Received request cmpl-41cbc047b74c4646b586adbeb2cd9114-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:52:52 [async_llm.py:261] Added request cmpl-41cbc047b74c4646b586adbeb2cd9114-0.
INFO 03-01 23:52:53 [logger.py:42] Received request cmpl-c0c9fe41a42e41938a55dcfe4b1630d5-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:52:53 [async_llm.py:261] Added request cmpl-c0c9fe41a42e41938a55dcfe4b1630d5-0.
INFO 03-01 23:52:54 [logger.py:42] Received request cmpl-ea1a7acdd8274c1d886b7bfd57b3cb15-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:52:54 [async_llm.py:261] Added request cmpl-ea1a7acdd8274c1d886b7bfd57b3cb15-0.
INFO 03-01 23:52:55 [logger.py:42] Received request cmpl-564e1a27e1714dd6aa1bab996d1a393d-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:52:55 [async_llm.py:261] Added request cmpl-564e1a27e1714dd6aa1bab996d1a393d-0.
INFO 03-01 23:52:56 [logger.py:42] Received request cmpl-1875cf2afdee40079a9b6fe310c26ece-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:52:56 [async_llm.py:261] Added request cmpl-1875cf2afdee40079a9b6fe310c26ece-0.
INFO 03-01 23:52:58 [loggers.py:116] Engine 000: Avg prompt throughput: 24.8 tokens/s, Avg generation throughput: 4.0 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.6%
INFO 03-01 23:52:58 [logger.py:42] Received request cmpl-947835b6cf6d453592f93d2637b7af44-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:52:58 [async_llm.py:261] Added request cmpl-947835b6cf6d453592f93d2637b7af44-0.
INFO 03-01 23:52:59 [logger.py:42] Received request cmpl-d430e8bdd8324faaa31617b4ed766240-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:52:59 [async_llm.py:261] Added request cmpl-d430e8bdd8324faaa31617b4ed766240-0.
INFO 03-01 23:53:00 [logger.py:42] Received request cmpl-db399be03972472c8e108b3707fe3d3a-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:53:00 [async_llm.py:261] Added request cmpl-db399be03972472c8e108b3707fe3d3a-0.
INFO 03-01 23:53:01 [logger.py:42] Received request cmpl-7c524a73293442e0a392c837bf536f07-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:53:01 [async_llm.py:261] Added request cmpl-7c524a73293442e0a392c837bf536f07-0.
INFO 03-01 23:53:02 [logger.py:42] Received request cmpl-21f2af8a5a8a4fd5ae2e7a5c57c6a124-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:53:02 [async_llm.py:261] Added request cmpl-21f2af8a5a8a4fd5ae2e7a5c57c6a124-0.
INFO 03-01 23:53:03 [logger.py:42] Received request cmpl-225b5a13bda24d53be4b47c935591861-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:53:03 [async_llm.py:261] Added request cmpl-225b5a13bda24d53be4b47c935591861-0.
INFO 03-01 23:53:05 [logger.py:42] Received request cmpl-766ef00d39004a1fbfdee855d7dd0cc5-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:53:05 [async_llm.py:261] Added request cmpl-766ef00d39004a1fbfdee855d7dd0cc5-0.
INFO 03-01 23:53:06 [logger.py:42] Received request cmpl-c7f128ff14e64b488a2042617fb5c47d-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:53:06 [async_llm.py:261] Added request cmpl-c7f128ff14e64b488a2042617fb5c47d-0.
INFO 03-01 23:53:07 [logger.py:42] Received request cmpl-cb03f49e4f6447928f31a0cbd686e6a1-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:53:07 [async_llm.py:261] Added request cmpl-cb03f49e4f6447928f31a0cbd686e6a1-0.
INFO 03-01 23:53:08 [loggers.py:116] Engine 000: Avg prompt throughput: 27.9 tokens/s, Avg generation throughput: 4.5 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.6%
INFO 03-01 23:53:08 [logger.py:42] Received request cmpl-fcb6269e34ac40498205feb6d4dedc09-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:53:08 [async_llm.py:261] Added request cmpl-fcb6269e34ac40498205feb6d4dedc09-0.
INFO 03-01 23:53:09 [logger.py:42] Received request cmpl-04727dc16d134225b60e956d870a6a7e-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:53:09 [async_llm.py:261] Added request cmpl-04727dc16d134225b60e956d870a6a7e-0.
INFO 03-01 23:53:10 [logger.py:42] Received request cmpl-4d84ff00cc284463b9e3c53f6152aafa-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:53:10 [async_llm.py:261] Added request cmpl-4d84ff00cc284463b9e3c53f6152aafa-0.
INFO 03-01 23:53:11 [logger.py:42] Received request cmpl-b0969d40634741c38c176103df5cca4a-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:53:11 [async_llm.py:261] Added request cmpl-b0969d40634741c38c176103df5cca4a-0.
INFO 03-01 23:53:13 [logger.py:42] Received request cmpl-a3ac76db5c0949d3b2ba6313b08fc903-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:53:13 [async_llm.py:261] Added request cmpl-a3ac76db5c0949d3b2ba6313b08fc903-0.
INFO 03-01 23:53:14 [logger.py:42] Received request cmpl-a7a130441e254da3847dce94b0a8b61d-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:53:14 [async_llm.py:261] Added request cmpl-a7a130441e254da3847dce94b0a8b61d-0.
INFO 03-01 23:53:15 [logger.py:42] Received request cmpl-fe40f3a72e044bc1b3995a1e9aac978e-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:53:15 [async_llm.py:261] Added request cmpl-fe40f3a72e044bc1b3995a1e9aac978e-0.
INFO 03-01 23:53:16 [logger.py:42] Received request cmpl-9c783835af214779a2af434132ff2066-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:53:16 [async_llm.py:261] Added request cmpl-9c783835af214779a2af434132ff2066-0.
INFO 03-01 23:53:17 [logger.py:42] Received request cmpl-ae7195ee5aba41cca2f7fb5a19bc3931-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:53:17 [async_llm.py:261] Added request cmpl-ae7195ee5aba41cca2f7fb5a19bc3931-0.
INFO 03-01 23:53:18 [loggers.py:116] Engine 000: Avg prompt throughput: 27.9 tokens/s, Avg generation throughput: 4.5 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.6%
INFO 03-01 23:53:18 [logger.py:42] Received request cmpl-b513dc511f804d87b7d78200b67f6b3f-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:53:18 [async_llm.py:261] Added request cmpl-b513dc511f804d87b7d78200b67f6b3f-0.
INFO 03-01 23:53:19 [logger.py:42] Received request cmpl-ad85cdd0bfae49df8db590bfee6f6b21-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:53:19 [async_llm.py:261] Added request cmpl-ad85cdd0bfae49df8db590bfee6f6b21-0.
INFO 03-01 23:53:21 [logger.py:42] Received request cmpl-b7b9db72a1b1425b91fcb4085ba673d4-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:53:21 [async_llm.py:261] Added request cmpl-b7b9db72a1b1425b91fcb4085ba673d4-0.
INFO 03-01 23:53:22 [logger.py:42] Received request cmpl-c5d9d795afcd4288ada610a45863afb0-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:53:22 [async_llm.py:261] Added request cmpl-c5d9d795afcd4288ada610a45863afb0-0.
INFO 03-01 23:53:23 [logger.py:42] Received request cmpl-a5fa8b663b0945a3bf893c916a6bcd1c-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:53:23 [async_llm.py:261] Added request cmpl-a5fa8b663b0945a3bf893c916a6bcd1c-0.
INFO 03-01 23:53:24 [logger.py:42] Received request cmpl-930743937b2740b18edc3ed87f603f46-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:53:24 [async_llm.py:261] Added request cmpl-930743937b2740b18edc3ed87f603f46-0.
INFO 03-01 23:53:25 [logger.py:42] Received request cmpl-7dcbf3a5814f4a9abf84238b05660fd3-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:53:25 [async_llm.py:261] Added request cmpl-7dcbf3a5814f4a9abf84238b05660fd3-0.
INFO 03-01 23:53:26 [logger.py:42] Received request cmpl-d28fc50cdff34d1eab8fecab1a8afbf5-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:53:26 [async_llm.py:261] Added request cmpl-d28fc50cdff34d1eab8fecab1a8afbf5-0.
INFO 03-01 23:53:28 [loggers.py:116] Engine 000: Avg prompt throughput: 24.8 tokens/s, Avg generation throughput: 4.0 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.6%
INFO 03-01 23:53:28 [logger.py:42] Received request cmpl-47180ae481d3439ab3a536fd24a92818-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:53:28 [async_llm.py:261] Added request cmpl-47180ae481d3439ab3a536fd24a92818-0.
INFO 03-01 23:53:29 [logger.py:42] Received request cmpl-04a391fdce2845d2a3e82af164e0c4d1-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:53:29 [async_llm.py:261] Added request cmpl-04a391fdce2845d2a3e82af164e0c4d1-0.
INFO 03-01 23:53:30 [logger.py:42] Received request cmpl-aa8bc0971eda484ca763dd3e5bb488fb-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:53:30 [async_llm.py:261] Added request cmpl-aa8bc0971eda484ca763dd3e5bb488fb-0.
INFO 03-01 23:53:31 [logger.py:42] Received request cmpl-c927dcf24773414daeb103f77201e586-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:53:31 [async_llm.py:261] Added request cmpl-c927dcf24773414daeb103f77201e586-0.
INFO 03-01 23:53:32 [logger.py:42] Received request cmpl-7a5801e7a78841db8d5ba137884b5cfa-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:53:32 [async_llm.py:261] Added request cmpl-7a5801e7a78841db8d5ba137884b5cfa-0.
INFO 03-01 23:53:33 [logger.py:42] Received request cmpl-87a2f91b89234446a8ba553a77f837dc-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:53:33 [async_llm.py:261] Added request cmpl-87a2f91b89234446a8ba553a77f837dc-0.
INFO 03-01 23:53:34 [logger.py:42] Received request cmpl-68d96c8c41ed4eec91c4155d213680e1-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:53:34 [async_llm.py:261] Added request cmpl-68d96c8c41ed4eec91c4155d213680e1-0.
INFO 03-01 23:53:36 [logger.py:42] Received request cmpl-18b88b224cb84f18bf81909ece0a1f90-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:53:36 [async_llm.py:261] Added request cmpl-18b88b224cb84f18bf81909ece0a1f90-0.
INFO 03-01 23:53:37 [logger.py:42] Received request cmpl-811d65dc748c4f12b9a513942da8688a-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:53:37 [async_llm.py:261] Added request cmpl-811d65dc748c4f12b9a513942da8688a-0.
INFO 03-01 23:53:38 [loggers.py:116] Engine 000: Avg prompt throughput: 27.9 tokens/s, Avg generation throughput: 4.5 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.6%
INFO 03-01 23:53:38 [logger.py:42] Received request cmpl-9d356aee15494844bb491f67674b4862-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:53:38 [async_llm.py:261] Added request cmpl-9d356aee15494844bb491f67674b4862-0.
INFO 03-01 23:53:39 [logger.py:42] Received request cmpl-43272bbebc354f32bf9350493295f866-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:53:39 [async_llm.py:261] Added request cmpl-43272bbebc354f32bf9350493295f866-0.
INFO 03-01 23:53:40 [logger.py:42] Received request cmpl-d15fc581bc8c475ca9a2dbd5efedaabb-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:53:40 [async_llm.py:261] Added request cmpl-d15fc581bc8c475ca9a2dbd5efedaabb-0.
INFO 03-01 23:53:41 [logger.py:42] Received request cmpl-899eb7d4479a4e13aafe5296dcafdc1d-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:53:41 [async_llm.py:261] Added request cmpl-899eb7d4479a4e13aafe5296dcafdc1d-0.
INFO 03-01 23:53:43 [logger.py:42] Received request cmpl-04a30a6dfaf34092bf8044b421cfc8d3-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:53:43 [async_llm.py:261] Added request cmpl-04a30a6dfaf34092bf8044b421cfc8d3-0.
INFO 03-01 23:53:44 [logger.py:42] Received request cmpl-a3641ee829e54faca8cf68012ee08a0c-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:53:44 [async_llm.py:261] Added request cmpl-a3641ee829e54faca8cf68012ee08a0c-0.
INFO 03-01 23:53:45 [logger.py:42] Received request cmpl-81f202eda9af4f1cbdb846e1a19d4643-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:53:45 [async_llm.py:261] Added request cmpl-81f202eda9af4f1cbdb846e1a19d4643-0.
INFO 03-01 23:53:46 [logger.py:42] Received request cmpl-bdbddb82a91b4e2c8df118a94fb6b207-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:53:46 [async_llm.py:261] Added request cmpl-bdbddb82a91b4e2c8df118a94fb6b207-0.
INFO 03-01 23:53:47 [logger.py:42] Received request cmpl-34c0743340184404b8c0a1eebb1012c3-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:53:47 [async_llm.py:261] Added request cmpl-34c0743340184404b8c0a1eebb1012c3-0.
INFO 03-01 23:53:48 [loggers.py:116] Engine 000: Avg prompt throughput: 27.9 tokens/s, Avg generation throughput: 4.5 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.6%
INFO 03-01 23:53:48 [logger.py:42] Received request cmpl-1aaf8dcf7e1845e1a8815d4e03abe6f6-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:53:48 [async_llm.py:261] Added request cmpl-1aaf8dcf7e1845e1a8815d4e03abe6f6-0.
INFO 03-01 23:53:49 [logger.py:42] Received request cmpl-acd006f847834b4c8a0c1566d1b5a386-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:53:49 [async_llm.py:261] Added request cmpl-acd006f847834b4c8a0c1566d1b5a386-0.
INFO 03-01 23:53:51 [logger.py:42] Received request cmpl-4cb0c8c8ac2b495891bc6c1cf7aa013b-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:53:51 [async_llm.py:261] Added request cmpl-4cb0c8c8ac2b495891bc6c1cf7aa013b-0.
INFO 03-01 23:53:52 [logger.py:42] Received request cmpl-f03685abb4bf46dfa88e19c03012c2eb-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:53:52 [async_llm.py:261] Added request cmpl-f03685abb4bf46dfa88e19c03012c2eb-0.
INFO 03-01 23:53:53 [logger.py:42] Received request cmpl-848cf2aeded84496a8baf246ab7da260-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:53:53 [async_llm.py:261] Added request cmpl-848cf2aeded84496a8baf246ab7da260-0.
INFO 03-01 23:53:54 [logger.py:42] Received request cmpl-c89ffa627a8f4584b8be1b4f8c567707-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:53:54 [async_llm.py:261] Added request cmpl-c89ffa627a8f4584b8be1b4f8c567707-0.
INFO 03-01 23:53:55 [logger.py:42] Received request cmpl-270f22023b814e3eb201df63e8cf1a56-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:53:55 [async_llm.py:261] Added request cmpl-270f22023b814e3eb201df63e8cf1a56-0.
INFO 03-01 23:53:56 [logger.py:42] Received request cmpl-348b933ebc6f40cabea62e309487a74a-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:53:56 [async_llm.py:261] Added request cmpl-348b933ebc6f40cabea62e309487a74a-0.
INFO 03-01 23:53:58 [loggers.py:116] Engine 000: Avg prompt throughput: 24.8 tokens/s, Avg generation throughput: 4.0 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.6%
INFO 03-01 23:53:58 [logger.py:42] Received request cmpl-402ff38f04d54b7fa3128ab4a944b1d1-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:53:58 [async_llm.py:261] Added request cmpl-402ff38f04d54b7fa3128ab4a944b1d1-0.
INFO 03-01 23:53:59 [logger.py:42] Received request cmpl-e7ec5a1331cb49efb23898cd34da1eb8-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:53:59 [async_llm.py:261] Added request cmpl-e7ec5a1331cb49efb23898cd34da1eb8-0.
INFO 03-01 23:54:00 [logger.py:42] Received request cmpl-bc364cd774654e92b30c6e8b326bcaa4-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:54:00 [async_llm.py:261] Added request cmpl-bc364cd774654e92b30c6e8b326bcaa4-0.
INFO 03-01 23:54:01 [logger.py:42] Received request cmpl-fc8ed8e228e945ac8e48fcbfe2dfbc3e-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:54:01 [async_llm.py:261] Added request cmpl-fc8ed8e228e945ac8e48fcbfe2dfbc3e-0.
INFO 03-01 23:54:02 [logger.py:42] Received request cmpl-8bdcdba408304454bb6e72b28029b787-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:54:02 [async_llm.py:261] Added request cmpl-8bdcdba408304454bb6e72b28029b787-0.
INFO 03-01 23:54:03 [logger.py:42] Received request cmpl-e9c8216277134847960e0b319e034d99-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:54:03 [async_llm.py:261] Added request cmpl-e9c8216277134847960e0b319e034d99-0.
INFO 03-01 23:54:04 [logger.py:42] Received request cmpl-8c4c0b8dbad342a084fb10140c177723-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:54:05 [async_llm.py:261] Added request cmpl-8c4c0b8dbad342a084fb10140c177723-0.
INFO 03-01 23:54:06 [logger.py:42] Received request cmpl-a14e8e9988464a4ebeb0409e29bbfbe1-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:54:06 [async_llm.py:261] Added request cmpl-a14e8e9988464a4ebeb0409e29bbfbe1-0.
INFO 03-01 23:54:07 [logger.py:42] Received request cmpl-255e001dbbd640438297bc3461651d8d-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:54:07 [async_llm.py:261] Added request cmpl-255e001dbbd640438297bc3461651d8d-0.
INFO 03-01 23:54:08 [loggers.py:116] Engine 000: Avg prompt throughput: 27.9 tokens/s, Avg generation throughput: 4.5 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.6%
INFO 03-01 23:54:08 [logger.py:42] Received request cmpl-6da7a1091cb64f748d99ff4bb0c61ab0-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:54:08 [async_llm.py:261] Added request cmpl-6da7a1091cb64f748d99ff4bb0c61ab0-0.
INFO 03-01 23:54:09 [logger.py:42] Received request cmpl-4eeb26498a6a4c3489d7fc6001f26503-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:54:09 [async_llm.py:261] Added request cmpl-4eeb26498a6a4c3489d7fc6001f26503-0.
INFO 03-01 23:54:10 [logger.py:42] Received request cmpl-7f2ac2666bb1420f9a44146e5783c7c1-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:54:10 [async_llm.py:261] Added request cmpl-7f2ac2666bb1420f9a44146e5783c7c1-0.
INFO 03-01 23:54:11 [logger.py:42] Received request cmpl-07b30684746e48b38cb0ca5039a41a0a-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:54:11 [async_llm.py:261] Added request cmpl-07b30684746e48b38cb0ca5039a41a0a-0.
INFO 03-01 23:54:13 [logger.py:42] Received request cmpl-51c09c1f0358491297817cb0d164ccff-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:54:13 [async_llm.py:261] Added request cmpl-51c09c1f0358491297817cb0d164ccff-0.
INFO 03-01 23:54:14 [logger.py:42] Received request cmpl-93d0e8ef81fb4337bf9f413fafecbd65-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:54:14 [async_llm.py:261] Added request cmpl-93d0e8ef81fb4337bf9f413fafecbd65-0.
INFO 03-01 23:54:15 [logger.py:42] Received request cmpl-b385e57da89a433eb5cab19104af6afd-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:54:15 [async_llm.py:261] Added request cmpl-b385e57da89a433eb5cab19104af6afd-0.
INFO 03-01 23:54:16 [logger.py:42] Received request cmpl-5d7f08cf1b2646f1b4b96c1957618500-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:54:16 [async_llm.py:261] Added request cmpl-5d7f08cf1b2646f1b4b96c1957618500-0.
INFO 03-01 23:54:17 [logger.py:42] Received request cmpl-fd3411d223d24e338682f0145dca1109-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:54:17 [async_llm.py:261] Added request cmpl-fd3411d223d24e338682f0145dca1109-0.
INFO 03-01 23:54:18 [loggers.py:116] Engine 000: Avg prompt throughput: 27.9 tokens/s, Avg generation throughput: 4.5 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.6%
INFO 03-01 23:54:18 [logger.py:42] Received request cmpl-2c0f4a9c279f4c419ae0314327c44643-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:54:18 [async_llm.py:261] Added request cmpl-2c0f4a9c279f4c419ae0314327c44643-0.
INFO 03-01 23:54:20 [logger.py:42] Received request cmpl-e5bfe5c0f6a04e03b408f6857b17381c-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:54:20 [async_llm.py:261] Added request cmpl-e5bfe5c0f6a04e03b408f6857b17381c-0.
INFO 03-01 23:54:21 [logger.py:42] Received request cmpl-6526f90a49214b63a508e73797792f3b-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:54:21 [async_llm.py:261] Added request cmpl-6526f90a49214b63a508e73797792f3b-0.
INFO 03-01 23:54:22 [logger.py:42] Received request cmpl-95666f3662d84f39b1d58e9e51a81340-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:54:22 [async_llm.py:261] Added request cmpl-95666f3662d84f39b1d58e9e51a81340-0.
INFO 03-01 23:54:23 [logger.py:42] Received request cmpl-d95b6dac0c964a35826060eafd5fa180-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:54:23 [async_llm.py:261] Added request cmpl-d95b6dac0c964a35826060eafd5fa180-0.
INFO 03-01 23:54:24 [logger.py:42] Received request cmpl-be2e99e5d6a94f71bc58cc70c490d2e6-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:54:24 [async_llm.py:261] Added request cmpl-be2e99e5d6a94f71bc58cc70c490d2e6-0.
INFO 03-01 23:54:25 [logger.py:42] Received request cmpl-b091a92a6c4a457c929eeffe63cbb4b7-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:54:25 [async_llm.py:261] Added request cmpl-b091a92a6c4a457c929eeffe63cbb4b7-0.
INFO 03-01 23:54:26 [logger.py:42] Received request cmpl-cbcfd0aca3fe48bfb6018cc68bd18e83-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:54:26 [async_llm.py:261] Added request cmpl-cbcfd0aca3fe48bfb6018cc68bd18e83-0.
INFO 03-01 23:54:28 [loggers.py:116] Engine 000: Avg prompt throughput: 24.8 tokens/s, Avg generation throughput: 4.0 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.6%
INFO 03-01 23:54:28 [logger.py:42] Received request cmpl-1d684792dc3546c185fb9eda85fdc958-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:54:28 [async_llm.py:261] Added request cmpl-1d684792dc3546c185fb9eda85fdc958-0.
INFO 03-01 23:54:29 [logger.py:42] Received request cmpl-e43417212b3d441bab24b350cdede719-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:54:29 [async_llm.py:261] Added request cmpl-e43417212b3d441bab24b350cdede719-0.
INFO 03-01 23:54:30 [logger.py:42] Received request cmpl-375fdefd3dc447269fb4a1208aaa3853-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:54:30 [async_llm.py:261] Added request cmpl-375fdefd3dc447269fb4a1208aaa3853-0.
INFO 03-01 23:54:31 [logger.py:42] Received request cmpl-fe630dae35464dc09fb2f61c946fc0f0-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:54:31 [async_llm.py:261] Added request cmpl-fe630dae35464dc09fb2f61c946fc0f0-0.
INFO 03-01 23:54:32 [logger.py:42] Received request cmpl-8fad7ceb0b674c01b0d877a7a7174ad8-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:54:32 [async_llm.py:261] Added request cmpl-8fad7ceb0b674c01b0d877a7a7174ad8-0.
INFO 03-01 23:54:33 [logger.py:42] Received request cmpl-9d4ca60ba8fa4259a10bee4542d68ecc-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:54:33 [async_llm.py:261] Added request cmpl-9d4ca60ba8fa4259a10bee4542d68ecc-0.
INFO 03-01 23:54:35 [logger.py:42] Received request cmpl-e12ae57e774846b9af659d388026101f-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:54:35 [async_llm.py:261] Added request cmpl-e12ae57e774846b9af659d388026101f-0.
INFO 03-01 23:54:36 [logger.py:42] Received request cmpl-3fc6ba9b2515447a8a5a7c0b091eb59f-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:54:36 [async_llm.py:261] Added request cmpl-3fc6ba9b2515447a8a5a7c0b091eb59f-0.
INFO 03-01 23:54:37 [logger.py:42] Received request cmpl-bb4f1b6b57b54c28bc03f8f8e6a038ae-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:54:37 [async_llm.py:261] Added request cmpl-bb4f1b6b57b54c28bc03f8f8e6a038ae-0.
INFO 03-01 23:54:38 [loggers.py:116] Engine 000: Avg prompt throughput: 27.9 tokens/s, Avg generation throughput: 4.5 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.6%
INFO 03-01 23:54:38 [logger.py:42] Received request cmpl-40d7757e3fbe4595a066fe1d8c4d8e02-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:54:38 [async_llm.py:261] Added request cmpl-40d7757e3fbe4595a066fe1d8c4d8e02-0.
INFO 03-01 23:54:39 [logger.py:42] Received request cmpl-bb4d305e15ce4f46b7454db738d3e1f3-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:54:39 [async_llm.py:261] Added request cmpl-bb4d305e15ce4f46b7454db738d3e1f3-0.
INFO 03-01 23:54:40 [logger.py:42] Received request cmpl-0017d9ff81a34f659cec52cca1b0c86a-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:54:40 [async_llm.py:261] Added request cmpl-0017d9ff81a34f659cec52cca1b0c86a-0.
INFO 03-01 23:54:41 [logger.py:42] Received request cmpl-4e77a9db1d6849e4ad4f0056218018fc-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:54:41 [async_llm.py:261] Added request cmpl-4e77a9db1d6849e4ad4f0056218018fc-0.
INFO 03-01 23:54:43 [logger.py:42] Received request cmpl-4b3e5f2083c748528d14ecbf2ccd2d54-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:54:43 [async_llm.py:261] Added request cmpl-4b3e5f2083c748528d14ecbf2ccd2d54-0.
INFO 03-01 23:54:44 [logger.py:42] Received request cmpl-bcacbfb679fa4cce9654c54fdebd7438-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:54:44 [async_llm.py:261] Added request cmpl-bcacbfb679fa4cce9654c54fdebd7438-0.
INFO 03-01 23:54:45 [logger.py:42] Received request cmpl-a1418d432c1e4fabad2e957c0e5fb960-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:54:45 [async_llm.py:261] Added request cmpl-a1418d432c1e4fabad2e957c0e5fb960-0.
INFO 03-01 23:54:46 [logger.py:42] Received request cmpl-bd18ae475f824d049d0e16a0af3c318b-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:54:46 [async_llm.py:261] Added request cmpl-bd18ae475f824d049d0e16a0af3c318b-0.
INFO 03-01 23:54:47 [logger.py:42] Received request cmpl-e401fa6b540849c1a374ea2fb2d4a06f-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:54:47 [async_llm.py:261] Added request cmpl-e401fa6b540849c1a374ea2fb2d4a06f-0.
INFO 03-01 23:54:48 [loggers.py:116] Engine 000: Avg prompt throughput: 27.9 tokens/s, Avg generation throughput: 4.5 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.6%
INFO 03-01 23:54:48 [logger.py:42] Received request cmpl-57c0a50111634e4c942eeb3b3d4a62e9-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:54:48 [async_llm.py:261] Added request cmpl-57c0a50111634e4c942eeb3b3d4a62e9-0.
INFO 03-01 23:54:50 [logger.py:42] Received request cmpl-e7c7a0695a0e4672a1d92dfd0276f630-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:54:50 [async_llm.py:261] Added request cmpl-e7c7a0695a0e4672a1d92dfd0276f630-0.
INFO 03-01 23:54:51 [logger.py:42] Received request cmpl-7bfb8ae4e172439496c66ca15fd38d85-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:54:51 [async_llm.py:261] Added request cmpl-7bfb8ae4e172439496c66ca15fd38d85-0.
INFO 03-01 23:54:52 [logger.py:42] Received request cmpl-5ae0e3c5f74d46a7af207c1cb6317381-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:54:52 [async_llm.py:261] Added request cmpl-5ae0e3c5f74d46a7af207c1cb6317381-0.
INFO 03-01 23:54:53 [logger.py:42] Received request cmpl-1156096b7d964d77a3c7020e987be064-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:54:53 [async_llm.py:261] Added request cmpl-1156096b7d964d77a3c7020e987be064-0.
INFO 03-01 23:54:54 [logger.py:42] Received request cmpl-6d7b33d519804d3bb455330cfa0ed93d-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:54:54 [async_llm.py:261] Added request cmpl-6d7b33d519804d3bb455330cfa0ed93d-0.
INFO 03-01 23:54:55 [logger.py:42] Received request cmpl-30630f9858714f5a82f252d280f0e470-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:54:55 [async_llm.py:261] Added request cmpl-30630f9858714f5a82f252d280f0e470-0.
INFO 03-01 23:54:56 [logger.py:42] Received request cmpl-013011d253444d10b0a8747ae811272c-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:54:56 [async_llm.py:261] Added request cmpl-013011d253444d10b0a8747ae811272c-0.
INFO 03-01 23:54:58 [loggers.py:116] Engine 000: Avg prompt throughput: 24.8 tokens/s, Avg generation throughput: 4.0 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.6%
INFO 03-01 23:54:58 [logger.py:42] Received request cmpl-dfbc0c82c75642b5b44d8a9152eef14d-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:54:58 [async_llm.py:261] Added request cmpl-dfbc0c82c75642b5b44d8a9152eef14d-0.
INFO 03-01 23:54:59 [logger.py:42] Received request cmpl-d26f0079925e4c059cea235074b01ae9-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:54:59 [async_llm.py:261] Added request cmpl-d26f0079925e4c059cea235074b01ae9-0.
INFO 03-01 23:55:00 [logger.py:42] Received request cmpl-76efa05af6a04e9aacd18fd013905939-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:55:00 [async_llm.py:261] Added request cmpl-76efa05af6a04e9aacd18fd013905939-0.
INFO 03-01 23:55:01 [logger.py:42] Received request cmpl-cccb06febc8b49f49f2628540d888f21-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:55:01 [async_llm.py:261] Added request cmpl-cccb06febc8b49f49f2628540d888f21-0.
INFO 03-01 23:55:02 [logger.py:42] Received request cmpl-4a2bcd871a69421a961040608285695a-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:55:02 [async_llm.py:261] Added request cmpl-4a2bcd871a69421a961040608285695a-0.
INFO 03-01 23:55:03 [logger.py:42] Received request cmpl-68a998e94ea143d98660cb289d8c494d-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:55:03 [async_llm.py:261] Added request cmpl-68a998e94ea143d98660cb289d8c494d-0.
INFO 03-01 23:55:05 [logger.py:42] Received request cmpl-258479f44c974782a6c5a4de0342634e-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:55:05 [async_llm.py:261] Added request cmpl-258479f44c974782a6c5a4de0342634e-0.
INFO 03-01 23:55:06 [logger.py:42] Received request cmpl-917d4e895fde41138230f3be296cbb1b-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:55:06 [async_llm.py:261] Added request cmpl-917d4e895fde41138230f3be296cbb1b-0.
INFO 03-01 23:55:07 [logger.py:42] Received request cmpl-9cac3a07e83f4cf8be67f43f2eb3059e-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:55:07 [async_llm.py:261] Added request cmpl-9cac3a07e83f4cf8be67f43f2eb3059e-0.
INFO 03-01 23:55:08 [loggers.py:116] Engine 000: Avg prompt throughput: 27.9 tokens/s, Avg generation throughput: 4.5 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.6%
INFO 03-01 23:55:08 [logger.py:42] Received request cmpl-4b6f41aca252489d84bbcf5a250456a1-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:55:08 [async_llm.py:261] Added request cmpl-4b6f41aca252489d84bbcf5a250456a1-0.
INFO 03-01 23:55:09 [logger.py:42] Received request cmpl-855849b96fa24f318cf872edd52fbcd2-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:55:09 [async_llm.py:261] Added request cmpl-855849b96fa24f318cf872edd52fbcd2-0.
INFO 03-01 23:55:10 [logger.py:42] Received request cmpl-33395a88b7db4db08a7850dc5885836c-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:55:10 [async_llm.py:261] Added request cmpl-33395a88b7db4db08a7850dc5885836c-0.
INFO 03-01 23:55:11 [logger.py:42] Received request cmpl-5a5b930da296494186ff310f43c43b2d-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:55:11 [async_llm.py:261] Added request cmpl-5a5b930da296494186ff310f43c43b2d-0.
INFO 03-01 23:55:13 [logger.py:42] Received request cmpl-680bec98d9cd4ff3bb8f7e59210b65ca-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:55:13 [async_llm.py:261] Added request cmpl-680bec98d9cd4ff3bb8f7e59210b65ca-0.
INFO 03-01 23:55:14 [logger.py:42] Received request cmpl-cd019758a2974596841354796ffda227-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:55:14 [async_llm.py:261] Added request cmpl-cd019758a2974596841354796ffda227-0.
INFO 03-01 23:55:15 [logger.py:42] Received request cmpl-70771c92d0854aa79de93bf4bcabf7e0-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:55:15 [async_llm.py:261] Added request cmpl-70771c92d0854aa79de93bf4bcabf7e0-0.
INFO 03-01 23:55:16 [logger.py:42] Received request cmpl-0eeedb74fc954a2dbdca4f6e6829e8c4-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:55:16 [async_llm.py:261] Added request cmpl-0eeedb74fc954a2dbdca4f6e6829e8c4-0.
INFO 03-01 23:55:17 [logger.py:42] Received request cmpl-315a56c7407e4ba89075788cbbcb9c36-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:55:17 [async_llm.py:261] Added request cmpl-315a56c7407e4ba89075788cbbcb9c36-0.
INFO 03-01 23:55:18 [loggers.py:116] Engine 000: Avg prompt throughput: 27.9 tokens/s, Avg generation throughput: 4.5 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.6%
INFO 03-01 23:55:18 [logger.py:42] Received request cmpl-9fe740e660d8494a98842e6c6fa506cd-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:55:18 [async_llm.py:261] Added request cmpl-9fe740e660d8494a98842e6c6fa506cd-0.
INFO 03-01 23:55:20 [logger.py:42] Received request cmpl-4b148dc878bb4ee09fa3b8d170f96f84-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:55:20 [async_llm.py:261] Added request cmpl-4b148dc878bb4ee09fa3b8d170f96f84-0.
INFO 03-01 23:55:21 [logger.py:42] Received request cmpl-626880dcf22f4853bcdbe42d15e6d707-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:55:21 [async_llm.py:261] Added request cmpl-626880dcf22f4853bcdbe42d15e6d707-0.
INFO 03-01 23:55:22 [logger.py:42] Received request cmpl-be6dc6c7c3a14b3ab04e28b77b2b1e40-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:55:22 [async_llm.py:261] Added request cmpl-be6dc6c7c3a14b3ab04e28b77b2b1e40-0.
INFO 03-01 23:55:23 [logger.py:42] Received request cmpl-23929da6115f4fbcacfeea154259df91-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:55:23 [async_llm.py:261] Added request cmpl-23929da6115f4fbcacfeea154259df91-0.
INFO 03-01 23:55:24 [logger.py:42] Received request cmpl-6a0d1352d2514885aad1f40f661f51ac-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:55:24 [async_llm.py:261] Added request cmpl-6a0d1352d2514885aad1f40f661f51ac-0.
INFO 03-01 23:55:25 [logger.py:42] Received request cmpl-e2f199cdc5714ff587f5b2326c9f6de6-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:55:25 [async_llm.py:261] Added request cmpl-e2f199cdc5714ff587f5b2326c9f6de6-0.
INFO 03-01 23:55:26 [logger.py:42] Received request cmpl-28cfa04d061e4847ad4f616593af81b7-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:55:26 [async_llm.py:261] Added request cmpl-28cfa04d061e4847ad4f616593af81b7-0.
INFO 03-01 23:55:28 [loggers.py:116] Engine 000: Avg prompt throughput: 24.8 tokens/s, Avg generation throughput: 4.0 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.6%
INFO 03-01 23:55:28 [logger.py:42] Received request cmpl-826c0029b3374492996a7b07341ef4b4-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:55:28 [async_llm.py:261] Added request cmpl-826c0029b3374492996a7b07341ef4b4-0.
INFO 03-01 23:55:29 [logger.py:42] Received request cmpl-dbdd08a714184347a8149ea5afa500cb-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:55:29 [async_llm.py:261] Added request cmpl-dbdd08a714184347a8149ea5afa500cb-0.
INFO 03-01 23:55:30 [logger.py:42] Received request cmpl-1299622f6b6448658feb18fcc49b8580-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:55:30 [async_llm.py:261] Added request cmpl-1299622f6b6448658feb18fcc49b8580-0.
INFO 03-01 23:55:31 [logger.py:42] Received request cmpl-354ec5d0eea54660916576ca5b472d5b-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:55:31 [async_llm.py:261] Added request cmpl-354ec5d0eea54660916576ca5b472d5b-0.
INFO 03-01 23:55:32 [logger.py:42] Received request cmpl-18859cddca5c4f6b8338011d415f03cf-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:55:32 [async_llm.py:261] Added request cmpl-18859cddca5c4f6b8338011d415f03cf-0.
INFO 03-01 23:55:33 [logger.py:42] Received request cmpl-ed562b4720c1455cb928185fa83ffd75-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:55:33 [async_llm.py:261] Added request cmpl-ed562b4720c1455cb928185fa83ffd75-0.
INFO 03-01 23:55:35 [logger.py:42] Received request cmpl-10eaaa831d7843eda8334813637c61d0-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:55:35 [async_llm.py:261] Added request cmpl-10eaaa831d7843eda8334813637c61d0-0.
INFO 03-01 23:55:36 [logger.py:42] Received request cmpl-6c585671862346b6ac8292b493fa7798-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:55:36 [async_llm.py:261] Added request cmpl-6c585671862346b6ac8292b493fa7798-0.
INFO 03-01 23:55:37 [logger.py:42] Received request cmpl-5504d58d79424632bdaeb703e110b43d-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:55:37 [async_llm.py:261] Added request cmpl-5504d58d79424632bdaeb703e110b43d-0.
INFO 03-01 23:55:38 [loggers.py:116] Engine 000: Avg prompt throughput: 27.9 tokens/s, Avg generation throughput: 4.5 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.6%
INFO 03-01 23:55:38 [logger.py:42] Received request cmpl-dd5b93036d9d4d6aa7d86ff1cebea7ba-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:55:38 [async_llm.py:261] Added request cmpl-dd5b93036d9d4d6aa7d86ff1cebea7ba-0.
INFO 03-01 23:55:39 [logger.py:42] Received request cmpl-f2d3b1baf649434d85d0ec6f3d23ae27-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:55:39 [async_llm.py:261] Added request cmpl-f2d3b1baf649434d85d0ec6f3d23ae27-0.
INFO 03-01 23:55:40 [logger.py:42] Received request cmpl-3b79f26039ee45709589625afcfa4166-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:55:40 [async_llm.py:261] Added request cmpl-3b79f26039ee45709589625afcfa4166-0.
INFO 03-01 23:55:41 [logger.py:42] Received request cmpl-ef62edafcc2842a282ddad15cb1d5393-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:55:41 [async_llm.py:261] Added request cmpl-ef62edafcc2842a282ddad15cb1d5393-0.
INFO 03-01 23:55:43 [logger.py:42] Received request cmpl-3c02316f409347ab893d55fcd0a07d27-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:55:43 [async_llm.py:261] Added request cmpl-3c02316f409347ab893d55fcd0a07d27-0.
INFO 03-01 23:55:44 [logger.py:42] Received request cmpl-ea7efd9eaa62403eb84c16722c3d85bc-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:55:44 [async_llm.py:261] Added request cmpl-ea7efd9eaa62403eb84c16722c3d85bc-0.
INFO 03-01 23:55:45 [logger.py:42] Received request cmpl-47b152e86274403686150df9a7f81262-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:55:45 [async_llm.py:261] Added request cmpl-47b152e86274403686150df9a7f81262-0.
INFO 03-01 23:55:46 [logger.py:42] Received request cmpl-d802b45090e3483ca89cb8c84e187d35-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:55:46 [async_llm.py:261] Added request cmpl-d802b45090e3483ca89cb8c84e187d35-0.
INFO 03-01 23:55:47 [logger.py:42] Received request cmpl-782396e3291c472c87b4c65a9de07bce-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:55:47 [async_llm.py:261] Added request cmpl-782396e3291c472c87b4c65a9de07bce-0.
INFO 03-01 23:55:48 [loggers.py:116] Engine 000: Avg prompt throughput: 27.9 tokens/s, Avg generation throughput: 4.5 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.6%
INFO 03-01 23:55:48 [logger.py:42] Received request cmpl-f581daa594dd43bc849ee9207eebf74d-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:55:48 [async_llm.py:261] Added request cmpl-f581daa594dd43bc849ee9207eebf74d-0.
INFO 03-01 23:55:50 [logger.py:42] Received request cmpl-89af768a80c74f66ae848db0cdf9ee4c-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:55:50 [async_llm.py:261] Added request cmpl-89af768a80c74f66ae848db0cdf9ee4c-0.
INFO 03-01 23:55:51 [logger.py:42] Received request cmpl-509929415ae34deaa4eeb8de03c8c5b1-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:55:51 [async_llm.py:261] Added request cmpl-509929415ae34deaa4eeb8de03c8c5b1-0.
INFO 03-01 23:55:52 [logger.py:42] Received request cmpl-0539d5653dfe4c42a0e15c1ef2438778-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:55:52 [async_llm.py:261] Added request cmpl-0539d5653dfe4c42a0e15c1ef2438778-0.
INFO 03-01 23:55:53 [logger.py:42] Received request cmpl-d45ef12b8b914c569386f1a4785385be-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:55:53 [async_llm.py:261] Added request cmpl-d45ef12b8b914c569386f1a4785385be-0.
INFO 03-01 23:55:54 [logger.py:42] Received request cmpl-842cab06e97d4524b3b4061b6dc47d3f-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:55:54 [async_llm.py:261] Added request cmpl-842cab06e97d4524b3b4061b6dc47d3f-0.
INFO 03-01 23:55:55 [logger.py:42] Received request cmpl-7c6ccfc1e4d3492fb89af1a73898c36f-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:55:55 [async_llm.py:261] Added request cmpl-7c6ccfc1e4d3492fb89af1a73898c36f-0.
INFO 03-01 23:55:56 [logger.py:42] Received request cmpl-0029b3d635da47389923222aa34a2220-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:55:56 [async_llm.py:261] Added request cmpl-0029b3d635da47389923222aa34a2220-0.
INFO 03-01 23:55:58 [loggers.py:116] Engine 000: Avg prompt throughput: 24.8 tokens/s, Avg generation throughput: 4.0 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.6%
INFO 03-01 23:55:58 [logger.py:42] Received request cmpl-75c18753f06442a39cad76706d21edcc-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:55:58 [async_llm.py:261] Added request cmpl-75c18753f06442a39cad76706d21edcc-0.
INFO 03-01 23:55:59 [logger.py:42] Received request cmpl-ee01468b81f543c5a8bd947bf1bddd7c-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:55:59 [async_llm.py:261] Added request cmpl-ee01468b81f543c5a8bd947bf1bddd7c-0.
INFO 03-01 23:56:00 [logger.py:42] Received request cmpl-ef6dea568dbc4470aa84042447854867-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:56:00 [async_llm.py:261] Added request cmpl-ef6dea568dbc4470aa84042447854867-0.
INFO 03-01 23:56:01 [logger.py:42] Received request cmpl-e067841585d84d1fbcb8f9c856662262-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:56:01 [async_llm.py:261] Added request cmpl-e067841585d84d1fbcb8f9c856662262-0.
INFO 03-01 23:56:02 [logger.py:42] Received request cmpl-16d10e4ef1244c4aba229ccc3a63ea43-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:56:02 [async_llm.py:261] Added request cmpl-16d10e4ef1244c4aba229ccc3a63ea43-0.
INFO 03-01 23:56:03 [logger.py:42] Received request cmpl-bc23e85f4a664d6a8cd9d8946da7c45c-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:56:03 [async_llm.py:261] Added request cmpl-bc23e85f4a664d6a8cd9d8946da7c45c-0.
INFO 03-01 23:56:05 [logger.py:42] Received request cmpl-95bbe4d4ebb24bde889d6df5e83caae2-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:56:05 [async_llm.py:261] Added request cmpl-95bbe4d4ebb24bde889d6df5e83caae2-0.
INFO 03-01 23:56:06 [logger.py:42] Received request cmpl-ae7f33b344a34d5fbc1c14074dbf9a91-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:56:06 [async_llm.py:261] Added request cmpl-ae7f33b344a34d5fbc1c14074dbf9a91-0.
INFO 03-01 23:56:07 [logger.py:42] Received request cmpl-8fd51114f99c4e459e4ae790a3bdc11e-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:56:07 [async_llm.py:261] Added request cmpl-8fd51114f99c4e459e4ae790a3bdc11e-0.
INFO 03-01 23:56:08 [loggers.py:116] Engine 000: Avg prompt throughput: 27.9 tokens/s, Avg generation throughput: 4.5 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.6%
INFO 03-01 23:56:08 [logger.py:42] Received request cmpl-090a0689c77f4328b1582741dcc00a03-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:56:08 [async_llm.py:261] Added request cmpl-090a0689c77f4328b1582741dcc00a03-0.
INFO 03-01 23:56:09 [logger.py:42] Received request cmpl-a655b82761014490b2b1cf5e46024f9f-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:56:09 [async_llm.py:261] Added request cmpl-a655b82761014490b2b1cf5e46024f9f-0.
INFO 03-01 23:56:10 [logger.py:42] Received request cmpl-a29992776f0b4a78ad58fbacd90b452e-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:56:10 [async_llm.py:261] Added request cmpl-a29992776f0b4a78ad58fbacd90b452e-0.
INFO 03-01 23:56:11 [logger.py:42] Received request cmpl-c653d20781ce4eaeae670d51a0083bec-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:56:11 [async_llm.py:261] Added request cmpl-c653d20781ce4eaeae670d51a0083bec-0.
INFO 03-01 23:56:13 [logger.py:42] Received request cmpl-a8788f3830f14d22b997a64e18992b47-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:56:13 [async_llm.py:261] Added request cmpl-a8788f3830f14d22b997a64e18992b47-0.
INFO 03-01 23:56:14 [logger.py:42] Received request cmpl-dda6c18803934edfaac2b6cfdc804ab3-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:56:14 [async_llm.py:261] Added request cmpl-dda6c18803934edfaac2b6cfdc804ab3-0.
INFO 03-01 23:56:15 [logger.py:42] Received request cmpl-8e7a343ad0034ed4aafa987d42f30772-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:56:15 [async_llm.py:261] Added request cmpl-8e7a343ad0034ed4aafa987d42f30772-0.
INFO 03-01 23:56:16 [logger.py:42] Received request cmpl-7e36e6e196ac4dd5b687d87f19b09420-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:56:16 [async_llm.py:261] Added request cmpl-7e36e6e196ac4dd5b687d87f19b09420-0.
INFO 03-01 23:56:17 [logger.py:42] Received request cmpl-6caa124e5f9c46ec82dc6685803ab101-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:56:17 [async_llm.py:261] Added request cmpl-6caa124e5f9c46ec82dc6685803ab101-0.
INFO 03-01 23:56:18 [loggers.py:116] Engine 000: Avg prompt throughput: 27.9 tokens/s, Avg generation throughput: 4.5 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.6%
INFO 03-01 23:56:18 [logger.py:42] Received request cmpl-ebaa5010a83d4ea58c361fe3833c9cc5-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:56:18 [async_llm.py:261] Added request cmpl-ebaa5010a83d4ea58c361fe3833c9cc5-0.
INFO 03-01 23:56:19 [logger.py:42] Received request cmpl-fe1863366efb4f618e809ac0a626e8de-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:56:19 [async_llm.py:261] Added request cmpl-fe1863366efb4f618e809ac0a626e8de-0.
INFO 03-01 23:56:21 [logger.py:42] Received request cmpl-9e3e79fea8f14b30bf7548bc47aa7431-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:56:21 [async_llm.py:261] Added request cmpl-9e3e79fea8f14b30bf7548bc47aa7431-0.
INFO 03-01 23:56:22 [logger.py:42] Received request cmpl-88a0eb4a61e044e589202b6a08102376-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:56:22 [async_llm.py:261] Added request cmpl-88a0eb4a61e044e589202b6a08102376-0.
INFO 03-01 23:56:23 [logger.py:42] Received request cmpl-159eba06e324490f8c0ff14e3973de65-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:56:23 [async_llm.py:261] Added request cmpl-159eba06e324490f8c0ff14e3973de65-0.
INFO 03-01 23:56:24 [logger.py:42] Received request cmpl-982c088dec3f4a1e8c07e616c9d89b69-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:56:24 [async_llm.py:261] Added request cmpl-982c088dec3f4a1e8c07e616c9d89b69-0.
INFO 03-01 23:56:25 [logger.py:42] Received request cmpl-811f886d6d5d420c888d5bc19e6e459b-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:56:25 [async_llm.py:261] Added request cmpl-811f886d6d5d420c888d5bc19e6e459b-0.
INFO 03-01 23:56:26 [logger.py:42] Received request cmpl-97c8d8a270224210b050c6a6544a3950-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:56:26 [async_llm.py:261] Added request cmpl-97c8d8a270224210b050c6a6544a3950-0.
INFO 03-01 23:56:28 [loggers.py:116] Engine 000: Avg prompt throughput: 24.8 tokens/s, Avg generation throughput: 4.0 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.6%
INFO 03-01 23:56:28 [logger.py:42] Received request cmpl-3a60e52b6ae84e539e5f971903d569eb-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:56:28 [async_llm.py:261] Added request cmpl-3a60e52b6ae84e539e5f971903d569eb-0.
INFO 03-01 23:56:29 [logger.py:42] Received request cmpl-abdd899fbbe4473fb5a501bedfd3161f-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:56:29 [async_llm.py:261] Added request cmpl-abdd899fbbe4473fb5a501bedfd3161f-0.
INFO 03-01 23:56:30 [logger.py:42] Received request cmpl-be249dfc5d7b4263826f9b7c616fa01a-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:56:30 [async_llm.py:261] Added request cmpl-be249dfc5d7b4263826f9b7c616fa01a-0.
INFO 03-01 23:56:31 [logger.py:42] Received request cmpl-d7e1b8bd8ca5436685a3c851813e6677-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:56:31 [async_llm.py:261] Added request cmpl-d7e1b8bd8ca5436685a3c851813e6677-0.
INFO 03-01 23:56:32 [logger.py:42] Received request cmpl-c70ad076a6fd45fcb21d753c15374535-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:56:32 [async_llm.py:261] Added request cmpl-c70ad076a6fd45fcb21d753c15374535-0.
INFO 03-01 23:56:33 [logger.py:42] Received request cmpl-3789bad15a7e457f83c73b497f03f5d8-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:56:33 [async_llm.py:261] Added request cmpl-3789bad15a7e457f83c73b497f03f5d8-0.
INFO 03-01 23:56:35 [logger.py:42] Received request cmpl-84a0b6d0fb8b49d4a85e1892254e8be8-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:56:35 [async_llm.py:261] Added request cmpl-84a0b6d0fb8b49d4a85e1892254e8be8-0.
INFO 03-01 23:56:36 [logger.py:42] Received request cmpl-25fda3d34d5547149ea81a799e3f9e37-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:56:36 [async_llm.py:261] Added request cmpl-25fda3d34d5547149ea81a799e3f9e37-0.
INFO 03-01 23:56:37 [logger.py:42] Received request cmpl-9c1f644a424f42f1980b71c816482926-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:56:37 [async_llm.py:261] Added request cmpl-9c1f644a424f42f1980b71c816482926-0.
INFO 03-01 23:56:38 [loggers.py:116] Engine 000: Avg prompt throughput: 27.9 tokens/s, Avg generation throughput: 4.5 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.6%
INFO 03-01 23:56:38 [logger.py:42] Received request cmpl-fb7abb81e72344a19aa9bbfab6e0d396-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:56:38 [async_llm.py:261] Added request cmpl-fb7abb81e72344a19aa9bbfab6e0d396-0.
INFO 03-01 23:56:39 [logger.py:42] Received request cmpl-aa0e1cefff7c4b17a1e178d108c9f6f9-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:56:39 [async_llm.py:261] Added request cmpl-aa0e1cefff7c4b17a1e178d108c9f6f9-0.
INFO 03-01 23:56:40 [logger.py:42] Received request cmpl-0922a73254984280b62dea88ca9bea07-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:56:40 [async_llm.py:261] Added request cmpl-0922a73254984280b62dea88ca9bea07-0.
INFO 03-01 23:56:41 [logger.py:42] Received request cmpl-9394782966d843a19e8d7c16edefb13f-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:56:41 [async_llm.py:261] Added request cmpl-9394782966d843a19e8d7c16edefb13f-0.
INFO 03-01 23:56:43 [logger.py:42] Received request cmpl-28a038e7f4184439b16d83e20a71fb3d-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:56:43 [async_llm.py:261] Added request cmpl-28a038e7f4184439b16d83e20a71fb3d-0.
INFO 03-01 23:56:44 [logger.py:42] Received request cmpl-3b77d538858d4b0b8342a16cdb1fd69f-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:56:44 [async_llm.py:261] Added request cmpl-3b77d538858d4b0b8342a16cdb1fd69f-0.
INFO 03-01 23:56:45 [logger.py:42] Received request cmpl-5158bda5e5734a9eb15e90b382e364e7-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:56:45 [async_llm.py:261] Added request cmpl-5158bda5e5734a9eb15e90b382e364e7-0.
INFO 03-01 23:56:46 [logger.py:42] Received request cmpl-28f3386e6e5247aca9c4396a6e923afe-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:56:46 [async_llm.py:261] Added request cmpl-28f3386e6e5247aca9c4396a6e923afe-0.
INFO 03-01 23:56:47 [logger.py:42] Received request cmpl-57284d61c1e64837829270b2df5769cb-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:56:47 [async_llm.py:261] Added request cmpl-57284d61c1e64837829270b2df5769cb-0.
INFO 03-01 23:56:48 [loggers.py:116] Engine 000: Avg prompt throughput: 27.9 tokens/s, Avg generation throughput: 4.5 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.6%
INFO 03-01 23:56:48 [logger.py:42] Received request cmpl-335ef0d88eab42a99be46e7a0208e79a-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:56:48 [async_llm.py:261] Added request cmpl-335ef0d88eab42a99be46e7a0208e79a-0.
INFO 03-01 23:56:50 [logger.py:42] Received request cmpl-e52ec74282104bcab7c68135385e0573-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:56:50 [async_llm.py:261] Added request cmpl-e52ec74282104bcab7c68135385e0573-0.
INFO 03-01 23:56:51 [logger.py:42] Received request cmpl-29b93093eb1840ab9ec89613844c393c-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:56:51 [async_llm.py:261] Added request cmpl-29b93093eb1840ab9ec89613844c393c-0.
INFO 03-01 23:56:52 [logger.py:42] Received request cmpl-a8daf44a7ed04b579d3688582d65e7f2-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:56:52 [async_llm.py:261] Added request cmpl-a8daf44a7ed04b579d3688582d65e7f2-0.
INFO 03-01 23:56:53 [logger.py:42] Received request cmpl-51a7c7d585bc4a7e8069bc63be718b23-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:56:53 [async_llm.py:261] Added request cmpl-51a7c7d585bc4a7e8069bc63be718b23-0.
INFO 03-01 23:56:54 [logger.py:42] Received request cmpl-947e7a88276b407fb060e79b6a2a72b8-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:56:54 [async_llm.py:261] Added request cmpl-947e7a88276b407fb060e79b6a2a72b8-0.
INFO 03-01 23:56:55 [logger.py:42] Received request cmpl-23df071d986e4bc089fa023aa0a3ce62-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:56:55 [async_llm.py:261] Added request cmpl-23df071d986e4bc089fa023aa0a3ce62-0.
INFO 03-01 23:56:56 [logger.py:42] Received request cmpl-609d4a77689145aeb37c1a80705ddde3-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:56:56 [async_llm.py:261] Added request cmpl-609d4a77689145aeb37c1a80705ddde3-0.
INFO 03-01 23:56:58 [loggers.py:116] Engine 000: Avg prompt throughput: 24.8 tokens/s, Avg generation throughput: 4.0 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.6%
INFO 03-01 23:56:58 [logger.py:42] Received request cmpl-7cb38d75fe13453b8732545ee3e81543-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:56:58 [async_llm.py:261] Added request cmpl-7cb38d75fe13453b8732545ee3e81543-0.
INFO 03-01 23:56:59 [logger.py:42] Received request cmpl-615caaec555546bdb2e8a4b1f027a796-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:56:59 [async_llm.py:261] Added request cmpl-615caaec555546bdb2e8a4b1f027a796-0.
INFO 03-01 23:57:00 [logger.py:42] Received request cmpl-a5b5287de870497c961a96e35f3762d0-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:57:00 [async_llm.py:261] Added request cmpl-a5b5287de870497c961a96e35f3762d0-0.
INFO 03-01 23:57:01 [logger.py:42] Received request cmpl-1e7b2546235e42d5a8ae6c0a3964de74-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:57:01 [async_llm.py:261] Added request cmpl-1e7b2546235e42d5a8ae6c0a3964de74-0.
INFO 03-01 23:57:02 [logger.py:42] Received request cmpl-55de11eb420349aea8158f8ea7401971-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:57:02 [async_llm.py:261] Added request cmpl-55de11eb420349aea8158f8ea7401971-0.
INFO 03-01 23:57:03 [logger.py:42] Received request cmpl-57f60295a75344ccb2e3fe8a063bdb61-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:57:03 [async_llm.py:261] Added request cmpl-57f60295a75344ccb2e3fe8a063bdb61-0.
INFO 03-01 23:57:05 [logger.py:42] Received request cmpl-f5f6f64779e0463299dbcd6d843d80b5-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:57:05 [async_llm.py:261] Added request cmpl-f5f6f64779e0463299dbcd6d843d80b5-0.
INFO 03-01 23:57:06 [logger.py:42] Received request cmpl-9be4f6bf7f6b48fd9aa029bb8b8df31d-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:57:06 [async_llm.py:261] Added request cmpl-9be4f6bf7f6b48fd9aa029bb8b8df31d-0.
INFO 03-01 23:57:07 [logger.py:42] Received request cmpl-025bde87b1be441d9c9ca7915f7b4691-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:57:07 [async_llm.py:261] Added request cmpl-025bde87b1be441d9c9ca7915f7b4691-0.
INFO 03-01 23:57:08 [loggers.py:116] Engine 000: Avg prompt throughput: 27.9 tokens/s, Avg generation throughput: 4.5 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.6%
INFO 03-01 23:57:08 [logger.py:42] Received request cmpl-e7ac70a200164cea957990ccc22f949b-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:57:08 [async_llm.py:261] Added request cmpl-e7ac70a200164cea957990ccc22f949b-0.
INFO 03-01 23:57:09 [logger.py:42] Received request cmpl-a124df9d2daf4d40a5853dc7c783f3e2-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:57:09 [async_llm.py:261] Added request cmpl-a124df9d2daf4d40a5853dc7c783f3e2-0.
INFO 03-01 23:57:10 [logger.py:42] Received request cmpl-b74f12114cd743acac603bc5e94fd6bf-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:57:10 [async_llm.py:261] Added request cmpl-b74f12114cd743acac603bc5e94fd6bf-0.
INFO 03-01 23:57:11 [logger.py:42] Received request cmpl-42f8cf66e95449eba1edcf8539d1eedf-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:57:11 [async_llm.py:261] Added request cmpl-42f8cf66e95449eba1edcf8539d1eedf-0.
INFO 03-01 23:57:13 [logger.py:42] Received request cmpl-c7063a8d38e049c9ad4b46ce95a495e3-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:57:13 [async_llm.py:261] Added request cmpl-c7063a8d38e049c9ad4b46ce95a495e3-0.
INFO 03-01 23:57:14 [logger.py:42] Received request cmpl-8c02c4f79e7c4ea58d4dc63830206093-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:57:14 [async_llm.py:261] Added request cmpl-8c02c4f79e7c4ea58d4dc63830206093-0.
INFO 03-01 23:57:15 [logger.py:42] Received request cmpl-9d7e7eb2fbe343deb96b1cf04150700d-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:57:15 [async_llm.py:261] Added request cmpl-9d7e7eb2fbe343deb96b1cf04150700d-0.
INFO 03-01 23:57:16 [logger.py:42] Received request cmpl-2fc338c0f75e4ae88d14e8a1d350df71-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:57:16 [async_llm.py:261] Added request cmpl-2fc338c0f75e4ae88d14e8a1d350df71-0.
INFO 03-01 23:57:17 [logger.py:42] Received request cmpl-8d8ac12b5d3045cea17130531b503ef0-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:57:17 [async_llm.py:261] Added request cmpl-8d8ac12b5d3045cea17130531b503ef0-0.
INFO 03-01 23:57:18 [loggers.py:116] Engine 000: Avg prompt throughput: 27.9 tokens/s, Avg generation throughput: 4.5 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.6%
INFO 03-01 23:57:18 [logger.py:42] Received request cmpl-609e6cfb0f344980b17b79c669746e30-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:57:18 [async_llm.py:261] Added request cmpl-609e6cfb0f344980b17b79c669746e30-0.
INFO 03-01 23:57:20 [logger.py:42] Received request cmpl-e137fcbd8728452b959f7ee8ebd3dd95-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:57:20 [async_llm.py:261] Added request cmpl-e137fcbd8728452b959f7ee8ebd3dd95-0.
INFO 03-01 23:57:21 [logger.py:42] Received request cmpl-29154185c0dd437c84626c327c744d09-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:57:21 [async_llm.py:261] Added request cmpl-29154185c0dd437c84626c327c744d09-0.
INFO 03-01 23:57:22 [logger.py:42] Received request cmpl-678a7d0a12c2472bb92781dbf1732e03-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:57:22 [async_llm.py:261] Added request cmpl-678a7d0a12c2472bb92781dbf1732e03-0.
INFO 03-01 23:57:23 [logger.py:42] Received request cmpl-b7b1a9e546f34f8ead3bfa4c26bb433a-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:57:23 [async_llm.py:261] Added request cmpl-b7b1a9e546f34f8ead3bfa4c26bb433a-0.
INFO 03-01 23:57:24 [logger.py:42] Received request cmpl-792480762e5b4df496bf0d268b3d44a3-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:57:24 [async_llm.py:261] Added request cmpl-792480762e5b4df496bf0d268b3d44a3-0.
INFO 03-01 23:57:25 [logger.py:42] Received request cmpl-fca7fa2f1f8a424dbe418e0f0d8b1c65-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:57:25 [async_llm.py:261] Added request cmpl-fca7fa2f1f8a424dbe418e0f0d8b1c65-0.
INFO 03-01 23:57:26 [logger.py:42] Received request cmpl-c57f23e9897b48928850472f5c8277d2-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:57:26 [async_llm.py:261] Added request cmpl-c57f23e9897b48928850472f5c8277d2-0.
INFO 03-01 23:57:28 [loggers.py:116] Engine 000: Avg prompt throughput: 24.8 tokens/s, Avg generation throughput: 4.0 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.6%
INFO 03-01 23:57:28 [logger.py:42] Received request cmpl-f27548f8143940de9a47b875436bfcc2-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:57:28 [async_llm.py:261] Added request cmpl-f27548f8143940de9a47b875436bfcc2-0.
INFO 03-01 23:57:29 [logger.py:42] Received request cmpl-d5d12a8b12a749c8a3f24f1b4075e5d0-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:57:29 [async_llm.py:261] Added request cmpl-d5d12a8b12a749c8a3f24f1b4075e5d0-0.
INFO 03-01 23:57:30 [logger.py:42] Received request cmpl-3ad01450b7194f3e9c0583b9dc0a7f37-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:57:30 [async_llm.py:261] Added request cmpl-3ad01450b7194f3e9c0583b9dc0a7f37-0.
INFO 03-01 23:57:31 [logger.py:42] Received request cmpl-649ac722dd99424b8a6a7159ad8e6206-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:57:31 [async_llm.py:261] Added request cmpl-649ac722dd99424b8a6a7159ad8e6206-0.
INFO 03-01 23:57:32 [logger.py:42] Received request cmpl-1e8e32ba9796411aa3d1c4fd693f59d6-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:57:32 [async_llm.py:261] Added request cmpl-1e8e32ba9796411aa3d1c4fd693f59d6-0.
INFO 03-01 23:57:33 [logger.py:42] Received request cmpl-b25d3aa01cc64c8886dfd5a447f527c0-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:57:33 [async_llm.py:261] Added request cmpl-b25d3aa01cc64c8886dfd5a447f527c0-0.
INFO 03-01 23:57:35 [logger.py:42] Received request cmpl-0109a73cef344cbfbd7122468c9768bb-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:57:35 [async_llm.py:261] Added request cmpl-0109a73cef344cbfbd7122468c9768bb-0.
INFO 03-01 23:57:36 [logger.py:42] Received request cmpl-30f8f5632d6647a28d33468086eebfca-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:57:36 [async_llm.py:261] Added request cmpl-30f8f5632d6647a28d33468086eebfca-0.
INFO 03-01 23:57:37 [logger.py:42] Received request cmpl-1396b7dedf5a46cb85ef6e42291ee898-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:57:37 [async_llm.py:261] Added request cmpl-1396b7dedf5a46cb85ef6e42291ee898-0.
INFO 03-01 23:57:38 [loggers.py:116] Engine 000: Avg prompt throughput: 27.9 tokens/s, Avg generation throughput: 4.5 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.6%
INFO 03-01 23:57:38 [logger.py:42] Received request cmpl-8881c883b6f64226a20d1edab9fdc69c-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:57:38 [async_llm.py:261] Added request cmpl-8881c883b6f64226a20d1edab9fdc69c-0.
INFO 03-01 23:57:39 [logger.py:42] Received request cmpl-47dc06f7adeb4c96b0dcb5a8d4ef7f6d-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:57:39 [async_llm.py:261] Added request cmpl-47dc06f7adeb4c96b0dcb5a8d4ef7f6d-0.
INFO 03-01 23:57:40 [logger.py:42] Received request cmpl-dcc1dc1b6a644b76ae69ea7b1baf53d3-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:57:40 [async_llm.py:261] Added request cmpl-dcc1dc1b6a644b76ae69ea7b1baf53d3-0.
INFO 03-01 23:57:41 [logger.py:42] Received request cmpl-4395936ce4e648b7a513e77bce605618-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:57:41 [async_llm.py:261] Added request cmpl-4395936ce4e648b7a513e77bce605618-0.
INFO 03-01 23:57:43 [logger.py:42] Received request cmpl-adff4d6f204342c4a5eae051ce73a9e5-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:57:43 [async_llm.py:261] Added request cmpl-adff4d6f204342c4a5eae051ce73a9e5-0.
INFO 03-01 23:57:44 [logger.py:42] Received request cmpl-891a7f5ab3574637a56e862cc3668b35-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:57:44 [async_llm.py:261] Added request cmpl-891a7f5ab3574637a56e862cc3668b35-0.
INFO 03-01 23:57:45 [logger.py:42] Received request cmpl-662a7f04a3874592b7011c1c93b83f0c-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:57:45 [async_llm.py:261] Added request cmpl-662a7f04a3874592b7011c1c93b83f0c-0.
INFO 03-01 23:57:46 [logger.py:42] Received request cmpl-a6494ff66dbc4273a1d7bd1ea9283e3a-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:57:46 [async_llm.py:261] Added request cmpl-a6494ff66dbc4273a1d7bd1ea9283e3a-0.
INFO 03-01 23:57:47 [logger.py:42] Received request cmpl-31c8d2827c2048628a8bc960a0de8a7a-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:57:47 [async_llm.py:261] Added request cmpl-31c8d2827c2048628a8bc960a0de8a7a-0.
INFO 03-01 23:57:48 [loggers.py:116] Engine 000: Avg prompt throughput: 27.9 tokens/s, Avg generation throughput: 4.5 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.6%
INFO 03-01 23:57:48 [logger.py:42] Received request cmpl-096f26c858a34e7c98782aa1e26e4050-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:57:48 [async_llm.py:261] Added request cmpl-096f26c858a34e7c98782aa1e26e4050-0.
INFO 03-01 23:57:50 [logger.py:42] Received request cmpl-a3c806f1c22c409ab8ef3ae359675d23-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:57:50 [async_llm.py:261] Added request cmpl-a3c806f1c22c409ab8ef3ae359675d23-0.
INFO 03-01 23:57:51 [logger.py:42] Received request cmpl-214a8fcaac634f8fa8f95d9742b0bbbe-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:57:51 [async_llm.py:261] Added request cmpl-214a8fcaac634f8fa8f95d9742b0bbbe-0.
INFO 03-01 23:57:52 [logger.py:42] Received request cmpl-49be108a92994286b2acdcb444565dfd-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:57:52 [async_llm.py:261] Added request cmpl-49be108a92994286b2acdcb444565dfd-0.
INFO 03-01 23:57:53 [logger.py:42] Received request cmpl-5f9fcbd88f0e438bb6edfbd3ee59f167-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:57:53 [async_llm.py:261] Added request cmpl-5f9fcbd88f0e438bb6edfbd3ee59f167-0.
INFO 03-01 23:57:54 [logger.py:42] Received request cmpl-8acbc2f4bcb3449d8b83e1492af55b3b-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:57:54 [async_llm.py:261] Added request cmpl-8acbc2f4bcb3449d8b83e1492af55b3b-0.
INFO 03-01 23:57:55 [logger.py:42] Received request cmpl-7e5260dfbb1c48f0b6c88477e1e8803b-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:57:55 [async_llm.py:261] Added request cmpl-7e5260dfbb1c48f0b6c88477e1e8803b-0.
INFO 03-01 23:57:56 [logger.py:42] Received request cmpl-fa4516fa40534f96b301c0119e396e10-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:57:56 [async_llm.py:261] Added request cmpl-fa4516fa40534f96b301c0119e396e10-0.
INFO 03-01 23:57:58 [loggers.py:116] Engine 000: Avg prompt throughput: 24.8 tokens/s, Avg generation throughput: 4.0 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.6%
INFO 03-01 23:57:58 [logger.py:42] Received request cmpl-9e45521048cf4dde9c1ae7ebb0f9fa0d-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:57:58 [async_llm.py:261] Added request cmpl-9e45521048cf4dde9c1ae7ebb0f9fa0d-0.
INFO 03-01 23:57:59 [logger.py:42] Received request cmpl-c834800181cd44ffa9217c1843f9a726-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:57:59 [async_llm.py:261] Added request cmpl-c834800181cd44ffa9217c1843f9a726-0.
INFO 03-01 23:58:00 [logger.py:42] Received request cmpl-eb61c235373647a4b1ff1abe07ee32c9-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:58:00 [async_llm.py:261] Added request cmpl-eb61c235373647a4b1ff1abe07ee32c9-0.
INFO 03-01 23:58:01 [logger.py:42] Received request cmpl-e0066179b53f47f798eecef8276710d0-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:58:01 [async_llm.py:261] Added request cmpl-e0066179b53f47f798eecef8276710d0-0.
INFO 03-01 23:58:02 [logger.py:42] Received request cmpl-a52056ab70f6469fab812b3d1b462914-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:58:02 [async_llm.py:261] Added request cmpl-a52056ab70f6469fab812b3d1b462914-0.
INFO 03-01 23:58:03 [logger.py:42] Received request cmpl-9d336447f3644e27bcb6da948179e7f5-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:58:03 [async_llm.py:261] Added request cmpl-9d336447f3644e27bcb6da948179e7f5-0.
INFO 03-01 23:58:05 [logger.py:42] Received request cmpl-f870756c2f324241bfd15e22567fd7dd-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:58:05 [async_llm.py:261] Added request cmpl-f870756c2f324241bfd15e22567fd7dd-0.
INFO 03-01 23:58:06 [logger.py:42] Received request cmpl-c7b69beaf8ad4490ad61a0b88521bbe9-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:58:06 [async_llm.py:261] Added request cmpl-c7b69beaf8ad4490ad61a0b88521bbe9-0.
INFO 03-01 23:58:07 [logger.py:42] Received request cmpl-80c0e976837a4f72b04bc7bbdd9d81aa-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:58:07 [async_llm.py:261] Added request cmpl-80c0e976837a4f72b04bc7bbdd9d81aa-0.
INFO 03-01 23:58:08 [loggers.py:116] Engine 000: Avg prompt throughput: 27.9 tokens/s, Avg generation throughput: 4.5 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.6%
INFO 03-01 23:58:08 [logger.py:42] Received request cmpl-a757ba85d25a45edbb804d11e28bea1a-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:58:08 [async_llm.py:261] Added request cmpl-a757ba85d25a45edbb804d11e28bea1a-0.
INFO 03-01 23:58:09 [logger.py:42] Received request cmpl-eb0c2ec1170c4929970cc4a70bbf6e48-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:58:09 [async_llm.py:261] Added request cmpl-eb0c2ec1170c4929970cc4a70bbf6e48-0.
INFO 03-01 23:58:10 [logger.py:42] Received request cmpl-31010eeda1a4463caab9c580c5db0ef7-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:58:10 [async_llm.py:261] Added request cmpl-31010eeda1a4463caab9c580c5db0ef7-0.
INFO 03-01 23:58:11 [logger.py:42] Received request cmpl-1f3a406bfb3441a1bdb53a5abc52b6b3-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:58:11 [async_llm.py:261] Added request cmpl-1f3a406bfb3441a1bdb53a5abc52b6b3-0.
INFO 03-01 23:58:13 [logger.py:42] Received request cmpl-df1c110922414047a795ef9e0da076db-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:58:13 [async_llm.py:261] Added request cmpl-df1c110922414047a795ef9e0da076db-0.
INFO 03-01 23:58:14 [logger.py:42] Received request cmpl-e2761bbd0b504e97a1ca7c856692a93f-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:58:14 [async_llm.py:261] Added request cmpl-e2761bbd0b504e97a1ca7c856692a93f-0.
INFO 03-01 23:58:15 [logger.py:42] Received request cmpl-7c75b51bbb4846ef8d8fe78d98146cc0-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:58:15 [async_llm.py:261] Added request cmpl-7c75b51bbb4846ef8d8fe78d98146cc0-0.
INFO 03-01 23:58:16 [logger.py:42] Received request cmpl-e9b1312c3a284b18b6099f101d2da112-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:58:16 [async_llm.py:261] Added request cmpl-e9b1312c3a284b18b6099f101d2da112-0.
INFO 03-01 23:58:17 [logger.py:42] Received request cmpl-549bffdb99ab41c98cb8b272169c3c15-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:58:17 [async_llm.py:261] Added request cmpl-549bffdb99ab41c98cb8b272169c3c15-0.
INFO 03-01 23:58:18 [loggers.py:116] Engine 000: Avg prompt throughput: 27.9 tokens/s, Avg generation throughput: 4.5 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.6%
INFO 03-01 23:58:18 [logger.py:42] Received request cmpl-68245503646648afb15d4e61584349a0-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:58:18 [async_llm.py:261] Added request cmpl-68245503646648afb15d4e61584349a0-0.
INFO 03-01 23:58:20 [logger.py:42] Received request cmpl-281f3a5f10d3495fb86e9743ca453fd3-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:58:20 [async_llm.py:261] Added request cmpl-281f3a5f10d3495fb86e9743ca453fd3-0.
INFO 03-01 23:58:21 [logger.py:42] Received request cmpl-2fba4e17805d400b9bce89f140e63756-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:58:21 [async_llm.py:261] Added request cmpl-2fba4e17805d400b9bce89f140e63756-0.
INFO 03-01 23:58:22 [logger.py:42] Received request cmpl-1ddd9f82510b4c698bdf1a1617551869-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:58:22 [async_llm.py:261] Added request cmpl-1ddd9f82510b4c698bdf1a1617551869-0.
INFO 03-01 23:58:23 [logger.py:42] Received request cmpl-b4c66cef565c478dbf123ef998e4db94-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:58:23 [async_llm.py:261] Added request cmpl-b4c66cef565c478dbf123ef998e4db94-0.
INFO 03-01 23:58:24 [logger.py:42] Received request cmpl-6a301ba609ca45129b6321be6849550e-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:58:24 [async_llm.py:261] Added request cmpl-6a301ba609ca45129b6321be6849550e-0.
INFO 03-01 23:58:25 [logger.py:42] Received request cmpl-1a194151ea514e43b624e62e2151b189-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:58:25 [async_llm.py:261] Added request cmpl-1a194151ea514e43b624e62e2151b189-0.
INFO 03-01 23:58:26 [logger.py:42] Received request cmpl-fece781123104f87adb30cec2eeeda34-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:58:26 [async_llm.py:261] Added request cmpl-fece781123104f87adb30cec2eeeda34-0.
INFO 03-01 23:58:28 [loggers.py:116] Engine 000: Avg prompt throughput: 24.8 tokens/s, Avg generation throughput: 4.0 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.6%
INFO 03-01 23:58:28 [logger.py:42] Received request cmpl-ad9d5da463e04fd6ad9666092f23ceec-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:58:28 [async_llm.py:261] Added request cmpl-ad9d5da463e04fd6ad9666092f23ceec-0.
INFO 03-01 23:58:29 [logger.py:42] Received request cmpl-cc0e470ff1aa48ac80cd3c5d850a891b-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:58:29 [async_llm.py:261] Added request cmpl-cc0e470ff1aa48ac80cd3c5d850a891b-0.
INFO 03-01 23:58:30 [logger.py:42] Received request cmpl-a1ddf2c64fc64c7599d4874177bf6815-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:58:30 [async_llm.py:261] Added request cmpl-a1ddf2c64fc64c7599d4874177bf6815-0.
INFO 03-01 23:58:31 [logger.py:42] Received request cmpl-1a2ecca1c3c7436fb6a32fda89c2cdee-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:58:31 [async_llm.py:261] Added request cmpl-1a2ecca1c3c7436fb6a32fda89c2cdee-0.
INFO 03-01 23:58:32 [logger.py:42] Received request cmpl-d987ba79bd0d469887bfac45a90dc781-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:58:32 [async_llm.py:261] Added request cmpl-d987ba79bd0d469887bfac45a90dc781-0.
INFO 03-01 23:58:33 [logger.py:42] Received request cmpl-e840dafcd4bc4772806186ab877b34d6-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:58:33 [async_llm.py:261] Added request cmpl-e840dafcd4bc4772806186ab877b34d6-0.
INFO 03-01 23:58:35 [logger.py:42] Received request cmpl-6e2256f54a404935a8cff014428d0180-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:58:35 [async_llm.py:261] Added request cmpl-6e2256f54a404935a8cff014428d0180-0.
INFO 03-01 23:58:36 [logger.py:42] Received request cmpl-bf20a4f3893241f99fd38aeccf831b5a-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:58:36 [async_llm.py:261] Added request cmpl-bf20a4f3893241f99fd38aeccf831b5a-0.
INFO 03-01 23:58:37 [logger.py:42] Received request cmpl-b931beb565814aa7a12c41faa2cd57dd-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:58:37 [async_llm.py:261] Added request cmpl-b931beb565814aa7a12c41faa2cd57dd-0.
INFO 03-01 23:58:38 [loggers.py:116] Engine 000: Avg prompt throughput: 27.9 tokens/s, Avg generation throughput: 4.5 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.6%
INFO 03-01 23:58:38 [logger.py:42] Received request cmpl-7b750690c8ad4642870b5397de9f338d-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:58:38 [async_llm.py:261] Added request cmpl-7b750690c8ad4642870b5397de9f338d-0.
INFO 03-01 23:58:39 [logger.py:42] Received request cmpl-b60aa96bf582479c97813295137310af-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:58:39 [async_llm.py:261] Added request cmpl-b60aa96bf582479c97813295137310af-0.
INFO 03-01 23:58:40 [logger.py:42] Received request cmpl-5fbd84439fc24cc5be7134610cbb5677-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:58:40 [async_llm.py:261] Added request cmpl-5fbd84439fc24cc5be7134610cbb5677-0.
INFO 03-01 23:58:41 [logger.py:42] Received request cmpl-5ab3a597e3b94d16bea1466ad25747a5-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:58:41 [async_llm.py:261] Added request cmpl-5ab3a597e3b94d16bea1466ad25747a5-0.
INFO 03-01 23:58:43 [logger.py:42] Received request cmpl-82e12fc6c5e841a78d17bf61ca3adb41-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:58:43 [async_llm.py:261] Added request cmpl-82e12fc6c5e841a78d17bf61ca3adb41-0.
INFO 03-01 23:58:44 [logger.py:42] Received request cmpl-a7ce3844811c4fa58ee9eb5aebd9853c-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:58:44 [async_llm.py:261] Added request cmpl-a7ce3844811c4fa58ee9eb5aebd9853c-0.
INFO 03-01 23:58:45 [logger.py:42] Received request cmpl-3ec2034415d84a8a807b4da09f3629d5-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:58:45 [async_llm.py:261] Added request cmpl-3ec2034415d84a8a807b4da09f3629d5-0.
INFO 03-01 23:58:46 [logger.py:42] Received request cmpl-41041c8ba1d94fa6a5b53e4f874d91e0-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:58:46 [async_llm.py:261] Added request cmpl-41041c8ba1d94fa6a5b53e4f874d91e0-0.
INFO 03-01 23:58:47 [logger.py:42] Received request cmpl-20079a95d53a4c2f9387dafccce2c457-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:58:47 [async_llm.py:261] Added request cmpl-20079a95d53a4c2f9387dafccce2c457-0.
INFO 03-01 23:58:48 [loggers.py:116] Engine 000: Avg prompt throughput: 27.9 tokens/s, Avg generation throughput: 4.5 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.6%
INFO 03-01 23:58:48 [logger.py:42] Received request cmpl-e886ff6beeb74fd29da72e7c7fc55966-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:58:48 [async_llm.py:261] Added request cmpl-e886ff6beeb74fd29da72e7c7fc55966-0.
INFO 03-01 23:58:50 [logger.py:42] Received request cmpl-bec38ba244074225937c329e70e61be0-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:58:50 [async_llm.py:261] Added request cmpl-bec38ba244074225937c329e70e61be0-0.
INFO 03-01 23:58:51 [logger.py:42] Received request cmpl-893afbcc4f7c489482df8560e6a55835-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:58:51 [async_llm.py:261] Added request cmpl-893afbcc4f7c489482df8560e6a55835-0.
INFO 03-01 23:58:52 [logger.py:42] Received request cmpl-eb9c60037b53476a8354b727a929bf71-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:58:52 [async_llm.py:261] Added request cmpl-eb9c60037b53476a8354b727a929bf71-0.
INFO 03-01 23:58:53 [logger.py:42] Received request cmpl-016dded8fe6a4401a49de4acbbdcb75a-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:58:53 [async_llm.py:261] Added request cmpl-016dded8fe6a4401a49de4acbbdcb75a-0.
INFO 03-01 23:58:54 [logger.py:42] Received request cmpl-1a2707324cde4033811b825812d9b9ba-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:58:54 [async_llm.py:261] Added request cmpl-1a2707324cde4033811b825812d9b9ba-0.
INFO 03-01 23:58:55 [logger.py:42] Received request cmpl-b5e00cfe38c944feb62c094ed2befd48-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:58:55 [async_llm.py:261] Added request cmpl-b5e00cfe38c944feb62c094ed2befd48-0.
INFO 03-01 23:58:56 [logger.py:42] Received request cmpl-de52c4fccf6647dbb625654bbeffdc17-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:58:56 [async_llm.py:261] Added request cmpl-de52c4fccf6647dbb625654bbeffdc17-0.
INFO 03-01 23:58:58 [loggers.py:116] Engine 000: Avg prompt throughput: 24.8 tokens/s, Avg generation throughput: 4.0 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.6%
INFO 03-01 23:58:58 [logger.py:42] Received request cmpl-a9d8e2b01d1f4b92a33d1f6c5480c612-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:58:58 [async_llm.py:261] Added request cmpl-a9d8e2b01d1f4b92a33d1f6c5480c612-0.
INFO 03-01 23:58:59 [logger.py:42] Received request cmpl-3d4ed03c14704251ab188c1dfa64bc13-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:58:59 [async_llm.py:261] Added request cmpl-3d4ed03c14704251ab188c1dfa64bc13-0.
INFO 03-01 23:59:00 [logger.py:42] Received request cmpl-557e527a28e345aabe8f67a9b5860190-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:59:00 [async_llm.py:261] Added request cmpl-557e527a28e345aabe8f67a9b5860190-0.
INFO 03-01 23:59:01 [logger.py:42] Received request cmpl-17ec48ce17fb4a5389edb13fceb26dcd-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:59:01 [async_llm.py:261] Added request cmpl-17ec48ce17fb4a5389edb13fceb26dcd-0.
INFO 03-01 23:59:02 [logger.py:42] Received request cmpl-a29b5cea580e48fa939da35e767ee32c-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:59:02 [async_llm.py:261] Added request cmpl-a29b5cea580e48fa939da35e767ee32c-0.
INFO 03-01 23:59:03 [logger.py:42] Received request cmpl-991a7df8da8447659c9f4051f2a7974f-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:59:03 [async_llm.py:261] Added request cmpl-991a7df8da8447659c9f4051f2a7974f-0.
INFO 03-01 23:59:05 [logger.py:42] Received request cmpl-a5efc3c0546348308164fca88ba13e36-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:59:05 [async_llm.py:261] Added request cmpl-a5efc3c0546348308164fca88ba13e36-0.
INFO 03-01 23:59:06 [logger.py:42] Received request cmpl-d6847d834546447ebde9d383cc36b724-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:59:06 [async_llm.py:261] Added request cmpl-d6847d834546447ebde9d383cc36b724-0.
INFO 03-01 23:59:07 [logger.py:42] Received request cmpl-228ccd2e7a2d482fbd875c44e5adc7c8-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:59:07 [async_llm.py:261] Added request cmpl-228ccd2e7a2d482fbd875c44e5adc7c8-0.
INFO 03-01 23:59:08 [loggers.py:116] Engine 000: Avg prompt throughput: 27.9 tokens/s, Avg generation throughput: 4.5 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.6%
INFO 03-01 23:59:08 [logger.py:42] Received request cmpl-0fbeb065959a4d9f87973f95a39db9e3-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:59:08 [async_llm.py:261] Added request cmpl-0fbeb065959a4d9f87973f95a39db9e3-0.
INFO 03-01 23:59:09 [logger.py:42] Received request cmpl-5f4be709923d4cb9b9cf257fe2f5f4d4-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:59:09 [async_llm.py:261] Added request cmpl-5f4be709923d4cb9b9cf257fe2f5f4d4-0.
INFO 03-01 23:59:10 [logger.py:42] Received request cmpl-0a94ce4eb85148f8ad5b57eb980d9a5d-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:59:10 [async_llm.py:261] Added request cmpl-0a94ce4eb85148f8ad5b57eb980d9a5d-0.
INFO 03-01 23:59:11 [logger.py:42] Received request cmpl-f3e5036f09b4497f855f40d9963c6ad9-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:59:11 [async_llm.py:261] Added request cmpl-f3e5036f09b4497f855f40d9963c6ad9-0.
INFO 03-01 23:59:13 [logger.py:42] Received request cmpl-641efa8f7ffe445c9fce024e5d3519bd-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:59:13 [async_llm.py:261] Added request cmpl-641efa8f7ffe445c9fce024e5d3519bd-0.
INFO 03-01 23:59:14 [logger.py:42] Received request cmpl-52e4b4af2b524506b7c6daa98fc3e970-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:59:14 [async_llm.py:261] Added request cmpl-52e4b4af2b524506b7c6daa98fc3e970-0.
INFO 03-01 23:59:15 [logger.py:42] Received request cmpl-f7ebece709bc4215b7c42bcd0e6f1e7f-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:59:15 [async_llm.py:261] Added request cmpl-f7ebece709bc4215b7c42bcd0e6f1e7f-0.
INFO 03-01 23:59:16 [logger.py:42] Received request cmpl-b7f784ba51bc4c8586087a0eddea04db-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:59:16 [async_llm.py:261] Added request cmpl-b7f784ba51bc4c8586087a0eddea04db-0.
INFO 03-01 23:59:17 [logger.py:42] Received request cmpl-751f7d48a8e649738560205f90b5171c-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:59:17 [async_llm.py:261] Added request cmpl-751f7d48a8e649738560205f90b5171c-0.
INFO 03-01 23:59:18 [loggers.py:116] Engine 000: Avg prompt throughput: 27.9 tokens/s, Avg generation throughput: 4.5 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.6%
INFO 03-01 23:59:18 [logger.py:42] Received request cmpl-cc530e62360a46fc8d7f19455eb43692-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:59:18 [async_llm.py:261] Added request cmpl-cc530e62360a46fc8d7f19455eb43692-0.
INFO 03-01 23:59:20 [logger.py:42] Received request cmpl-6b7b6f1528f54dd391ddfca5bda90abb-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:59:20 [async_llm.py:261] Added request cmpl-6b7b6f1528f54dd391ddfca5bda90abb-0.
INFO 03-01 23:59:21 [logger.py:42] Received request cmpl-3460f0f20e1d464097cb559202e831bf-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:59:21 [async_llm.py:261] Added request cmpl-3460f0f20e1d464097cb559202e831bf-0.
INFO 03-01 23:59:22 [logger.py:42] Received request cmpl-00be2efd6cb746f790fd049fc9795644-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:59:22 [async_llm.py:261] Added request cmpl-00be2efd6cb746f790fd049fc9795644-0.
INFO 03-01 23:59:23 [logger.py:42] Received request cmpl-86a6f0651a1d4094b16880577733f205-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:59:23 [async_llm.py:261] Added request cmpl-86a6f0651a1d4094b16880577733f205-0.
INFO 03-01 23:59:24 [logger.py:42] Received request cmpl-99b12d4939e14f8f8007db5788881cbd-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:59:24 [async_llm.py:261] Added request cmpl-99b12d4939e14f8f8007db5788881cbd-0.
INFO 03-01 23:59:25 [logger.py:42] Received request cmpl-4094ba7d41154c5cb22ccab947c115d1-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:59:25 [async_llm.py:261] Added request cmpl-4094ba7d41154c5cb22ccab947c115d1-0.
INFO 03-01 23:59:27 [logger.py:42] Received request cmpl-78c16a34af4c4c5386abe8a033a1ebb3-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:59:27 [async_llm.py:261] Added request cmpl-78c16a34af4c4c5386abe8a033a1ebb3-0.
INFO 03-01 23:59:28 [loggers.py:116] Engine 000: Avg prompt throughput: 24.8 tokens/s, Avg generation throughput: 4.0 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.6%
INFO 03-01 23:59:28 [logger.py:42] Received request cmpl-3393fbba3b844dd4bd4a4f11dc4cde97-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:59:28 [async_llm.py:261] Added request cmpl-3393fbba3b844dd4bd4a4f11dc4cde97-0.
INFO 03-01 23:59:29 [logger.py:42] Received request cmpl-e3836555b14b4f088b9465dd4750401b-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:59:29 [async_llm.py:261] Added request cmpl-e3836555b14b4f088b9465dd4750401b-0.
INFO 03-01 23:59:30 [logger.py:42] Received request cmpl-675132ae30ca4f46b5db5cce5ce51fe2-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:59:30 [async_llm.py:261] Added request cmpl-675132ae30ca4f46b5db5cce5ce51fe2-0.
INFO 03-01 23:59:31 [logger.py:42] Received request cmpl-787396d5060b443ebe23314a03582fb9-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:59:31 [async_llm.py:261] Added request cmpl-787396d5060b443ebe23314a03582fb9-0.
INFO 03-01 23:59:32 [logger.py:42] Received request cmpl-cd80652381be4de2af3e4a95ae96a2d5-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:59:32 [async_llm.py:261] Added request cmpl-cd80652381be4de2af3e4a95ae96a2d5-0.
INFO 03-01 23:59:33 [logger.py:42] Received request cmpl-9b27a5cd08c5445c8abaeae6cdba9509-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:59:33 [async_llm.py:261] Added request cmpl-9b27a5cd08c5445c8abaeae6cdba9509-0.
INFO 03-01 23:59:35 [logger.py:42] Received request cmpl-0983abd56426410b991ba3930a4035d5-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:59:35 [async_llm.py:261] Added request cmpl-0983abd56426410b991ba3930a4035d5-0.
INFO 03-01 23:59:36 [logger.py:42] Received request cmpl-8b36b8934b0643a2bb8cbea84a963cb2-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:59:36 [async_llm.py:261] Added request cmpl-8b36b8934b0643a2bb8cbea84a963cb2-0.
INFO 03-01 23:59:37 [logger.py:42] Received request cmpl-b3848067df214e7687ebaf10d0dbcbc1-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:59:37 [async_llm.py:261] Added request cmpl-b3848067df214e7687ebaf10d0dbcbc1-0.
INFO 03-01 23:59:38 [loggers.py:116] Engine 000: Avg prompt throughput: 27.9 tokens/s, Avg generation throughput: 4.5 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.6%
INFO 03-01 23:59:38 [logger.py:42] Received request cmpl-a4add1bbbc244cdc82334481c35a386a-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:59:38 [async_llm.py:261] Added request cmpl-a4add1bbbc244cdc82334481c35a386a-0.
INFO 03-01 23:59:39 [logger.py:42] Received request cmpl-8c85f7c8b0c349c9b1cab8d338ea7562-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:59:39 [async_llm.py:261] Added request cmpl-8c85f7c8b0c349c9b1cab8d338ea7562-0.
INFO 03-01 23:59:40 [logger.py:42] Received request cmpl-b44c6f46b06f4850b27e6ef74a0750b2-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:59:40 [async_llm.py:261] Added request cmpl-b44c6f46b06f4850b27e6ef74a0750b2-0.
INFO 03-01 23:59:42 [logger.py:42] Received request cmpl-4a60450b93ef46ad8db8bf05fa06f8b5-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:59:42 [async_llm.py:261] Added request cmpl-4a60450b93ef46ad8db8bf05fa06f8b5-0.
INFO 03-01 23:59:43 [logger.py:42] Received request cmpl-471040fff183415da1f3eaf3df4aa589-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:59:43 [async_llm.py:261] Added request cmpl-471040fff183415da1f3eaf3df4aa589-0.
INFO 03-01 23:59:44 [logger.py:42] Received request cmpl-64bfa4daaf1c4225ab291971f854e073-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:59:44 [async_llm.py:261] Added request cmpl-64bfa4daaf1c4225ab291971f854e073-0.
INFO 03-01 23:59:45 [logger.py:42] Received request cmpl-f00dfa4faa8b4642adb31e5e8629de34-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:59:45 [async_llm.py:261] Added request cmpl-f00dfa4faa8b4642adb31e5e8629de34-0.
INFO 03-01 23:59:46 [logger.py:42] Received request cmpl-6234d59156d1487bb1ed0bdbe1ad322f-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:59:46 [async_llm.py:261] Added request cmpl-6234d59156d1487bb1ed0bdbe1ad322f-0.
INFO 03-01 23:59:47 [logger.py:42] Received request cmpl-da2ed6e4de0743ea9af7403fd275d211-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:59:47 [async_llm.py:261] Added request cmpl-da2ed6e4de0743ea9af7403fd275d211-0.
INFO 03-01 23:59:48 [loggers.py:116] Engine 000: Avg prompt throughput: 27.9 tokens/s, Avg generation throughput: 4.5 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.6%
INFO 03-01 23:59:48 [logger.py:42] Received request cmpl-6323db6531d34d47a3957d515b8c6044-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:59:48 [async_llm.py:261] Added request cmpl-6323db6531d34d47a3957d515b8c6044-0.
INFO 03-01 23:59:50 [logger.py:42] Received request cmpl-98cec59b26a9468e9b90ed401b4a3d30-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:59:50 [async_llm.py:261] Added request cmpl-98cec59b26a9468e9b90ed401b4a3d30-0.
INFO 03-01 23:59:51 [logger.py:42] Received request cmpl-b5c966fb9d1144a492a03170660bcaa6-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:59:51 [async_llm.py:261] Added request cmpl-b5c966fb9d1144a492a03170660bcaa6-0.
INFO 03-01 23:59:52 [logger.py:42] Received request cmpl-b5d7a5bc2d804bffa4d51c65ed4f9dca-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:59:52 [async_llm.py:261] Added request cmpl-b5d7a5bc2d804bffa4d51c65ed4f9dca-0.
INFO 03-01 23:59:53 [logger.py:42] Received request cmpl-dc23b25f40ed4c5db9cf1072fb7dcf6b-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:59:53 [async_llm.py:261] Added request cmpl-dc23b25f40ed4c5db9cf1072fb7dcf6b-0.
INFO 03-01 23:59:54 [logger.py:42] Received request cmpl-bedbd95deea348c0a57a92d75bd9b973-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:59:54 [async_llm.py:261] Added request cmpl-bedbd95deea348c0a57a92d75bd9b973-0.
INFO 03-01 23:59:55 [logger.py:42] Received request cmpl-08de743f890b497a82a6062ff1897a0a-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:59:55 [async_llm.py:261] Added request cmpl-08de743f890b497a82a6062ff1897a0a-0.
INFO 03-01 23:59:57 [logger.py:42] Received request cmpl-e81ea2317b0943ba88051ea83195fb7a-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:59:57 [async_llm.py:261] Added request cmpl-e81ea2317b0943ba88051ea83195fb7a-0.
INFO 03-01 23:59:58 [loggers.py:116] Engine 000: Avg prompt throughput: 24.8 tokens/s, Avg generation throughput: 4.0 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.6%
INFO 03-01 23:59:58 [logger.py:42] Received request cmpl-4713714b68f14f008c94fc3c5e0feb91-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:59:58 [async_llm.py:261] Added request cmpl-4713714b68f14f008c94fc3c5e0feb91-0.
INFO 03-01 23:59:59 [logger.py:42] Received request cmpl-45c102cfa53d410b960ca61a4e3615b4-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-01 23:59:59 [async_llm.py:261] Added request cmpl-45c102cfa53d410b960ca61a4e3615b4-0.
INFO 03-02 00:00:00 [logger.py:42] Received request cmpl-9e9a29cb427a49fd8d53a6fe949e5504-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:00:00 [async_llm.py:261] Added request cmpl-9e9a29cb427a49fd8d53a6fe949e5504-0.
INFO 03-02 00:00:01 [logger.py:42] Received request cmpl-b09c9006636640e2ba260af4bb313b8c-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:00:01 [async_llm.py:261] Added request cmpl-b09c9006636640e2ba260af4bb313b8c-0.
INFO 03-02 00:00:02 [logger.py:42] Received request cmpl-e510545b363648b3a87b20314671e27f-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:00:02 [async_llm.py:261] Added request cmpl-e510545b363648b3a87b20314671e27f-0.
INFO 03-02 00:00:03 [logger.py:42] Received request cmpl-32b01cf44f7146c3bb41a46af1747502-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:00:03 [async_llm.py:261] Added request cmpl-32b01cf44f7146c3bb41a46af1747502-0.
INFO 03-02 00:00:05 [logger.py:42] Received request cmpl-89c6cb841dae4d5f8bcaf81ae9026dc4-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:00:05 [async_llm.py:261] Added request cmpl-89c6cb841dae4d5f8bcaf81ae9026dc4-0.
INFO 03-02 00:00:06 [logger.py:42] Received request cmpl-0f1578b812154f48bd6765a0b2bab828-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:00:06 [async_llm.py:261] Added request cmpl-0f1578b812154f48bd6765a0b2bab828-0.
INFO 03-02 00:00:07 [logger.py:42] Received request cmpl-8a8f4524c2444d0f92a7428f8e13fd2d-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:00:07 [async_llm.py:261] Added request cmpl-8a8f4524c2444d0f92a7428f8e13fd2d-0.
INFO 03-02 00:00:08 [loggers.py:116] Engine 000: Avg prompt throughput: 27.9 tokens/s, Avg generation throughput: 4.5 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.6%
INFO 03-02 00:00:08 [logger.py:42] Received request cmpl-ae123c11eb4f4567afa16fddb00f244a-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:00:08 [async_llm.py:261] Added request cmpl-ae123c11eb4f4567afa16fddb00f244a-0.
INFO 03-02 00:00:09 [logger.py:42] Received request cmpl-95dc4b4fa64d4c13b7eeb533721eaa64-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:00:09 [async_llm.py:261] Added request cmpl-95dc4b4fa64d4c13b7eeb533721eaa64-0.
INFO 03-02 00:00:10 [logger.py:42] Received request cmpl-a3715a30c12c4c029361e66111b48806-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:00:10 [async_llm.py:261] Added request cmpl-a3715a30c12c4c029361e66111b48806-0.
INFO 03-02 00:00:12 [logger.py:42] Received request cmpl-ee92b5b8ebcb48abaa1886b7b881693d-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:00:12 [async_llm.py:261] Added request cmpl-ee92b5b8ebcb48abaa1886b7b881693d-0.
INFO 03-02 00:00:13 [logger.py:42] Received request cmpl-ab59e0e80a8e41a48c86e75446f80bd0-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:00:13 [async_llm.py:261] Added request cmpl-ab59e0e80a8e41a48c86e75446f80bd0-0.
INFO 03-02 00:00:14 [logger.py:42] Received request cmpl-efce6466898243b288331556fbb9e00c-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:00:14 [async_llm.py:261] Added request cmpl-efce6466898243b288331556fbb9e00c-0.
INFO 03-02 00:00:15 [logger.py:42] Received request cmpl-c5e15ce4066e4c75bec8d8d7fd1168ff-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:00:15 [async_llm.py:261] Added request cmpl-c5e15ce4066e4c75bec8d8d7fd1168ff-0.
INFO 03-02 00:00:16 [logger.py:42] Received request cmpl-cd6bd117423f453880ea8fa9c2233b69-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:00:16 [async_llm.py:261] Added request cmpl-cd6bd117423f453880ea8fa9c2233b69-0.
INFO 03-02 00:00:17 [logger.py:42] Received request cmpl-de2bd44d53314c278757cc7579747a2a-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:00:17 [async_llm.py:261] Added request cmpl-de2bd44d53314c278757cc7579747a2a-0.
INFO 03-02 00:00:18 [loggers.py:116] Engine 000: Avg prompt throughput: 27.9 tokens/s, Avg generation throughput: 4.5 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.6%
INFO 03-02 00:00:18 [logger.py:42] Received request cmpl-a9dd37d33a5c4d26bbe7ff6e75375d8f-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:00:18 [async_llm.py:261] Added request cmpl-a9dd37d33a5c4d26bbe7ff6e75375d8f-0.
INFO 03-02 00:00:20 [logger.py:42] Received request cmpl-132c81f51c924d29a83f8e4953c6e27f-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:00:20 [async_llm.py:261] Added request cmpl-132c81f51c924d29a83f8e4953c6e27f-0.
INFO 03-02 00:00:21 [logger.py:42] Received request cmpl-1c97e5b850dd4f48b8495234e7d9a18d-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:00:21 [async_llm.py:261] Added request cmpl-1c97e5b850dd4f48b8495234e7d9a18d-0.
INFO 03-02 00:00:22 [logger.py:42] Received request cmpl-59a57367b45d4c318a07a55d0901edb6-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:00:22 [async_llm.py:261] Added request cmpl-59a57367b45d4c318a07a55d0901edb6-0.
INFO 03-02 00:00:23 [logger.py:42] Received request cmpl-c1098d81c58b479fbd251d816fc42689-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:00:23 [async_llm.py:261] Added request cmpl-c1098d81c58b479fbd251d816fc42689-0.
INFO 03-02 00:00:24 [logger.py:42] Received request cmpl-df9e5f6efda14f3cbf53ddc6281aea4f-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:00:24 [async_llm.py:261] Added request cmpl-df9e5f6efda14f3cbf53ddc6281aea4f-0.
INFO 03-02 00:00:25 [logger.py:42] Received request cmpl-0624d9a8655844e0a47de5983ca684b4-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:00:25 [async_llm.py:261] Added request cmpl-0624d9a8655844e0a47de5983ca684b4-0.
INFO 03-02 00:00:27 [logger.py:42] Received request cmpl-9c4558c128434ba69e53e36a995b1409-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:00:27 [async_llm.py:261] Added request cmpl-9c4558c128434ba69e53e36a995b1409-0.
INFO 03-02 00:00:28 [loggers.py:116] Engine 000: Avg prompt throughput: 24.8 tokens/s, Avg generation throughput: 4.0 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.6%
INFO 03-02 00:00:28 [logger.py:42] Received request cmpl-990141c77d7f4d8eadbb2ffc82707c5f-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:00:28 [async_llm.py:261] Added request cmpl-990141c77d7f4d8eadbb2ffc82707c5f-0.
INFO 03-02 00:00:29 [logger.py:42] Received request cmpl-48faa3e2a8814b11a9ee556caa09f502-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:00:29 [async_llm.py:261] Added request cmpl-48faa3e2a8814b11a9ee556caa09f502-0.
INFO 03-02 00:00:30 [logger.py:42] Received request cmpl-73b968fa3e4842c19c029e1240ee839a-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:00:30 [async_llm.py:261] Added request cmpl-73b968fa3e4842c19c029e1240ee839a-0.
INFO 03-02 00:00:31 [logger.py:42] Received request cmpl-85c4ee708ff34a7cb07c8b5377da16b4-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:00:31 [async_llm.py:261] Added request cmpl-85c4ee708ff34a7cb07c8b5377da16b4-0.
INFO 03-02 00:00:32 [logger.py:42] Received request cmpl-3bb34e1dde4640659c18409324dbb4be-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:00:32 [async_llm.py:261] Added request cmpl-3bb34e1dde4640659c18409324dbb4be-0.
INFO 03-02 00:00:33 [logger.py:42] Received request cmpl-93b2d8ea2d0449db83a625924a2cb672-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:00:33 [async_llm.py:261] Added request cmpl-93b2d8ea2d0449db83a625924a2cb672-0.
INFO 03-02 00:00:35 [logger.py:42] Received request cmpl-344176262d8e4fec8874be43d0cf1267-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:00:35 [async_llm.py:261] Added request cmpl-344176262d8e4fec8874be43d0cf1267-0.
INFO 03-02 00:00:36 [logger.py:42] Received request cmpl-207c11bc95f34312a7382daa98391372-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:00:36 [async_llm.py:261] Added request cmpl-207c11bc95f34312a7382daa98391372-0.
INFO 03-02 00:00:37 [logger.py:42] Received request cmpl-1b501c07419245538903eb2130d779f9-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:00:37 [async_llm.py:261] Added request cmpl-1b501c07419245538903eb2130d779f9-0.
INFO 03-02 00:00:38 [loggers.py:116] Engine 000: Avg prompt throughput: 27.9 tokens/s, Avg generation throughput: 4.5 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.6%
INFO 03-02 00:00:38 [logger.py:42] Received request cmpl-a6e7de11019e4e3c90a8a7aed2f7b633-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:00:38 [async_llm.py:261] Added request cmpl-a6e7de11019e4e3c90a8a7aed2f7b633-0.
INFO 03-02 00:00:39 [logger.py:42] Received request cmpl-fa90b20ad52a4a9ab33ddbac7b624763-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:00:39 [async_llm.py:261] Added request cmpl-fa90b20ad52a4a9ab33ddbac7b624763-0.
INFO 03-02 00:00:40 [logger.py:42] Received request cmpl-00c1f10d3b474e77baf2efbb8cbabe98-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:00:40 [async_llm.py:261] Added request cmpl-00c1f10d3b474e77baf2efbb8cbabe98-0.
INFO 03-02 00:00:42 [logger.py:42] Received request cmpl-f13c88e70f0741c7b813f2581748ebc7-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:00:42 [async_llm.py:261] Added request cmpl-f13c88e70f0741c7b813f2581748ebc7-0.
INFO 03-02 00:00:43 [logger.py:42] Received request cmpl-a981537c759f47b19d88133be289bf6e-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:00:43 [async_llm.py:261] Added request cmpl-a981537c759f47b19d88133be289bf6e-0.
INFO 03-02 00:00:44 [logger.py:42] Received request cmpl-d4c356294e1b452985696f2347e4879f-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:00:44 [async_llm.py:261] Added request cmpl-d4c356294e1b452985696f2347e4879f-0.
INFO 03-02 00:00:45 [logger.py:42] Received request cmpl-4813bae8f82247dcac1c8fb86b8d9e12-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:00:45 [async_llm.py:261] Added request cmpl-4813bae8f82247dcac1c8fb86b8d9e12-0.
INFO 03-02 00:00:46 [logger.py:42] Received request cmpl-44715d98e13f486eb55fcff344a4d4f0-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:00:46 [async_llm.py:261] Added request cmpl-44715d98e13f486eb55fcff344a4d4f0-0.
INFO 03-02 00:00:47 [logger.py:42] Received request cmpl-d9d548dc764d480fab8010a67e76536d-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:00:47 [async_llm.py:261] Added request cmpl-d9d548dc764d480fab8010a67e76536d-0.
INFO 03-02 00:00:48 [loggers.py:116] Engine 000: Avg prompt throughput: 27.9 tokens/s, Avg generation throughput: 4.5 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.6%
INFO 03-02 00:00:48 [logger.py:42] Received request cmpl-222c2dbeac5340c697e990fc874de178-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:00:48 [async_llm.py:261] Added request cmpl-222c2dbeac5340c697e990fc874de178-0.
INFO 03-02 00:00:50 [logger.py:42] Received request cmpl-3499e36e04524411b559026782b45f62-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:00:50 [async_llm.py:261] Added request cmpl-3499e36e04524411b559026782b45f62-0.
INFO 03-02 00:00:51 [logger.py:42] Received request cmpl-919573b7f6204e42ad174c6510563ba8-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:00:51 [async_llm.py:261] Added request cmpl-919573b7f6204e42ad174c6510563ba8-0.
INFO 03-02 00:00:52 [logger.py:42] Received request cmpl-f7101bb22e0b4caa86d582258e5d6133-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:00:52 [async_llm.py:261] Added request cmpl-f7101bb22e0b4caa86d582258e5d6133-0.
INFO 03-02 00:00:53 [logger.py:42] Received request cmpl-bb8112db21bb4122aaa0082bca52d954-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:00:53 [async_llm.py:261] Added request cmpl-bb8112db21bb4122aaa0082bca52d954-0.
INFO 03-02 00:00:54 [logger.py:42] Received request cmpl-5cee8f45159145a09e04044f667aea3e-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:00:54 [async_llm.py:261] Added request cmpl-5cee8f45159145a09e04044f667aea3e-0.
INFO 03-02 00:00:55 [logger.py:42] Received request cmpl-ce55a98c7c3d48498f615406fbf3453f-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:00:55 [async_llm.py:261] Added request cmpl-ce55a98c7c3d48498f615406fbf3453f-0.
INFO 03-02 00:00:57 [logger.py:42] Received request cmpl-f0f953d7f3744a48825bde0f7d938458-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:00:57 [async_llm.py:261] Added request cmpl-f0f953d7f3744a48825bde0f7d938458-0.
INFO 03-02 00:00:58 [loggers.py:116] Engine 000: Avg prompt throughput: 24.8 tokens/s, Avg generation throughput: 4.0 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.6%
INFO 03-02 00:00:58 [logger.py:42] Received request cmpl-07701acb40824e8cb9513415b0ca0f03-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:00:58 [async_llm.py:261] Added request cmpl-07701acb40824e8cb9513415b0ca0f03-0.
INFO 03-02 00:00:59 [logger.py:42] Received request cmpl-c6feec6fb2eb4f89b04d958756444851-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:00:59 [async_llm.py:261] Added request cmpl-c6feec6fb2eb4f89b04d958756444851-0.
INFO 03-02 00:01:00 [logger.py:42] Received request cmpl-aaa9c936e0684b20a01cebc53656bb9d-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:01:00 [async_llm.py:261] Added request cmpl-aaa9c936e0684b20a01cebc53656bb9d-0.
INFO 03-02 00:01:01 [logger.py:42] Received request cmpl-367833d568cd452b9a7f2f656077ae31-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:01:01 [async_llm.py:261] Added request cmpl-367833d568cd452b9a7f2f656077ae31-0.
INFO 03-02 00:01:02 [logger.py:42] Received request cmpl-ab8e8aba65fa4ca9ae0b2d3a119fe8f9-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:01:02 [async_llm.py:261] Added request cmpl-ab8e8aba65fa4ca9ae0b2d3a119fe8f9-0.
INFO 03-02 00:01:03 [logger.py:42] Received request cmpl-73a8daa85d264fdea9d6bdbe7c807afe-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:01:03 [async_llm.py:261] Added request cmpl-73a8daa85d264fdea9d6bdbe7c807afe-0.
INFO 03-02 00:01:05 [logger.py:42] Received request cmpl-7eceaaec14194194b0b2d76244030cbe-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:01:05 [async_llm.py:261] Added request cmpl-7eceaaec14194194b0b2d76244030cbe-0.
INFO 03-02 00:01:06 [logger.py:42] Received request cmpl-ec2ebfafddda4efbaebabcfc1cbdc88f-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:01:06 [async_llm.py:261] Added request cmpl-ec2ebfafddda4efbaebabcfc1cbdc88f-0.
INFO 03-02 00:01:07 [logger.py:42] Received request cmpl-d7b9d605a167483baa7b3f210a80c091-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:01:07 [async_llm.py:261] Added request cmpl-d7b9d605a167483baa7b3f210a80c091-0.
INFO 03-02 00:01:08 [loggers.py:116] Engine 000: Avg prompt throughput: 27.9 tokens/s, Avg generation throughput: 4.5 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.6%
INFO 03-02 00:01:08 [logger.py:42] Received request cmpl-22736a2d0fec4d0f9a33b24b22ff23eb-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:01:08 [async_llm.py:261] Added request cmpl-22736a2d0fec4d0f9a33b24b22ff23eb-0.
INFO 03-02 00:01:09 [logger.py:42] Received request cmpl-add5eb852d614cccafdcf6bf60a3587b-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:01:09 [async_llm.py:261] Added request cmpl-add5eb852d614cccafdcf6bf60a3587b-0.
INFO 03-02 00:01:10 [logger.py:42] Received request cmpl-86d4c8250b4a4c6a9f17355ab802ce9a-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:01:10 [async_llm.py:261] Added request cmpl-86d4c8250b4a4c6a9f17355ab802ce9a-0.
INFO 03-02 00:01:12 [logger.py:42] Received request cmpl-54e2748d338b4cefa67ebd92955e7620-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:01:12 [async_llm.py:261] Added request cmpl-54e2748d338b4cefa67ebd92955e7620-0.
INFO 03-02 00:01:13 [logger.py:42] Received request cmpl-61eeab2cdf204758996d5d23be0b8d71-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:01:13 [async_llm.py:261] Added request cmpl-61eeab2cdf204758996d5d23be0b8d71-0.
INFO 03-02 00:01:14 [logger.py:42] Received request cmpl-133d0914faa5463497f14d442d9082e7-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:01:14 [async_llm.py:261] Added request cmpl-133d0914faa5463497f14d442d9082e7-0.
INFO 03-02 00:01:15 [logger.py:42] Received request cmpl-c3213e27adef46b3a39906d257650c76-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:01:15 [async_llm.py:261] Added request cmpl-c3213e27adef46b3a39906d257650c76-0.
INFO 03-02 00:01:16 [logger.py:42] Received request cmpl-eef85a6c6f8b41c98e2ea8bcacac467b-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:01:16 [async_llm.py:261] Added request cmpl-eef85a6c6f8b41c98e2ea8bcacac467b-0.
INFO 03-02 00:01:17 [logger.py:42] Received request cmpl-65b4becf50ff4b7ab397cb5d3120063a-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:01:17 [async_llm.py:261] Added request cmpl-65b4becf50ff4b7ab397cb5d3120063a-0.
INFO 03-02 00:01:18 [loggers.py:116] Engine 000: Avg prompt throughput: 27.9 tokens/s, Avg generation throughput: 4.5 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.6%
INFO 03-02 00:01:18 [logger.py:42] Received request cmpl-60aa534e9f9f44e1954ea3ec659e158b-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:01:18 [async_llm.py:261] Added request cmpl-60aa534e9f9f44e1954ea3ec659e158b-0.
INFO 03-02 00:01:20 [logger.py:42] Received request cmpl-e18c7eb3dc984d3db935d9bbe4d19b2b-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:01:20 [async_llm.py:261] Added request cmpl-e18c7eb3dc984d3db935d9bbe4d19b2b-0.
INFO 03-02 00:01:21 [logger.py:42] Received request cmpl-a1905469f31c4b4c8b90211fe90e5e79-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:01:21 [async_llm.py:261] Added request cmpl-a1905469f31c4b4c8b90211fe90e5e79-0.
INFO 03-02 00:01:22 [logger.py:42] Received request cmpl-32e373362f7a4f9a8be304baaa6f5959-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:01:22 [async_llm.py:261] Added request cmpl-32e373362f7a4f9a8be304baaa6f5959-0.
INFO 03-02 00:01:23 [logger.py:42] Received request cmpl-2f9f5869dffe43848e91b12644321797-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:01:23 [async_llm.py:261] Added request cmpl-2f9f5869dffe43848e91b12644321797-0.
INFO 03-02 00:01:24 [logger.py:42] Received request cmpl-6244d55141d84084887550f28e3fdde0-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:01:24 [async_llm.py:261] Added request cmpl-6244d55141d84084887550f28e3fdde0-0.
INFO 03-02 00:01:25 [logger.py:42] Received request cmpl-8f4d34b5d34441d28e77d8f961841eef-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:01:25 [async_llm.py:261] Added request cmpl-8f4d34b5d34441d28e77d8f961841eef-0.
INFO 03-02 00:01:27 [logger.py:42] Received request cmpl-17ab9d0282b44e08a4282e9e1068c4e6-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:01:27 [async_llm.py:261] Added request cmpl-17ab9d0282b44e08a4282e9e1068c4e6-0.
INFO 03-02 00:01:28 [loggers.py:116] Engine 000: Avg prompt throughput: 24.8 tokens/s, Avg generation throughput: 4.0 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.6%
INFO 03-02 00:01:28 [logger.py:42] Received request cmpl-3bba9d20a57d4a1b8e7abc217d84bd38-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:01:28 [async_llm.py:261] Added request cmpl-3bba9d20a57d4a1b8e7abc217d84bd38-0.
INFO 03-02 00:01:29 [logger.py:42] Received request cmpl-be402c2f82274389a0b4e477b65d8cf6-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:01:29 [async_llm.py:261] Added request cmpl-be402c2f82274389a0b4e477b65d8cf6-0.
INFO 03-02 00:01:30 [logger.py:42] Received request cmpl-d9cfe30269f240fe8d62ed28bd0cdc17-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:01:30 [async_llm.py:261] Added request cmpl-d9cfe30269f240fe8d62ed28bd0cdc17-0.
INFO 03-02 00:01:31 [logger.py:42] Received request cmpl-64cc98d610324dec88240dbf17ddc890-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:01:31 [async_llm.py:261] Added request cmpl-64cc98d610324dec88240dbf17ddc890-0.
INFO 03-02 00:01:32 [logger.py:42] Received request cmpl-baad56a6df824aa0b32486dbb85afec7-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:01:32 [async_llm.py:261] Added request cmpl-baad56a6df824aa0b32486dbb85afec7-0.
INFO 03-02 00:01:33 [logger.py:42] Received request cmpl-aa61518ec281490dbd59b9d33dd25c98-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:01:33 [async_llm.py:261] Added request cmpl-aa61518ec281490dbd59b9d33dd25c98-0.
INFO 03-02 00:01:35 [logger.py:42] Received request cmpl-cb32d9254276440ea1687736a5c55e71-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:01:35 [async_llm.py:261] Added request cmpl-cb32d9254276440ea1687736a5c55e71-0.
INFO 03-02 00:01:36 [logger.py:42] Received request cmpl-200965e28f094eea9393c9af50074244-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:01:36 [async_llm.py:261] Added request cmpl-200965e28f094eea9393c9af50074244-0.
INFO 03-02 00:01:37 [logger.py:42] Received request cmpl-f7242a1721ff4727ba33bfd81db8cfe8-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:01:37 [async_llm.py:261] Added request cmpl-f7242a1721ff4727ba33bfd81db8cfe8-0.
INFO 03-02 00:01:38 [loggers.py:116] Engine 000: Avg prompt throughput: 27.9 tokens/s, Avg generation throughput: 4.5 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.6%
INFO 03-02 00:01:38 [logger.py:42] Received request cmpl-e2e07b61675c4bd89beac1c9cb8c8a7e-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:01:38 [async_llm.py:261] Added request cmpl-e2e07b61675c4bd89beac1c9cb8c8a7e-0.
INFO 03-02 00:01:39 [logger.py:42] Received request cmpl-7d4ae00dd38f44f4bd7cace17010e696-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:01:39 [async_llm.py:261] Added request cmpl-7d4ae00dd38f44f4bd7cace17010e696-0.
INFO 03-02 00:01:40 [logger.py:42] Received request cmpl-16bf8b23b7404515813d987a83eb8cf9-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:01:40 [async_llm.py:261] Added request cmpl-16bf8b23b7404515813d987a83eb8cf9-0.
INFO 03-02 00:01:42 [logger.py:42] Received request cmpl-f219f8e2f272470d880240ddeb701661-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:01:42 [async_llm.py:261] Added request cmpl-f219f8e2f272470d880240ddeb701661-0.
INFO 03-02 00:01:43 [logger.py:42] Received request cmpl-636d7036588c47b7b317c87c88889546-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:01:43 [async_llm.py:261] Added request cmpl-636d7036588c47b7b317c87c88889546-0.
INFO 03-02 00:01:44 [logger.py:42] Received request cmpl-6714721da58a4a119a223d6bf8235d89-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:01:44 [async_llm.py:261] Added request cmpl-6714721da58a4a119a223d6bf8235d89-0.
INFO 03-02 00:01:45 [logger.py:42] Received request cmpl-b0aefad7243c42f08ff447523f4ba4cf-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:01:45 [async_llm.py:261] Added request cmpl-b0aefad7243c42f08ff447523f4ba4cf-0.
INFO 03-02 00:01:46 [logger.py:42] Received request cmpl-aef75630012d418e86c75cd5a35732d7-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:01:46 [async_llm.py:261] Added request cmpl-aef75630012d418e86c75cd5a35732d7-0.
INFO 03-02 00:01:47 [logger.py:42] Received request cmpl-eb1638b5cdbb4ac0941de60cc9a33072-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:01:47 [async_llm.py:261] Added request cmpl-eb1638b5cdbb4ac0941de60cc9a33072-0.
INFO 03-02 00:01:48 [loggers.py:116] Engine 000: Avg prompt throughput: 27.9 tokens/s, Avg generation throughput: 4.5 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.6%
INFO 03-02 00:01:48 [logger.py:42] Received request cmpl-fddcb411f9954d92a2639440845064d4-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:01:48 [async_llm.py:261] Added request cmpl-fddcb411f9954d92a2639440845064d4-0.
INFO 03-02 00:01:50 [logger.py:42] Received request cmpl-e5dabf77cc9742bea658fefc90a6775a-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:01:50 [async_llm.py:261] Added request cmpl-e5dabf77cc9742bea658fefc90a6775a-0.
INFO 03-02 00:01:51 [logger.py:42] Received request cmpl-f4cafbe67a2a419696eb6e896223171f-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:01:51 [async_llm.py:261] Added request cmpl-f4cafbe67a2a419696eb6e896223171f-0.
INFO 03-02 00:01:52 [logger.py:42] Received request cmpl-dced360cf01d46969366b21875749ff2-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:01:52 [async_llm.py:261] Added request cmpl-dced360cf01d46969366b21875749ff2-0.
INFO 03-02 00:01:53 [logger.py:42] Received request cmpl-7617b8c02d77408b9e9a9cce96922517-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:01:53 [async_llm.py:261] Added request cmpl-7617b8c02d77408b9e9a9cce96922517-0.
INFO 03-02 00:01:54 [logger.py:42] Received request cmpl-0d932b7e072c46e2b7128a9bb07b612b-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:01:54 [async_llm.py:261] Added request cmpl-0d932b7e072c46e2b7128a9bb07b612b-0.
INFO 03-02 00:01:55 [logger.py:42] Received request cmpl-9709efd0bc4f4b5cb0f822f4b5b59819-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:01:55 [async_llm.py:261] Added request cmpl-9709efd0bc4f4b5cb0f822f4b5b59819-0.
INFO 03-02 00:01:57 [logger.py:42] Received request cmpl-381cd46ed92640028cdab9d94f50b85b-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:01:57 [async_llm.py:261] Added request cmpl-381cd46ed92640028cdab9d94f50b85b-0.
INFO 03-02 00:01:58 [loggers.py:116] Engine 000: Avg prompt throughput: 24.8 tokens/s, Avg generation throughput: 4.0 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.6%
INFO 03-02 00:01:58 [logger.py:42] Received request cmpl-eeb283c6916f45a39edcc20c1b7229b6-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:01:58 [async_llm.py:261] Added request cmpl-eeb283c6916f45a39edcc20c1b7229b6-0.
INFO 03-02 00:01:59 [logger.py:42] Received request cmpl-4b5fda9fdd10456882fc5bb30a382655-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:01:59 [async_llm.py:261] Added request cmpl-4b5fda9fdd10456882fc5bb30a382655-0.
INFO 03-02 00:02:00 [logger.py:42] Received request cmpl-e70fab621d73489688e27cb1a369eda1-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:02:00 [async_llm.py:261] Added request cmpl-e70fab621d73489688e27cb1a369eda1-0.
INFO 03-02 00:02:01 [logger.py:42] Received request cmpl-076ddf60f9414d408ea8bce845b11ac7-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:02:01 [async_llm.py:261] Added request cmpl-076ddf60f9414d408ea8bce845b11ac7-0.
INFO 03-02 00:02:02 [logger.py:42] Received request cmpl-ff5f757258cb4cf585a865e1c3343048-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:02:02 [async_llm.py:261] Added request cmpl-ff5f757258cb4cf585a865e1c3343048-0.
INFO 03-02 00:02:03 [logger.py:42] Received request cmpl-871cfe597e744278922142fdce40c852-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:02:03 [async_llm.py:261] Added request cmpl-871cfe597e744278922142fdce40c852-0.
INFO 03-02 00:02:05 [logger.py:42] Received request cmpl-d7a39b4642424d84a764d751bb0fb933-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:02:05 [async_llm.py:261] Added request cmpl-d7a39b4642424d84a764d751bb0fb933-0.
INFO 03-02 00:02:06 [logger.py:42] Received request cmpl-ed9d4c1c64f94d809db21bddbcdd57b6-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:02:06 [async_llm.py:261] Added request cmpl-ed9d4c1c64f94d809db21bddbcdd57b6-0.
INFO 03-02 00:02:07 [logger.py:42] Received request cmpl-39c503e8c18e432cae9ad1c2f7a3bece-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:02:07 [async_llm.py:261] Added request cmpl-39c503e8c18e432cae9ad1c2f7a3bece-0.
INFO 03-02 00:02:08 [loggers.py:116] Engine 000: Avg prompt throughput: 27.9 tokens/s, Avg generation throughput: 4.5 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.6%
INFO 03-02 00:02:08 [logger.py:42] Received request cmpl-9951a8af4ffc4250ba95f4919bae5bcd-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:02:08 [async_llm.py:261] Added request cmpl-9951a8af4ffc4250ba95f4919bae5bcd-0.
INFO 03-02 00:02:09 [logger.py:42] Received request cmpl-a9b5ec9117c04b6cbd14eac76dfd19c6-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:02:09 [async_llm.py:261] Added request cmpl-a9b5ec9117c04b6cbd14eac76dfd19c6-0.
INFO 03-02 00:02:10 [logger.py:42] Received request cmpl-3deca812c9954da8985b08883f785f57-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:02:10 [async_llm.py:261] Added request cmpl-3deca812c9954da8985b08883f785f57-0.
INFO 03-02 00:02:12 [logger.py:42] Received request cmpl-3d502ab4aff842938a644edf649c05a0-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:02:12 [async_llm.py:261] Added request cmpl-3d502ab4aff842938a644edf649c05a0-0.
INFO 03-02 00:02:13 [logger.py:42] Received request cmpl-bebb88b3d29e4a18afb69f9e146d4c97-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:02:13 [async_llm.py:261] Added request cmpl-bebb88b3d29e4a18afb69f9e146d4c97-0.
INFO 03-02 00:02:14 [logger.py:42] Received request cmpl-aa0bf3dc4d354ba69a17f173a6b345c2-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:02:14 [async_llm.py:261] Added request cmpl-aa0bf3dc4d354ba69a17f173a6b345c2-0.
INFO 03-02 00:02:15 [logger.py:42] Received request cmpl-634dacd0daee4d09b8619728150ffc79-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:02:15 [async_llm.py:261] Added request cmpl-634dacd0daee4d09b8619728150ffc79-0.
INFO 03-02 00:02:16 [logger.py:42] Received request cmpl-eb4a842c73a04c8ab545d1f372b126b3-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:02:16 [async_llm.py:261] Added request cmpl-eb4a842c73a04c8ab545d1f372b126b3-0.
INFO 03-02 00:02:17 [logger.py:42] Received request cmpl-c53196c303e04d20880170c8be3894a5-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:02:17 [async_llm.py:261] Added request cmpl-c53196c303e04d20880170c8be3894a5-0.
INFO 03-02 00:02:18 [loggers.py:116] Engine 000: Avg prompt throughput: 27.9 tokens/s, Avg generation throughput: 4.5 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.6%
INFO 03-02 00:02:18 [logger.py:42] Received request cmpl-17f0514c1f474cd097604189a415cb0e-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:02:18 [async_llm.py:261] Added request cmpl-17f0514c1f474cd097604189a415cb0e-0.
INFO 03-02 00:02:20 [logger.py:42] Received request cmpl-0010d9eb71a8499fb556bd42ede07296-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:02:20 [async_llm.py:261] Added request cmpl-0010d9eb71a8499fb556bd42ede07296-0.
INFO 03-02 00:02:21 [logger.py:42] Received request cmpl-6d29324236a74bc790278247d68cd370-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:02:21 [async_llm.py:261] Added request cmpl-6d29324236a74bc790278247d68cd370-0.
INFO 03-02 00:02:22 [logger.py:42] Received request cmpl-b18b247aea264f9d817f15413ef46eca-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:02:22 [async_llm.py:261] Added request cmpl-b18b247aea264f9d817f15413ef46eca-0.
INFO 03-02 00:02:23 [logger.py:42] Received request cmpl-ded9333133a74f15baae149d8cd4903b-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:02:23 [async_llm.py:261] Added request cmpl-ded9333133a74f15baae149d8cd4903b-0.
INFO 03-02 00:02:24 [logger.py:42] Received request cmpl-73e84cd470fd4da3bb7369a8eb13ff2c-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:02:24 [async_llm.py:261] Added request cmpl-73e84cd470fd4da3bb7369a8eb13ff2c-0.
INFO 03-02 00:02:25 [logger.py:42] Received request cmpl-2b01db05c5ba4288b895bd0cc9b76c89-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:02:25 [async_llm.py:261] Added request cmpl-2b01db05c5ba4288b895bd0cc9b76c89-0.
INFO 03-02 00:02:27 [logger.py:42] Received request cmpl-942e29b3274345edaedb869d946132cf-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:02:27 [async_llm.py:261] Added request cmpl-942e29b3274345edaedb869d946132cf-0.
INFO 03-02 00:02:28 [loggers.py:116] Engine 000: Avg prompt throughput: 24.8 tokens/s, Avg generation throughput: 4.0 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.6%
INFO 03-02 00:02:28 [logger.py:42] Received request cmpl-9e9f2533f4d649b6a1c733ad53a0a1eb-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:02:28 [async_llm.py:261] Added request cmpl-9e9f2533f4d649b6a1c733ad53a0a1eb-0.
INFO 03-02 00:02:29 [logger.py:42] Received request cmpl-a343c4f059f744bab2429713cc7bf445-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:02:29 [async_llm.py:261] Added request cmpl-a343c4f059f744bab2429713cc7bf445-0.
INFO 03-02 00:02:30 [logger.py:42] Received request cmpl-46160828e3374d12a2b77f90734b8505-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:02:30 [async_llm.py:261] Added request cmpl-46160828e3374d12a2b77f90734b8505-0.
INFO 03-02 00:02:31 [logger.py:42] Received request cmpl-fa6e4b340f3447d78a19784511251cde-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:02:31 [async_llm.py:261] Added request cmpl-fa6e4b340f3447d78a19784511251cde-0.
INFO 03-02 00:02:32 [logger.py:42] Received request cmpl-272a9c1109e74655a6ec909991140e88-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:02:32 [async_llm.py:261] Added request cmpl-272a9c1109e74655a6ec909991140e88-0.
INFO 03-02 00:02:33 [logger.py:42] Received request cmpl-f6a0d464e65d4419ad6090aff2d6db12-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:02:33 [async_llm.py:261] Added request cmpl-f6a0d464e65d4419ad6090aff2d6db12-0.
INFO 03-02 00:02:35 [logger.py:42] Received request cmpl-415e1d487beb4bd390fd1b0d46fecc01-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:02:35 [async_llm.py:261] Added request cmpl-415e1d487beb4bd390fd1b0d46fecc01-0.
INFO 03-02 00:02:36 [logger.py:42] Received request cmpl-040feaff9bbf4bd882a3c39e43f09a7e-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:02:36 [async_llm.py:261] Added request cmpl-040feaff9bbf4bd882a3c39e43f09a7e-0.
INFO 03-02 00:02:37 [logger.py:42] Received request cmpl-d2b31312810c49c29823541a87daa440-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:02:37 [async_llm.py:261] Added request cmpl-d2b31312810c49c29823541a87daa440-0.
INFO 03-02 00:02:38 [loggers.py:116] Engine 000: Avg prompt throughput: 27.9 tokens/s, Avg generation throughput: 4.5 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.6%
INFO 03-02 00:02:38 [logger.py:42] Received request cmpl-b0eb5cabfd6b4f2da8728228739a65ef-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:02:38 [async_llm.py:261] Added request cmpl-b0eb5cabfd6b4f2da8728228739a65ef-0.
INFO 03-02 00:02:39 [logger.py:42] Received request cmpl-0fa5bf0e31e74f0ca15ffdc756963b35-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:02:39 [async_llm.py:261] Added request cmpl-0fa5bf0e31e74f0ca15ffdc756963b35-0.
INFO 03-02 00:02:40 [logger.py:42] Received request cmpl-be3b7f1b84134d45bc6acd99f5bc6592-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:02:40 [async_llm.py:261] Added request cmpl-be3b7f1b84134d45bc6acd99f5bc6592-0.
INFO 03-02 00:02:42 [logger.py:42] Received request cmpl-803e6e0c5a9743d0b5adad483357dbc8-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:02:42 [async_llm.py:261] Added request cmpl-803e6e0c5a9743d0b5adad483357dbc8-0.
INFO 03-02 00:02:43 [logger.py:42] Received request cmpl-009f5e1f6cda468791b9e580d2989a29-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:02:43 [async_llm.py:261] Added request cmpl-009f5e1f6cda468791b9e580d2989a29-0.
INFO 03-02 00:02:44 [logger.py:42] Received request cmpl-0564f11a7a3543c39cc9d4bb20b5eee4-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:02:44 [async_llm.py:261] Added request cmpl-0564f11a7a3543c39cc9d4bb20b5eee4-0.
INFO 03-02 00:02:45 [logger.py:42] Received request cmpl-072369c7bed842828dbdb968308309a7-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:02:45 [async_llm.py:261] Added request cmpl-072369c7bed842828dbdb968308309a7-0.
INFO 03-02 00:02:46 [logger.py:42] Received request cmpl-f588cbb5a3ef4b2cbcd9560bed758a36-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:02:46 [async_llm.py:261] Added request cmpl-f588cbb5a3ef4b2cbcd9560bed758a36-0.
INFO 03-02 00:02:47 [logger.py:42] Received request cmpl-633b9b505d1e43aa9d030b8e6ab5356b-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:02:47 [async_llm.py:261] Added request cmpl-633b9b505d1e43aa9d030b8e6ab5356b-0.
INFO 03-02 00:02:48 [loggers.py:116] Engine 000: Avg prompt throughput: 27.9 tokens/s, Avg generation throughput: 4.5 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.6%
INFO 03-02 00:02:48 [logger.py:42] Received request cmpl-d3594b207694469d884e34150845c74c-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:02:48 [async_llm.py:261] Added request cmpl-d3594b207694469d884e34150845c74c-0.
INFO 03-02 00:02:50 [logger.py:42] Received request cmpl-3903caa0d7914975ae5bdeac09857676-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:02:50 [async_llm.py:261] Added request cmpl-3903caa0d7914975ae5bdeac09857676-0.
INFO 03-02 00:02:51 [logger.py:42] Received request cmpl-7783b9fb636c408dba044be74a4cdf80-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:02:51 [async_llm.py:261] Added request cmpl-7783b9fb636c408dba044be74a4cdf80-0.
INFO 03-02 00:02:52 [logger.py:42] Received request cmpl-4b5d1e76e5ac495f9d9f6f32822a2500-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:02:52 [async_llm.py:261] Added request cmpl-4b5d1e76e5ac495f9d9f6f32822a2500-0.
INFO 03-02 00:02:53 [logger.py:42] Received request cmpl-0cf8fa4abd99488c8d922a1233f4e929-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:02:53 [async_llm.py:261] Added request cmpl-0cf8fa4abd99488c8d922a1233f4e929-0.
INFO 03-02 00:02:54 [logger.py:42] Received request cmpl-a5c07dc354d047278b634a9f53545e52-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:02:54 [async_llm.py:261] Added request cmpl-a5c07dc354d047278b634a9f53545e52-0.
INFO 03-02 00:02:55 [logger.py:42] Received request cmpl-af3882524ba74a6a96a6ac3c9757ee16-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:02:55 [async_llm.py:261] Added request cmpl-af3882524ba74a6a96a6ac3c9757ee16-0.
INFO 03-02 00:02:57 [logger.py:42] Received request cmpl-58c05b187be34e8d9c64f8e65f552ab6-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:02:57 [async_llm.py:261] Added request cmpl-58c05b187be34e8d9c64f8e65f552ab6-0.
INFO 03-02 00:02:58 [loggers.py:116] Engine 000: Avg prompt throughput: 24.8 tokens/s, Avg generation throughput: 4.0 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.6%
INFO 03-02 00:02:58 [logger.py:42] Received request cmpl-0314d4a392d54a75b1a9b9222f770d8b-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:02:58 [async_llm.py:261] Added request cmpl-0314d4a392d54a75b1a9b9222f770d8b-0.
INFO 03-02 00:02:59 [logger.py:42] Received request cmpl-68b68b9dd6ef4f99ae58e914e64d9d09-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:02:59 [async_llm.py:261] Added request cmpl-68b68b9dd6ef4f99ae58e914e64d9d09-0.
INFO 03-02 00:03:00 [logger.py:42] Received request cmpl-70ad49d03eb34c8b85179de205f1356d-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:03:00 [async_llm.py:261] Added request cmpl-70ad49d03eb34c8b85179de205f1356d-0.
INFO 03-02 00:03:01 [logger.py:42] Received request cmpl-461c64b05e0c4805be612c1f1464eb97-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:03:01 [async_llm.py:261] Added request cmpl-461c64b05e0c4805be612c1f1464eb97-0.
INFO 03-02 00:03:02 [logger.py:42] Received request cmpl-db94412af40545a0a1e40a90149fbfeb-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:03:02 [async_llm.py:261] Added request cmpl-db94412af40545a0a1e40a90149fbfeb-0.
INFO 03-02 00:03:03 [logger.py:42] Received request cmpl-d454ce84e0f64013bf9913f4b62b7be1-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:03:03 [async_llm.py:261] Added request cmpl-d454ce84e0f64013bf9913f4b62b7be1-0.
INFO 03-02 00:03:05 [logger.py:42] Received request cmpl-faba3b046f0d4504ab1d59dee9f26532-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:03:05 [async_llm.py:261] Added request cmpl-faba3b046f0d4504ab1d59dee9f26532-0.
INFO 03-02 00:03:06 [logger.py:42] Received request cmpl-dfb95c8979fb481a950dc24585f517c6-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:03:06 [async_llm.py:261] Added request cmpl-dfb95c8979fb481a950dc24585f517c6-0.
INFO 03-02 00:03:07 [logger.py:42] Received request cmpl-3966e82be87140fbbeb4023fb68d1528-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:03:07 [async_llm.py:261] Added request cmpl-3966e82be87140fbbeb4023fb68d1528-0.
INFO 03-02 00:03:08 [loggers.py:116] Engine 000: Avg prompt throughput: 27.9 tokens/s, Avg generation throughput: 4.5 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.6%
INFO 03-02 00:03:08 [logger.py:42] Received request cmpl-fe0275dcedbd4e499219ef0506afe885-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:03:08 [async_llm.py:261] Added request cmpl-fe0275dcedbd4e499219ef0506afe885-0.
INFO 03-02 00:03:09 [logger.py:42] Received request cmpl-80583de526c24c3f9f188a9e8d6821dc-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:03:09 [async_llm.py:261] Added request cmpl-80583de526c24c3f9f188a9e8d6821dc-0.
INFO 03-02 00:03:10 [logger.py:42] Received request cmpl-15ff53850fba47db93401e6e0fefacf9-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:03:10 [async_llm.py:261] Added request cmpl-15ff53850fba47db93401e6e0fefacf9-0.
INFO 03-02 00:03:12 [logger.py:42] Received request cmpl-4c728d5b8284449cb665fbbd6cd4cc9e-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:03:12 [async_llm.py:261] Added request cmpl-4c728d5b8284449cb665fbbd6cd4cc9e-0.
INFO 03-02 00:03:13 [logger.py:42] Received request cmpl-92d3eeea64b04b28b13927047ed877b4-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:03:13 [async_llm.py:261] Added request cmpl-92d3eeea64b04b28b13927047ed877b4-0.
INFO 03-02 00:03:14 [logger.py:42] Received request cmpl-e87a63e7772f4b83909d2c6bc36c4080-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:03:14 [async_llm.py:261] Added request cmpl-e87a63e7772f4b83909d2c6bc36c4080-0.
INFO 03-02 00:03:15 [logger.py:42] Received request cmpl-5507b9b677144cff9fef8222d5562adc-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:03:15 [async_llm.py:261] Added request cmpl-5507b9b677144cff9fef8222d5562adc-0.
INFO 03-02 00:03:16 [logger.py:42] Received request cmpl-499881552ffc47bd86f0156905811ee7-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:03:16 [async_llm.py:261] Added request cmpl-499881552ffc47bd86f0156905811ee7-0.
INFO 03-02 00:03:17 [logger.py:42] Received request cmpl-cbbf7efd6185465794c1c4661f7f3c93-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:03:17 [async_llm.py:261] Added request cmpl-cbbf7efd6185465794c1c4661f7f3c93-0.
INFO 03-02 00:03:18 [loggers.py:116] Engine 000: Avg prompt throughput: 27.9 tokens/s, Avg generation throughput: 4.5 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.6%
INFO 03-02 00:03:18 [logger.py:42] Received request cmpl-89cd1fff5d55441eae28f5de545f225e-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:03:18 [async_llm.py:261] Added request cmpl-89cd1fff5d55441eae28f5de545f225e-0.
INFO 03-02 00:03:20 [logger.py:42] Received request cmpl-9827dd37afb34245a8e060aba130e830-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:03:20 [async_llm.py:261] Added request cmpl-9827dd37afb34245a8e060aba130e830-0.
INFO 03-02 00:03:21 [logger.py:42] Received request cmpl-8cadc4b8ed8e48e89d7d9e82a979bfab-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:03:21 [async_llm.py:261] Added request cmpl-8cadc4b8ed8e48e89d7d9e82a979bfab-0.
INFO 03-02 00:03:22 [logger.py:42] Received request cmpl-d39e860037dc4f9cb90a18413b007f5b-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:03:22 [async_llm.py:261] Added request cmpl-d39e860037dc4f9cb90a18413b007f5b-0.
INFO 03-02 00:03:23 [logger.py:42] Received request cmpl-1be22245d5a3488b8d9c064a95391e7d-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:03:23 [async_llm.py:261] Added request cmpl-1be22245d5a3488b8d9c064a95391e7d-0.
INFO 03-02 00:03:24 [logger.py:42] Received request cmpl-e58040f58ce64be486b8e28dddd1317d-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:03:24 [async_llm.py:261] Added request cmpl-e58040f58ce64be486b8e28dddd1317d-0.
INFO 03-02 00:03:25 [logger.py:42] Received request cmpl-9bd41de15d5944fd8fe5c11ad2fc5a42-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:03:25 [async_llm.py:261] Added request cmpl-9bd41de15d5944fd8fe5c11ad2fc5a42-0.
INFO 03-02 00:03:27 [logger.py:42] Received request cmpl-edc2aae6bb244d96b9007348e975a6b7-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:03:27 [async_llm.py:261] Added request cmpl-edc2aae6bb244d96b9007348e975a6b7-0.
INFO 03-02 00:03:28 [loggers.py:116] Engine 000: Avg prompt throughput: 24.8 tokens/s, Avg generation throughput: 4.0 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.6%
INFO 03-02 00:03:28 [logger.py:42] Received request cmpl-826e504c63e14513a1e7943f825a1f1f-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:03:28 [async_llm.py:261] Added request cmpl-826e504c63e14513a1e7943f825a1f1f-0.
INFO 03-02 00:03:29 [logger.py:42] Received request cmpl-5a94e480560846e7b12ff2fbed7a41bb-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:03:29 [async_llm.py:261] Added request cmpl-5a94e480560846e7b12ff2fbed7a41bb-0.
INFO 03-02 00:03:30 [logger.py:42] Received request cmpl-984c2a43fff24d49abeff3daa094a8c5-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:03:30 [async_llm.py:261] Added request cmpl-984c2a43fff24d49abeff3daa094a8c5-0.
INFO 03-02 00:03:31 [logger.py:42] Received request cmpl-26ef8411abf74f989df1fe143b9a1708-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:03:31 [async_llm.py:261] Added request cmpl-26ef8411abf74f989df1fe143b9a1708-0.
INFO 03-02 00:03:32 [logger.py:42] Received request cmpl-7c6ebd6199f942879045e2562862184d-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:03:32 [async_llm.py:261] Added request cmpl-7c6ebd6199f942879045e2562862184d-0.
INFO 03-02 00:03:33 [logger.py:42] Received request cmpl-b6133f4ad5d24587b8fbbe40b3b3e354-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:03:33 [async_llm.py:261] Added request cmpl-b6133f4ad5d24587b8fbbe40b3b3e354-0.
INFO 03-02 00:03:35 [logger.py:42] Received request cmpl-ebba6734c4af4e26a374d2b1b18293a8-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:03:35 [async_llm.py:261] Added request cmpl-ebba6734c4af4e26a374d2b1b18293a8-0.
INFO 03-02 00:03:36 [logger.py:42] Received request cmpl-3c01e94568ee437dbeae8b512814ca63-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:03:36 [async_llm.py:261] Added request cmpl-3c01e94568ee437dbeae8b512814ca63-0.
INFO 03-02 00:03:37 [logger.py:42] Received request cmpl-31050c70ee2749108fb70d7388338dd3-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:03:37 [async_llm.py:261] Added request cmpl-31050c70ee2749108fb70d7388338dd3-0.
INFO 03-02 00:03:38 [loggers.py:116] Engine 000: Avg prompt throughput: 27.9 tokens/s, Avg generation throughput: 4.5 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.6%
INFO 03-02 00:03:38 [logger.py:42] Received request cmpl-8c44111db06241d2b30f9d1736063479-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:03:38 [async_llm.py:261] Added request cmpl-8c44111db06241d2b30f9d1736063479-0.
INFO 03-02 00:03:39 [logger.py:42] Received request cmpl-3c915491d0d74c07b0f91685b8e1af9b-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:03:39 [async_llm.py:261] Added request cmpl-3c915491d0d74c07b0f91685b8e1af9b-0.
INFO 03-02 00:03:40 [logger.py:42] Received request cmpl-4b4f45e6e8214cb99c724cd40c1bede9-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:03:40 [async_llm.py:261] Added request cmpl-4b4f45e6e8214cb99c724cd40c1bede9-0.
INFO 03-02 00:03:42 [logger.py:42] Received request cmpl-ed0c8fba92da44e6ae8f855e67035f17-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:03:42 [async_llm.py:261] Added request cmpl-ed0c8fba92da44e6ae8f855e67035f17-0.
INFO 03-02 00:03:43 [logger.py:42] Received request cmpl-30a10fd1894e4c31b79c741db021fd4a-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:03:43 [async_llm.py:261] Added request cmpl-30a10fd1894e4c31b79c741db021fd4a-0.
INFO 03-02 00:03:44 [logger.py:42] Received request cmpl-3cb54240071d47c686da39857ab791cb-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:03:44 [async_llm.py:261] Added request cmpl-3cb54240071d47c686da39857ab791cb-0.
INFO 03-02 00:03:45 [logger.py:42] Received request cmpl-7c44431a4d474b2e90c0687b61e20fe2-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:03:45 [async_llm.py:261] Added request cmpl-7c44431a4d474b2e90c0687b61e20fe2-0.
INFO 03-02 00:03:46 [logger.py:42] Received request cmpl-fe01d6198b464f03a49eb6c050b9dc1d-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:03:46 [async_llm.py:261] Added request cmpl-fe01d6198b464f03a49eb6c050b9dc1d-0.
INFO 03-02 00:03:47 [logger.py:42] Received request cmpl-f14b3b45bdb24530a6674ebf0e6eb8e1-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:03:47 [async_llm.py:261] Added request cmpl-f14b3b45bdb24530a6674ebf0e6eb8e1-0.
INFO 03-02 00:03:48 [loggers.py:116] Engine 000: Avg prompt throughput: 27.9 tokens/s, Avg generation throughput: 4.5 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.6%
INFO 03-02 00:03:48 [logger.py:42] Received request cmpl-f5fe18c8d5cb41418e8e7b3d9bf3f37b-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:03:48 [async_llm.py:261] Added request cmpl-f5fe18c8d5cb41418e8e7b3d9bf3f37b-0.
INFO 03-02 00:03:50 [logger.py:42] Received request cmpl-220d8daf61924e81b3b5abe6b111a334-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:03:50 [async_llm.py:261] Added request cmpl-220d8daf61924e81b3b5abe6b111a334-0.
INFO 03-02 00:03:51 [logger.py:42] Received request cmpl-e2d09d4de08643e999fd9cc0fe0d04dc-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:03:51 [async_llm.py:261] Added request cmpl-e2d09d4de08643e999fd9cc0fe0d04dc-0.
INFO 03-02 00:03:52 [logger.py:42] Received request cmpl-85b114791196480ea934cd188c9a8684-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:03:52 [async_llm.py:261] Added request cmpl-85b114791196480ea934cd188c9a8684-0.
INFO 03-02 00:03:53 [logger.py:42] Received request cmpl-f704727bb5d1409ba3be7d0cc114bf5e-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:03:53 [async_llm.py:261] Added request cmpl-f704727bb5d1409ba3be7d0cc114bf5e-0.
INFO 03-02 00:03:54 [logger.py:42] Received request cmpl-6901632631a4428a86eca0008a84742b-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:03:54 [async_llm.py:261] Added request cmpl-6901632631a4428a86eca0008a84742b-0.
INFO 03-02 00:03:55 [logger.py:42] Received request cmpl-c432b060d4fd4ddab96d418d52f5eead-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:03:55 [async_llm.py:261] Added request cmpl-c432b060d4fd4ddab96d418d52f5eead-0.
INFO 03-02 00:03:57 [logger.py:42] Received request cmpl-b8eadea4c0ce47c880aa76a8c853bf4c-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:03:57 [async_llm.py:261] Added request cmpl-b8eadea4c0ce47c880aa76a8c853bf4c-0.
INFO 03-02 00:03:58 [loggers.py:116] Engine 000: Avg prompt throughput: 24.8 tokens/s, Avg generation throughput: 4.0 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.6%
INFO 03-02 00:03:58 [logger.py:42] Received request cmpl-7c3c47b1bbd745828d9ad5692a1024f7-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:03:58 [async_llm.py:261] Added request cmpl-7c3c47b1bbd745828d9ad5692a1024f7-0.
INFO 03-02 00:03:59 [logger.py:42] Received request cmpl-b1bf9ed18f194d6588061a978350fbdc-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:03:59 [async_llm.py:261] Added request cmpl-b1bf9ed18f194d6588061a978350fbdc-0.
INFO 03-02 00:04:00 [logger.py:42] Received request cmpl-8492b3b8f1fc4170ba34a6a868c1bbde-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:04:00 [async_llm.py:261] Added request cmpl-8492b3b8f1fc4170ba34a6a868c1bbde-0.
INFO 03-02 00:04:01 [logger.py:42] Received request cmpl-c243a34e0743499980ea3b5fe0e0b66a-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:04:01 [async_llm.py:261] Added request cmpl-c243a34e0743499980ea3b5fe0e0b66a-0.
INFO 03-02 00:04:02 [logger.py:42] Received request cmpl-e5f13098073a440e8200f418f6c364e4-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:04:02 [async_llm.py:261] Added request cmpl-e5f13098073a440e8200f418f6c364e4-0.
INFO 03-02 00:04:03 [logger.py:42] Received request cmpl-5dd5fbef44b7479bbdf4f77479c9fc6d-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:04:03 [async_llm.py:261] Added request cmpl-5dd5fbef44b7479bbdf4f77479c9fc6d-0.
INFO 03-02 00:04:05 [logger.py:42] Received request cmpl-e459de591a004230bc821a5be4f33736-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:04:05 [async_llm.py:261] Added request cmpl-e459de591a004230bc821a5be4f33736-0.
INFO 03-02 00:04:06 [logger.py:42] Received request cmpl-28f7c857a7a347029d9d110c1b5c1f1d-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:04:06 [async_llm.py:261] Added request cmpl-28f7c857a7a347029d9d110c1b5c1f1d-0.
INFO 03-02 00:04:07 [logger.py:42] Received request cmpl-c6e22af950254fe691dcf7fc1f330a3b-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:04:07 [async_llm.py:261] Added request cmpl-c6e22af950254fe691dcf7fc1f330a3b-0.
INFO 03-02 00:04:08 [loggers.py:116] Engine 000: Avg prompt throughput: 27.9 tokens/s, Avg generation throughput: 4.5 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.6%
INFO 03-02 00:04:08 [logger.py:42] Received request cmpl-460e959c84464c1f992737a2c556245d-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:04:08 [async_llm.py:261] Added request cmpl-460e959c84464c1f992737a2c556245d-0.
INFO 03-02 00:04:09 [logger.py:42] Received request cmpl-4249d24d2de54f418fd53144979d33dc-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:04:09 [async_llm.py:261] Added request cmpl-4249d24d2de54f418fd53144979d33dc-0.
INFO 03-02 00:04:10 [logger.py:42] Received request cmpl-18e576f3e1374905855dcd969fde015a-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:04:10 [async_llm.py:261] Added request cmpl-18e576f3e1374905855dcd969fde015a-0.
INFO 03-02 00:04:12 [logger.py:42] Received request cmpl-5047d39942e34a7789a960f5eeaa18ca-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:04:12 [async_llm.py:261] Added request cmpl-5047d39942e34a7789a960f5eeaa18ca-0.
INFO 03-02 00:04:13 [logger.py:42] Received request cmpl-8c7ede93d7a642598ff16656aa687d2e-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:04:13 [async_llm.py:261] Added request cmpl-8c7ede93d7a642598ff16656aa687d2e-0.
INFO 03-02 00:04:14 [logger.py:42] Received request cmpl-34bf6015582343ffb602d4f2d1bd4b30-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:04:14 [async_llm.py:261] Added request cmpl-34bf6015582343ffb602d4f2d1bd4b30-0.
INFO 03-02 00:04:15 [logger.py:42] Received request cmpl-0cd25f8165d34c1299e767fedcf10ebb-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:04:15 [async_llm.py:261] Added request cmpl-0cd25f8165d34c1299e767fedcf10ebb-0.
INFO 03-02 00:04:16 [logger.py:42] Received request cmpl-f32c0f70a7ca479ba18cf9d2b7b9dcf5-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:04:16 [async_llm.py:261] Added request cmpl-f32c0f70a7ca479ba18cf9d2b7b9dcf5-0.
INFO 03-02 00:04:17 [logger.py:42] Received request cmpl-6e46eaa6972641459e08cd7cf7d9d3bd-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:04:17 [async_llm.py:261] Added request cmpl-6e46eaa6972641459e08cd7cf7d9d3bd-0.
INFO 03-02 00:04:18 [loggers.py:116] Engine 000: Avg prompt throughput: 27.9 tokens/s, Avg generation throughput: 4.5 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.6%
INFO 03-02 00:04:18 [logger.py:42] Received request cmpl-525a7d3ceb454444a85519e13046976e-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:04:18 [async_llm.py:261] Added request cmpl-525a7d3ceb454444a85519e13046976e-0.
INFO 03-02 00:04:20 [logger.py:42] Received request cmpl-3080efd8f69c407b85fb364b005c4975-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:04:20 [async_llm.py:261] Added request cmpl-3080efd8f69c407b85fb364b005c4975-0.
INFO 03-02 00:04:21 [logger.py:42] Received request cmpl-f2ed8919056c4cfd9b5f8d0fe5b7059a-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:04:21 [async_llm.py:261] Added request cmpl-f2ed8919056c4cfd9b5f8d0fe5b7059a-0.
INFO 03-02 00:04:22 [logger.py:42] Received request cmpl-6ea01652b31e4b05bf930ed05ee6f02e-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:04:22 [async_llm.py:261] Added request cmpl-6ea01652b31e4b05bf930ed05ee6f02e-0.
INFO 03-02 00:04:23 [logger.py:42] Received request cmpl-2682a467377e499fbc47cbda7664b94a-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:04:23 [async_llm.py:261] Added request cmpl-2682a467377e499fbc47cbda7664b94a-0.
INFO 03-02 00:04:24 [logger.py:42] Received request cmpl-d1c3e68b483144b892869cc2b9d38640-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:04:24 [async_llm.py:261] Added request cmpl-d1c3e68b483144b892869cc2b9d38640-0.
INFO 03-02 00:04:25 [logger.py:42] Received request cmpl-504a9549428d4340a91073de6cf26631-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:04:25 [async_llm.py:261] Added request cmpl-504a9549428d4340a91073de6cf26631-0.
INFO 03-02 00:04:27 [logger.py:42] Received request cmpl-33db5dbff5e846f5a5e2c63d2acf3097-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:04:27 [async_llm.py:261] Added request cmpl-33db5dbff5e846f5a5e2c63d2acf3097-0.
INFO 03-02 00:04:28 [loggers.py:116] Engine 000: Avg prompt throughput: 24.8 tokens/s, Avg generation throughput: 4.0 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.6%
INFO 03-02 00:04:28 [logger.py:42] Received request cmpl-abd39de325ac497f874fb55469b78973-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:04:28 [async_llm.py:261] Added request cmpl-abd39de325ac497f874fb55469b78973-0.
INFO 03-02 00:04:29 [logger.py:42] Received request cmpl-7e606ed4f1934c69acec64cb1c4e9ad8-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:04:29 [async_llm.py:261] Added request cmpl-7e606ed4f1934c69acec64cb1c4e9ad8-0.
INFO 03-02 00:04:30 [logger.py:42] Received request cmpl-3dac689db640470c938d09e98271747e-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:04:30 [async_llm.py:261] Added request cmpl-3dac689db640470c938d09e98271747e-0.
INFO 03-02 00:04:31 [logger.py:42] Received request cmpl-d7772eecd5e04c5f844b1242fef4bd74-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:04:31 [async_llm.py:261] Added request cmpl-d7772eecd5e04c5f844b1242fef4bd74-0.
INFO 03-02 00:04:32 [logger.py:42] Received request cmpl-2051567682d74b71b69fdae8e6477181-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:04:32 [async_llm.py:261] Added request cmpl-2051567682d74b71b69fdae8e6477181-0.
INFO 03-02 00:04:33 [logger.py:42] Received request cmpl-527ec168d96d412fafd0646079196548-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:04:33 [async_llm.py:261] Added request cmpl-527ec168d96d412fafd0646079196548-0.
INFO 03-02 00:04:35 [logger.py:42] Received request cmpl-c92456393e334f13b186ed700f742103-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:04:35 [async_llm.py:261] Added request cmpl-c92456393e334f13b186ed700f742103-0.
INFO 03-02 00:04:36 [logger.py:42] Received request cmpl-b8cc867ff4444a59bb2c97c7f2f2f960-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:04:36 [async_llm.py:261] Added request cmpl-b8cc867ff4444a59bb2c97c7f2f2f960-0.
INFO 03-02 00:04:37 [logger.py:42] Received request cmpl-58fab142a8584680b855596863f9ac12-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:04:37 [async_llm.py:261] Added request cmpl-58fab142a8584680b855596863f9ac12-0.
INFO 03-02 00:04:38 [loggers.py:116] Engine 000: Avg prompt throughput: 27.9 tokens/s, Avg generation throughput: 4.5 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.6%
INFO 03-02 00:04:38 [logger.py:42] Received request cmpl-20d7ecb2bb964dbfaa64076658d72267-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:04:38 [async_llm.py:261] Added request cmpl-20d7ecb2bb964dbfaa64076658d72267-0.
INFO 03-02 00:04:39 [logger.py:42] Received request cmpl-e918629e12db4d9cac4c2300544b3cb5-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:04:39 [async_llm.py:261] Added request cmpl-e918629e12db4d9cac4c2300544b3cb5-0.
INFO 03-02 00:04:40 [logger.py:42] Received request cmpl-a9427b4e0f6747ccaed31982fc42a73d-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:04:40 [async_llm.py:261] Added request cmpl-a9427b4e0f6747ccaed31982fc42a73d-0.
INFO 03-02 00:04:42 [logger.py:42] Received request cmpl-bce6eb8c534d46f386f9dc4563f7c3b6-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:04:42 [async_llm.py:261] Added request cmpl-bce6eb8c534d46f386f9dc4563f7c3b6-0.
INFO 03-02 00:04:43 [logger.py:42] Received request cmpl-534462294efa4ffa9f118a1158624862-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:04:43 [async_llm.py:261] Added request cmpl-534462294efa4ffa9f118a1158624862-0.
INFO 03-02 00:04:44 [logger.py:42] Received request cmpl-1aee39965081490484669a9a8ff7053e-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:04:44 [async_llm.py:261] Added request cmpl-1aee39965081490484669a9a8ff7053e-0.
INFO 03-02 00:04:45 [logger.py:42] Received request cmpl-7e26cde3ae474434956198342245afd9-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:04:45 [async_llm.py:261] Added request cmpl-7e26cde3ae474434956198342245afd9-0.
INFO 03-02 00:04:46 [logger.py:42] Received request cmpl-28f7e3e5f7d041bab6c9931b3201c0f5-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:04:46 [async_llm.py:261] Added request cmpl-28f7e3e5f7d041bab6c9931b3201c0f5-0.
INFO 03-02 00:04:47 [logger.py:42] Received request cmpl-c9856bcd76ea418b900e7e8d6f9cc355-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:04:47 [async_llm.py:261] Added request cmpl-c9856bcd76ea418b900e7e8d6f9cc355-0.
INFO 03-02 00:04:48 [loggers.py:116] Engine 000: Avg prompt throughput: 27.9 tokens/s, Avg generation throughput: 4.5 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.6%
INFO 03-02 00:04:49 [logger.py:42] Received request cmpl-29d4d0a227b24c5f8dc6fb3a466f24a2-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:04:49 [async_llm.py:261] Added request cmpl-29d4d0a227b24c5f8dc6fb3a466f24a2-0.
INFO 03-02 00:04:50 [logger.py:42] Received request cmpl-8031949346ad43a495e1d67a129fb251-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:04:50 [async_llm.py:261] Added request cmpl-8031949346ad43a495e1d67a129fb251-0.
INFO 03-02 00:04:51 [logger.py:42] Received request cmpl-7ac987e65dfb44c4b3d2358ffe4554e9-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:04:51 [async_llm.py:261] Added request cmpl-7ac987e65dfb44c4b3d2358ffe4554e9-0.
INFO 03-02 00:04:52 [logger.py:42] Received request cmpl-8af7a0aa400d4c9dbd648275b39a76a2-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:04:52 [async_llm.py:261] Added request cmpl-8af7a0aa400d4c9dbd648275b39a76a2-0.
INFO 03-02 00:04:53 [logger.py:42] Received request cmpl-238f0c2992be4e58aa1e151dc8b926d7-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:04:53 [async_llm.py:261] Added request cmpl-238f0c2992be4e58aa1e151dc8b926d7-0.
INFO 03-02 00:04:54 [logger.py:42] Received request cmpl-2ae540094150448bb0d890102b5ae40d-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:04:54 [async_llm.py:261] Added request cmpl-2ae540094150448bb0d890102b5ae40d-0.
INFO 03-02 00:04:55 [logger.py:42] Received request cmpl-7cac6937cdd048bc846754aef5e42f23-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:04:55 [async_llm.py:261] Added request cmpl-7cac6937cdd048bc846754aef5e42f23-0.
INFO 03-02 00:04:57 [logger.py:42] Received request cmpl-7ce3fd4de1fa4af8a0f533e307271938-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:04:57 [async_llm.py:261] Added request cmpl-7ce3fd4de1fa4af8a0f533e307271938-0.
INFO 03-02 00:04:58 [loggers.py:116] Engine 000: Avg prompt throughput: 24.8 tokens/s, Avg generation throughput: 4.0 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.6%
INFO 03-02 00:04:58 [logger.py:42] Received request cmpl-7015fb3dee2a4a81a8ccd5a7ebccf130-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:04:58 [async_llm.py:261] Added request cmpl-7015fb3dee2a4a81a8ccd5a7ebccf130-0.
INFO 03-02 00:04:59 [logger.py:42] Received request cmpl-19ccd81540fe49f59e05d0f8478a6810-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:04:59 [async_llm.py:261] Added request cmpl-19ccd81540fe49f59e05d0f8478a6810-0.
INFO 03-02 00:05:00 [logger.py:42] Received request cmpl-515d9bd534334a59bf1830a3b743a113-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:05:00 [async_llm.py:261] Added request cmpl-515d9bd534334a59bf1830a3b743a113-0.
INFO 03-02 00:05:01 [logger.py:42] Received request cmpl-269428be85c44692a8bbab87a6cc5686-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:05:01 [async_llm.py:261] Added request cmpl-269428be85c44692a8bbab87a6cc5686-0.
INFO 03-02 00:05:02 [logger.py:42] Received request cmpl-454bf3b0640e46d882d20128dbaaa525-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:05:02 [async_llm.py:261] Added request cmpl-454bf3b0640e46d882d20128dbaaa525-0.
INFO 03-02 00:05:04 [logger.py:42] Received request cmpl-05b65f9a30224f8f9245cb9260ddbf99-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:05:04 [async_llm.py:261] Added request cmpl-05b65f9a30224f8f9245cb9260ddbf99-0.
INFO 03-02 00:05:05 [logger.py:42] Received request cmpl-5aebfbb11a714647b48ecd19b29ea6e5-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:05:05 [async_llm.py:261] Added request cmpl-5aebfbb11a714647b48ecd19b29ea6e5-0.
INFO 03-02 00:05:06 [logger.py:42] Received request cmpl-80487e86d257455b988c17604d4c360d-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:05:06 [async_llm.py:261] Added request cmpl-80487e86d257455b988c17604d4c360d-0.
INFO 03-02 00:05:07 [logger.py:42] Received request cmpl-44745f1ea2534c3ca09b732ed17090fe-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:05:07 [async_llm.py:261] Added request cmpl-44745f1ea2534c3ca09b732ed17090fe-0.
INFO 03-02 00:05:08 [loggers.py:116] Engine 000: Avg prompt throughput: 27.9 tokens/s, Avg generation throughput: 4.5 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.6%
INFO 03-02 00:05:08 [logger.py:42] Received request cmpl-20513d97d6f1459f94baf14389fba448-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:05:08 [async_llm.py:261] Added request cmpl-20513d97d6f1459f94baf14389fba448-0.
INFO 03-02 00:05:09 [logger.py:42] Received request cmpl-686d62a4399f49d3a44db340070d2c67-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:05:09 [async_llm.py:261] Added request cmpl-686d62a4399f49d3a44db340070d2c67-0.
INFO 03-02 00:05:10 [logger.py:42] Received request cmpl-0d3aade9dd6946919142481762081ba0-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:05:10 [async_llm.py:261] Added request cmpl-0d3aade9dd6946919142481762081ba0-0.
INFO 03-02 00:05:12 [logger.py:42] Received request cmpl-909a505a5411416eb61cbe0c7c89bc43-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:05:12 [async_llm.py:261] Added request cmpl-909a505a5411416eb61cbe0c7c89bc43-0.
INFO 03-02 00:05:13 [logger.py:42] Received request cmpl-9f221e9af840436197388b4e7cbbf87e-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:05:13 [async_llm.py:261] Added request cmpl-9f221e9af840436197388b4e7cbbf87e-0.
INFO 03-02 00:05:14 [logger.py:42] Received request cmpl-c6e15d9641e0448d8f7276c5e6dee315-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:05:14 [async_llm.py:261] Added request cmpl-c6e15d9641e0448d8f7276c5e6dee315-0.
INFO 03-02 00:05:15 [logger.py:42] Received request cmpl-af085b7039804a3c89de4f7e7a167db5-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:05:15 [async_llm.py:261] Added request cmpl-af085b7039804a3c89de4f7e7a167db5-0.
INFO 03-02 00:05:16 [logger.py:42] Received request cmpl-93f04355b0194c91bf25f91a096d9a01-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:05:16 [async_llm.py:261] Added request cmpl-93f04355b0194c91bf25f91a096d9a01-0.
INFO 03-02 00:05:17 [logger.py:42] Received request cmpl-6feda2752b864f9fa9f89fbb1a180c8c-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:05:17 [async_llm.py:261] Added request cmpl-6feda2752b864f9fa9f89fbb1a180c8c-0.
INFO 03-02 00:05:18 [loggers.py:116] Engine 000: Avg prompt throughput: 27.9 tokens/s, Avg generation throughput: 4.4 tokens/s, Running: 1 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.4%, Prefix cache hit rate: 51.6%
INFO 03-02 00:05:19 [logger.py:42] Received request cmpl-0a9dc2cf80f741d092228fa94fbf0b49-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:05:19 [async_llm.py:261] Added request cmpl-0a9dc2cf80f741d092228fa94fbf0b49-0.
INFO 03-02 00:05:20 [logger.py:42] Received request cmpl-58c35d73dae4459bafd63b2f484ea7a4-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:05:20 [async_llm.py:261] Added request cmpl-58c35d73dae4459bafd63b2f484ea7a4-0.
INFO 03-02 00:05:21 [logger.py:42] Received request cmpl-ecf11d47553a4c94b3e76714188ff52f-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:05:21 [async_llm.py:261] Added request cmpl-ecf11d47553a4c94b3e76714188ff52f-0.
INFO 03-02 00:05:22 [logger.py:42] Received request cmpl-6a082807a59844f79a2fcd0219ec96af-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:05:22 [async_llm.py:261] Added request cmpl-6a082807a59844f79a2fcd0219ec96af-0.
INFO 03-02 00:05:23 [logger.py:42] Received request cmpl-e242558dafbc4b06988574ac506c4e4f-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:05:23 [async_llm.py:261] Added request cmpl-e242558dafbc4b06988574ac506c4e4f-0.
INFO 03-02 00:05:24 [logger.py:42] Received request cmpl-859448a277b243abb8dd7c072437db4d-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:05:24 [async_llm.py:261] Added request cmpl-859448a277b243abb8dd7c072437db4d-0.
INFO 03-02 00:05:25 [logger.py:42] Received request cmpl-1c09a9002b4c469c8a2ba1b89891841d-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:05:25 [async_llm.py:261] Added request cmpl-1c09a9002b4c469c8a2ba1b89891841d-0.
INFO 03-02 00:05:27 [logger.py:42] Received request cmpl-df51c10fa9f44951bee6422ff6d2d998-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:05:27 [async_llm.py:261] Added request cmpl-df51c10fa9f44951bee6422ff6d2d998-0.
INFO 03-02 00:05:28 [loggers.py:116] Engine 000: Avg prompt throughput: 24.8 tokens/s, Avg generation throughput: 4.1 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.6%
INFO 03-02 00:05:28 [logger.py:42] Received request cmpl-c41bc96751ac416eb05473cbfc33ef1f-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:05:28 [async_llm.py:261] Added request cmpl-c41bc96751ac416eb05473cbfc33ef1f-0.
INFO 03-02 00:05:29 [logger.py:42] Received request cmpl-55d66314076845b4a82fd7f3ede322be-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:05:29 [async_llm.py:261] Added request cmpl-55d66314076845b4a82fd7f3ede322be-0.
INFO 03-02 00:05:30 [logger.py:42] Received request cmpl-9d49536546e94bf0af187cebf330c3f0-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:05:30 [async_llm.py:261] Added request cmpl-9d49536546e94bf0af187cebf330c3f0-0.
INFO 03-02 00:05:31 [logger.py:42] Received request cmpl-af88c16d162040eb8866fac00cbde733-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:05:31 [async_llm.py:261] Added request cmpl-af88c16d162040eb8866fac00cbde733-0.
INFO 03-02 00:05:32 [logger.py:42] Received request cmpl-9746c5c66d8e461d94d2a05bd041a08e-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:05:32 [async_llm.py:261] Added request cmpl-9746c5c66d8e461d94d2a05bd041a08e-0.
INFO 03-02 00:05:34 [logger.py:42] Received request cmpl-dc15f0dabf52406daade38457eb3705b-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:05:34 [async_llm.py:261] Added request cmpl-dc15f0dabf52406daade38457eb3705b-0.
INFO 03-02 00:05:35 [logger.py:42] Received request cmpl-06250fd7ad0a488980b2229569abe135-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:05:35 [async_llm.py:261] Added request cmpl-06250fd7ad0a488980b2229569abe135-0.
INFO 03-02 00:05:36 [logger.py:42] Received request cmpl-a60157e19d18451ba8b76abb7070ba66-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:05:36 [async_llm.py:261] Added request cmpl-a60157e19d18451ba8b76abb7070ba66-0.
INFO 03-02 00:05:37 [logger.py:42] Received request cmpl-9220ee487f5b45768dc9b1dad321e827-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:05:37 [async_llm.py:261] Added request cmpl-9220ee487f5b45768dc9b1dad321e827-0.
INFO 03-02 00:05:38 [loggers.py:116] Engine 000: Avg prompt throughput: 27.9 tokens/s, Avg generation throughput: 4.5 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.6%
INFO 03-02 00:05:38 [logger.py:42] Received request cmpl-191d0b7facdf4827ac94368e705b7f23-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:05:38 [async_llm.py:261] Added request cmpl-191d0b7facdf4827ac94368e705b7f23-0.
INFO 03-02 00:05:39 [logger.py:42] Received request cmpl-44d42868037644339fa74c120f9f5388-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:05:39 [async_llm.py:261] Added request cmpl-44d42868037644339fa74c120f9f5388-0.
INFO 03-02 00:05:41 [logger.py:42] Received request cmpl-eab7eaf36afd45009376f77c9be6653c-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:05:41 [async_llm.py:261] Added request cmpl-eab7eaf36afd45009376f77c9be6653c-0.
INFO 03-02 00:05:42 [logger.py:42] Received request cmpl-c59c7b72bbdd4d70acd6bd233cac2299-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:05:42 [async_llm.py:261] Added request cmpl-c59c7b72bbdd4d70acd6bd233cac2299-0.
INFO 03-02 00:05:43 [logger.py:42] Received request cmpl-91ef2c94216f4281970f0d3304e4a59b-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:05:43 [async_llm.py:261] Added request cmpl-91ef2c94216f4281970f0d3304e4a59b-0.
INFO 03-02 00:05:44 [logger.py:42] Received request cmpl-09d9bf930e0048dfb2e21f50b1e41bc7-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:05:44 [async_llm.py:261] Added request cmpl-09d9bf930e0048dfb2e21f50b1e41bc7-0.
INFO 03-02 00:05:45 [logger.py:42] Received request cmpl-d895cbbe6ac24a01b7e4c679382195d4-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:05:45 [async_llm.py:261] Added request cmpl-d895cbbe6ac24a01b7e4c679382195d4-0.
INFO 03-02 00:05:46 [logger.py:42] Received request cmpl-d0f12a2a351542f1ab54ca6205b58113-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:05:46 [async_llm.py:261] Added request cmpl-d0f12a2a351542f1ab54ca6205b58113-0.
INFO 03-02 00:05:47 [logger.py:42] Received request cmpl-8ceb0c60966a48bfb2361a2a60058803-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:05:47 [async_llm.py:261] Added request cmpl-8ceb0c60966a48bfb2361a2a60058803-0.
INFO 03-02 00:05:48 [loggers.py:116] Engine 000: Avg prompt throughput: 27.9 tokens/s, Avg generation throughput: 4.3 tokens/s, Running: 1 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.4%, Prefix cache hit rate: 51.6%
INFO 03-02 00:05:49 [logger.py:42] Received request cmpl-f04d51d0a5454c74ad4339c0cfd39cd4-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:05:49 [async_llm.py:261] Added request cmpl-f04d51d0a5454c74ad4339c0cfd39cd4-0.
INFO 03-02 00:05:50 [logger.py:42] Received request cmpl-b1aa4feaa8e046339672c0999c09ff30-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:05:50 [async_llm.py:261] Added request cmpl-b1aa4feaa8e046339672c0999c09ff30-0.
INFO 03-02 00:05:51 [logger.py:42] Received request cmpl-453919e1bc794d788c5e97e0ea68f895-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:05:51 [async_llm.py:261] Added request cmpl-453919e1bc794d788c5e97e0ea68f895-0.
INFO 03-02 00:05:52 [logger.py:42] Received request cmpl-3ff2689fcf264c9d8afd92037b358c6b-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:05:52 [async_llm.py:261] Added request cmpl-3ff2689fcf264c9d8afd92037b358c6b-0.
INFO 03-02 00:05:53 [logger.py:42] Received request cmpl-4923e005d1f94d7c8cefe94476c4df69-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:05:53 [async_llm.py:261] Added request cmpl-4923e005d1f94d7c8cefe94476c4df69-0.
INFO 03-02 00:05:54 [logger.py:42] Received request cmpl-a4a5f124d731415c926e30127033658b-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:05:54 [async_llm.py:261] Added request cmpl-a4a5f124d731415c926e30127033658b-0.
INFO 03-02 00:05:56 [logger.py:42] Received request cmpl-df4650d2c5d847cb89cefd4f0876f7f6-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:05:56 [async_llm.py:261] Added request cmpl-df4650d2c5d847cb89cefd4f0876f7f6-0.
INFO 03-02 00:05:57 [logger.py:42] Received request cmpl-35dcde958c8b4f84a1e05605e9762adc-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:05:57 [async_llm.py:261] Added request cmpl-35dcde958c8b4f84a1e05605e9762adc-0.
INFO 03-02 00:05:58 [loggers.py:116] Engine 000: Avg prompt throughput: 24.8 tokens/s, Avg generation throughput: 4.2 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.6%
INFO 03-02 00:05:58 [logger.py:42] Received request cmpl-dc6aa4a5cfea4f6fada047cb13d5b93e-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:05:58 [async_llm.py:261] Added request cmpl-dc6aa4a5cfea4f6fada047cb13d5b93e-0.
INFO 03-02 00:05:59 [logger.py:42] Received request cmpl-de959304b7f44cddb87d213ec07d70b7-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:05:59 [async_llm.py:261] Added request cmpl-de959304b7f44cddb87d213ec07d70b7-0.
INFO 03-02 00:06:00 [logger.py:42] Received request cmpl-b46f228dce484caf8fadae3e040d1745-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:06:00 [async_llm.py:261] Added request cmpl-b46f228dce484caf8fadae3e040d1745-0.
INFO 03-02 00:06:01 [logger.py:42] Received request cmpl-09303e30634140eaac741b61781ccedd-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:06:01 [async_llm.py:261] Added request cmpl-09303e30634140eaac741b61781ccedd-0.
INFO 03-02 00:06:02 [logger.py:42] Received request cmpl-5a802088e0b045768a855d12458aac98-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:06:02 [async_llm.py:261] Added request cmpl-5a802088e0b045768a855d12458aac98-0.
INFO 03-02 00:06:04 [logger.py:42] Received request cmpl-99a8296b173040bf82e38e60208568b5-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:06:04 [async_llm.py:261] Added request cmpl-99a8296b173040bf82e38e60208568b5-0.
INFO 03-02 00:06:05 [logger.py:42] Received request cmpl-5ed0e14521f94c1aa49df467be95b380-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:06:05 [async_llm.py:261] Added request cmpl-5ed0e14521f94c1aa49df467be95b380-0.
INFO 03-02 00:06:06 [logger.py:42] Received request cmpl-15c5473134ec49879917cad0c6d23c60-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:06:06 [async_llm.py:261] Added request cmpl-15c5473134ec49879917cad0c6d23c60-0.
INFO 03-02 00:06:07 [logger.py:42] Received request cmpl-4477e4f745fb4a378393c8e853a947b6-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:06:07 [async_llm.py:261] Added request cmpl-4477e4f745fb4a378393c8e853a947b6-0.
INFO 03-02 00:06:08 [loggers.py:116] Engine 000: Avg prompt throughput: 27.9 tokens/s, Avg generation throughput: 4.5 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.6%
INFO 03-02 00:06:08 [logger.py:42] Received request cmpl-106c920988ec482f9eb266bdbaf06645-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:06:08 [async_llm.py:261] Added request cmpl-106c920988ec482f9eb266bdbaf06645-0.
INFO 03-02 00:06:09 [logger.py:42] Received request cmpl-5cae354157804fddb1d93ad16f5ab04f-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:06:09 [async_llm.py:261] Added request cmpl-5cae354157804fddb1d93ad16f5ab04f-0.
INFO 03-02 00:06:11 [logger.py:42] Received request cmpl-0ed2d369de67407ea35d6bf2a28e8618-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:06:11 [async_llm.py:261] Added request cmpl-0ed2d369de67407ea35d6bf2a28e8618-0.
INFO 03-02 00:06:12 [logger.py:42] Received request cmpl-0be87d86947945c786116844a36fa098-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:06:12 [async_llm.py:261] Added request cmpl-0be87d86947945c786116844a36fa098-0.
INFO 03-02 00:06:13 [logger.py:42] Received request cmpl-53eed2418a2a4a79a3d92a573efd44b1-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:06:13 [async_llm.py:261] Added request cmpl-53eed2418a2a4a79a3d92a573efd44b1-0.
INFO 03-02 00:06:14 [logger.py:42] Received request cmpl-290b082cfb0d490da3ae648f05d61d5d-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:06:14 [async_llm.py:261] Added request cmpl-290b082cfb0d490da3ae648f05d61d5d-0.
INFO 03-02 00:06:15 [logger.py:42] Received request cmpl-16410d704ebb4b6394783e328cee2419-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:06:15 [async_llm.py:261] Added request cmpl-16410d704ebb4b6394783e328cee2419-0.
INFO 03-02 00:06:16 [logger.py:42] Received request cmpl-ed411264b6d04f9abadc1c2b99d0ba3e-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:06:16 [async_llm.py:261] Added request cmpl-ed411264b6d04f9abadc1c2b99d0ba3e-0.
INFO 03-02 00:06:17 [logger.py:42] Received request cmpl-004122f9f0df4dc3ac56a2946959403d-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:06:17 [async_llm.py:261] Added request cmpl-004122f9f0df4dc3ac56a2946959403d-0.
INFO 03-02 00:06:18 [loggers.py:116] Engine 000: Avg prompt throughput: 27.9 tokens/s, Avg generation throughput: 4.4 tokens/s, Running: 1 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.4%, Prefix cache hit rate: 51.6%
INFO 03-02 00:06:19 [logger.py:42] Received request cmpl-f89420ab00394c13b3c78c89919b1383-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:06:19 [async_llm.py:261] Added request cmpl-f89420ab00394c13b3c78c89919b1383-0.
INFO 03-02 00:06:20 [logger.py:42] Received request cmpl-e95be3d226334a0e858c44ba2b6da130-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:06:20 [async_llm.py:261] Added request cmpl-e95be3d226334a0e858c44ba2b6da130-0.
INFO 03-02 00:06:21 [logger.py:42] Received request cmpl-6756cce17b104429a1469042231045e0-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:06:21 [async_llm.py:261] Added request cmpl-6756cce17b104429a1469042231045e0-0.
INFO 03-02 00:06:22 [logger.py:42] Received request cmpl-9f60ced4e059438bbc0edf66c874e981-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:06:22 [async_llm.py:261] Added request cmpl-9f60ced4e059438bbc0edf66c874e981-0.
INFO 03-02 00:06:23 [logger.py:42] Received request cmpl-615623f60dc54be49f0554c9af221b9c-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:06:23 [async_llm.py:261] Added request cmpl-615623f60dc54be49f0554c9af221b9c-0.
INFO 03-02 00:06:24 [logger.py:42] Received request cmpl-496f906c2e8a4125993fcc6433764962-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:06:24 [async_llm.py:261] Added request cmpl-496f906c2e8a4125993fcc6433764962-0.
INFO 03-02 00:06:26 [logger.py:42] Received request cmpl-75a70fce0d15465cb50dfa9f15b2920a-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:06:26 [async_llm.py:261] Added request cmpl-75a70fce0d15465cb50dfa9f15b2920a-0.
INFO 03-02 00:06:27 [logger.py:42] Received request cmpl-e22a2bf02d6145fea6ff0bba61f512df-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:06:27 [async_llm.py:261] Added request cmpl-e22a2bf02d6145fea6ff0bba61f512df-0.
INFO 03-02 00:06:28 [loggers.py:116] Engine 000: Avg prompt throughput: 24.8 tokens/s, Avg generation throughput: 4.1 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.6%
INFO 03-02 00:06:28 [logger.py:42] Received request cmpl-f50d9d7c1b104998be7fc466e1c09936-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:06:28 [async_llm.py:261] Added request cmpl-f50d9d7c1b104998be7fc466e1c09936-0.
INFO 03-02 00:06:29 [logger.py:42] Received request cmpl-502d37dc8e47476c84b1830db2358199-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:06:29 [async_llm.py:261] Added request cmpl-502d37dc8e47476c84b1830db2358199-0.
INFO 03-02 00:06:30 [logger.py:42] Received request cmpl-7d7e11759eb34631bdcb38d0582f488d-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:06:30 [async_llm.py:261] Added request cmpl-7d7e11759eb34631bdcb38d0582f488d-0.
INFO 03-02 00:06:31 [logger.py:42] Received request cmpl-4ab18d0455864f9ebe0f8abe57332891-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:06:31 [async_llm.py:261] Added request cmpl-4ab18d0455864f9ebe0f8abe57332891-0.
INFO 03-02 00:06:32 [logger.py:42] Received request cmpl-2aa2b2a0dd3a4d43930a34513cd03666-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:06:32 [async_llm.py:261] Added request cmpl-2aa2b2a0dd3a4d43930a34513cd03666-0.
INFO 03-02 00:06:34 [logger.py:42] Received request cmpl-1491aa1ff88b409b8b950d1500c350c8-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:06:34 [async_llm.py:261] Added request cmpl-1491aa1ff88b409b8b950d1500c350c8-0.
INFO 03-02 00:06:35 [logger.py:42] Received request cmpl-bb44bafe9f6b4755ae5296e0499a5f6b-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:06:35 [async_llm.py:261] Added request cmpl-bb44bafe9f6b4755ae5296e0499a5f6b-0.
INFO 03-02 00:06:36 [logger.py:42] Received request cmpl-c7d5d5e150e34208b524e2b294ae4891-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:06:36 [async_llm.py:261] Added request cmpl-c7d5d5e150e34208b524e2b294ae4891-0.
INFO 03-02 00:06:37 [logger.py:42] Received request cmpl-008d4326286f473b99391e8fca68ebdc-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:06:37 [async_llm.py:261] Added request cmpl-008d4326286f473b99391e8fca68ebdc-0.
INFO 03-02 00:06:38 [loggers.py:116] Engine 000: Avg prompt throughput: 27.9 tokens/s, Avg generation throughput: 4.5 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.6%
INFO 03-02 00:06:38 [logger.py:42] Received request cmpl-3865bc9eff4a410b963e1ce7a978b82e-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:06:38 [async_llm.py:261] Added request cmpl-3865bc9eff4a410b963e1ce7a978b82e-0.
INFO 03-02 00:06:39 [logger.py:42] Received request cmpl-3a0ada3e2e2f4864a8cfd0eaef7216b2-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:06:39 [async_llm.py:261] Added request cmpl-3a0ada3e2e2f4864a8cfd0eaef7216b2-0.
INFO 03-02 00:06:41 [logger.py:42] Received request cmpl-03ac11f2557248adae8d104bc693174a-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:06:41 [async_llm.py:261] Added request cmpl-03ac11f2557248adae8d104bc693174a-0.
INFO 03-02 00:06:42 [logger.py:42] Received request cmpl-5f80892c2bd940ddbf7e3aae1a4fe86c-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:06:42 [async_llm.py:261] Added request cmpl-5f80892c2bd940ddbf7e3aae1a4fe86c-0.
INFO 03-02 00:06:43 [logger.py:42] Received request cmpl-eb3e5963a8b743cabd42a3d23e0cc342-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:06:43 [async_llm.py:261] Added request cmpl-eb3e5963a8b743cabd42a3d23e0cc342-0.
INFO 03-02 00:06:44 [logger.py:42] Received request cmpl-3cd9e4aac1ef4cba9f8bfb9c88ebec04-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:06:44 [async_llm.py:261] Added request cmpl-3cd9e4aac1ef4cba9f8bfb9c88ebec04-0.
INFO 03-02 00:06:45 [logger.py:42] Received request cmpl-f38cfd18be834c4ca8634d2c4414b1d3-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:06:45 [async_llm.py:261] Added request cmpl-f38cfd18be834c4ca8634d2c4414b1d3-0.
INFO 03-02 00:06:46 [logger.py:42] Received request cmpl-edd136c92446497b9fe7a31a325fae6e-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:06:46 [async_llm.py:261] Added request cmpl-edd136c92446497b9fe7a31a325fae6e-0.
INFO 03-02 00:06:47 [logger.py:42] Received request cmpl-806ba1ae5e934225ac1eeb9d36324d0f-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:06:47 [async_llm.py:261] Added request cmpl-806ba1ae5e934225ac1eeb9d36324d0f-0.
INFO 03-02 00:06:48 [loggers.py:116] Engine 000: Avg prompt throughput: 27.9 tokens/s, Avg generation throughput: 4.4 tokens/s, Running: 1 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.4%, Prefix cache hit rate: 51.6%
INFO 03-02 00:06:49 [logger.py:42] Received request cmpl-04fc897fe6a8450493ddcf6ac7a69435-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:06:49 [async_llm.py:261] Added request cmpl-04fc897fe6a8450493ddcf6ac7a69435-0.
INFO 03-02 00:06:50 [logger.py:42] Received request cmpl-7037d44de6a14461b916c5ba388c5f7f-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:06:50 [async_llm.py:261] Added request cmpl-7037d44de6a14461b916c5ba388c5f7f-0.
INFO 03-02 00:06:51 [logger.py:42] Received request cmpl-c1da14aa7efe4063989a9523d294c572-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:06:51 [async_llm.py:261] Added request cmpl-c1da14aa7efe4063989a9523d294c572-0.
INFO 03-02 00:06:52 [logger.py:42] Received request cmpl-813d13e9480e4837b4232a3a22c8b155-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:06:52 [async_llm.py:261] Added request cmpl-813d13e9480e4837b4232a3a22c8b155-0.
INFO 03-02 00:06:53 [logger.py:42] Received request cmpl-5ed15161382745e1a18a163edd7c8de6-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:06:53 [async_llm.py:261] Added request cmpl-5ed15161382745e1a18a163edd7c8de6-0.
INFO 03-02 00:06:54 [logger.py:42] Received request cmpl-c05d1c1e61e74edba1ff0429a810a91e-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:06:54 [async_llm.py:261] Added request cmpl-c05d1c1e61e74edba1ff0429a810a91e-0.
INFO 03-02 00:06:56 [logger.py:42] Received request cmpl-2503dab98b404d4788c2ebe4f50387df-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:06:56 [async_llm.py:261] Added request cmpl-2503dab98b404d4788c2ebe4f50387df-0.
INFO 03-02 00:06:57 [logger.py:42] Received request cmpl-84ec11187b634810bfd60ec9edd4f313-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:06:57 [async_llm.py:261] Added request cmpl-84ec11187b634810bfd60ec9edd4f313-0.
INFO 03-02 00:06:58 [loggers.py:116] Engine 000: Avg prompt throughput: 24.8 tokens/s, Avg generation throughput: 4.1 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.6%
INFO 03-02 00:06:58 [logger.py:42] Received request cmpl-92beec11f4a34efb8fb5bf08b03b6702-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:06:58 [async_llm.py:261] Added request cmpl-92beec11f4a34efb8fb5bf08b03b6702-0.
INFO 03-02 00:06:59 [logger.py:42] Received request cmpl-82a8c7ae41104cf483a587c9d8c07894-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:06:59 [async_llm.py:261] Added request cmpl-82a8c7ae41104cf483a587c9d8c07894-0.
INFO 03-02 00:07:00 [logger.py:42] Received request cmpl-111a575958ef4aaab338fab75f929f96-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:07:00 [async_llm.py:261] Added request cmpl-111a575958ef4aaab338fab75f929f96-0.
INFO 03-02 00:07:01 [logger.py:42] Received request cmpl-6ae7b5f23ff54197976bcdd3cb3784ad-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:07:01 [async_llm.py:261] Added request cmpl-6ae7b5f23ff54197976bcdd3cb3784ad-0.
INFO 03-02 00:07:02 [logger.py:42] Received request cmpl-1202c92c875c48fc9897b21c11b5c4a8-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:07:02 [async_llm.py:261] Added request cmpl-1202c92c875c48fc9897b21c11b5c4a8-0.
INFO 03-02 00:07:04 [logger.py:42] Received request cmpl-b561e04200be49a6bb8b657b9de2c79e-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:07:04 [async_llm.py:261] Added request cmpl-b561e04200be49a6bb8b657b9de2c79e-0.
INFO 03-02 00:07:05 [logger.py:42] Received request cmpl-95bc791b1a014e1e836675d43754ed5c-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:07:05 [async_llm.py:261] Added request cmpl-95bc791b1a014e1e836675d43754ed5c-0.
INFO 03-02 00:07:06 [logger.py:42] Received request cmpl-fc8650b6f64d480cab85c4dcaca0fb11-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:07:06 [async_llm.py:261] Added request cmpl-fc8650b6f64d480cab85c4dcaca0fb11-0.
INFO 03-02 00:07:07 [logger.py:42] Received request cmpl-2f1804f405084219a7b7146b9d421db2-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:07:07 [async_llm.py:261] Added request cmpl-2f1804f405084219a7b7146b9d421db2-0.
INFO 03-02 00:07:08 [loggers.py:116] Engine 000: Avg prompt throughput: 27.9 tokens/s, Avg generation throughput: 4.5 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.6%
INFO 03-02 00:07:08 [logger.py:42] Received request cmpl-2fa93de330064582b2093499fa4f3c97-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:07:08 [async_llm.py:261] Added request cmpl-2fa93de330064582b2093499fa4f3c97-0.
INFO 03-02 00:07:09 [logger.py:42] Received request cmpl-39d39df55b6043068d2c9bcd47dc1d9f-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:07:09 [async_llm.py:261] Added request cmpl-39d39df55b6043068d2c9bcd47dc1d9f-0.
INFO 03-02 00:07:10 [logger.py:42] Received request cmpl-875100901f4f4bc8a4e33f4fae7d746e-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:07:10 [async_llm.py:261] Added request cmpl-875100901f4f4bc8a4e33f4fae7d746e-0.
INFO 03-02 00:07:12 [logger.py:42] Received request cmpl-82ce3e793c16461fae939dd61cbc826a-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:07:12 [async_llm.py:261] Added request cmpl-82ce3e793c16461fae939dd61cbc826a-0.
INFO 03-02 00:07:13 [logger.py:42] Received request cmpl-34fc98c5be4c4a6f8b7aa8bb1de7f7d6-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:07:13 [async_llm.py:261] Added request cmpl-34fc98c5be4c4a6f8b7aa8bb1de7f7d6-0.
INFO 03-02 00:07:14 [logger.py:42] Received request cmpl-788282a53ba2435fabe925c2ef1d3a43-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:07:14 [async_llm.py:261] Added request cmpl-788282a53ba2435fabe925c2ef1d3a43-0.
INFO 03-02 00:07:15 [logger.py:42] Received request cmpl-4378e21cf726472a98f2ddba635e736c-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:07:15 [async_llm.py:261] Added request cmpl-4378e21cf726472a98f2ddba635e736c-0.
INFO 03-02 00:07:16 [logger.py:42] Received request cmpl-7fe7d826dfe34d238da20c66884ed087-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:07:16 [async_llm.py:261] Added request cmpl-7fe7d826dfe34d238da20c66884ed087-0.
INFO 03-02 00:07:17 [logger.py:42] Received request cmpl-26fac7ba1aae44848c65036cce2918b6-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:07:17 [async_llm.py:261] Added request cmpl-26fac7ba1aae44848c65036cce2918b6-0.
INFO 03-02 00:07:18 [loggers.py:116] Engine 000: Avg prompt throughput: 27.9 tokens/s, Avg generation throughput: 4.5 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.6%
INFO 03-02 00:07:19 [logger.py:42] Received request cmpl-fd3848c6c2c74e98987736cb5851c66a-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:07:19 [async_llm.py:261] Added request cmpl-fd3848c6c2c74e98987736cb5851c66a-0.
INFO 03-02 00:07:20 [logger.py:42] Received request cmpl-8b445bc0fbe440ef9df26f48cac49a78-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:07:20 [async_llm.py:261] Added request cmpl-8b445bc0fbe440ef9df26f48cac49a78-0.
INFO 03-02 00:07:21 [logger.py:42] Received request cmpl-064fd1a0762f42de9b7ab1a6627c813b-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:07:21 [async_llm.py:261] Added request cmpl-064fd1a0762f42de9b7ab1a6627c813b-0.
INFO 03-02 00:07:22 [logger.py:42] Received request cmpl-a26ec899e347403f9933590c601091e6-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:07:22 [async_llm.py:261] Added request cmpl-a26ec899e347403f9933590c601091e6-0.
INFO 03-02 00:07:23 [logger.py:42] Received request cmpl-14bcc52f085a4a9e85f7864ff3d966d5-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:07:23 [async_llm.py:261] Added request cmpl-14bcc52f085a4a9e85f7864ff3d966d5-0.
INFO 03-02 00:07:24 [logger.py:42] Received request cmpl-5c38e6facade4cb0a0395edcadf1499f-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:07:24 [async_llm.py:261] Added request cmpl-5c38e6facade4cb0a0395edcadf1499f-0.
INFO 03-02 00:07:25 [logger.py:42] Received request cmpl-92d60d8c75e4461abbe28e695ac3b5ce-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:07:25 [async_llm.py:261] Added request cmpl-92d60d8c75e4461abbe28e695ac3b5ce-0.
INFO 03-02 00:07:27 [logger.py:42] Received request cmpl-42c912ee1a3646908452d966c70d3613-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:07:27 [async_llm.py:261] Added request cmpl-42c912ee1a3646908452d966c70d3613-0.
INFO 03-02 00:07:28 [loggers.py:116] Engine 000: Avg prompt throughput: 24.8 tokens/s, Avg generation throughput: 4.0 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.6%
INFO 03-02 00:07:28 [logger.py:42] Received request cmpl-5070f650f8a943be89119b4eaf5c9f56-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:07:28 [async_llm.py:261] Added request cmpl-5070f650f8a943be89119b4eaf5c9f56-0.
INFO 03-02 00:07:29 [logger.py:42] Received request cmpl-89a670c8c15d473a819e92298e0033fb-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:07:29 [async_llm.py:261] Added request cmpl-89a670c8c15d473a819e92298e0033fb-0.
INFO 03-02 00:07:30 [logger.py:42] Received request cmpl-e7e6856998964727b198baa79ff77ca5-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:07:30 [async_llm.py:261] Added request cmpl-e7e6856998964727b198baa79ff77ca5-0.
INFO 03-02 00:07:31 [logger.py:42] Received request cmpl-c830f9dd8ea64aeba18c4d911778bfd0-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:07:31 [async_llm.py:261] Added request cmpl-c830f9dd8ea64aeba18c4d911778bfd0-0.
INFO 03-02 00:07:32 [logger.py:42] Received request cmpl-5fbbad724d364bd8b1a840b2038c7c48-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:07:32 [async_llm.py:261] Added request cmpl-5fbbad724d364bd8b1a840b2038c7c48-0.
INFO 03-02 00:07:34 [logger.py:42] Received request cmpl-177961499855415c84936c4a7a021f73-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:07:34 [async_llm.py:261] Added request cmpl-177961499855415c84936c4a7a021f73-0.
INFO 03-02 00:07:35 [logger.py:42] Received request cmpl-f5bfa43b6011421e8d3e2e8dea13d6e9-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:07:35 [async_llm.py:261] Added request cmpl-f5bfa43b6011421e8d3e2e8dea13d6e9-0.
INFO 03-02 00:07:36 [logger.py:42] Received request cmpl-5d6184402c57495a8d070d1a1cd43527-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:07:36 [async_llm.py:261] Added request cmpl-5d6184402c57495a8d070d1a1cd43527-0.
INFO 03-02 00:07:37 [logger.py:42] Received request cmpl-0d86fc6af20c42628758fe41e9f74270-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:07:37 [async_llm.py:261] Added request cmpl-0d86fc6af20c42628758fe41e9f74270-0.
INFO 03-02 00:07:38 [loggers.py:116] Engine 000: Avg prompt throughput: 27.9 tokens/s, Avg generation throughput: 4.5 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.6%
INFO 03-02 00:07:38 [logger.py:42] Received request cmpl-2894586f1c3b42698ad8b83cb3046a45-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:07:38 [async_llm.py:261] Added request cmpl-2894586f1c3b42698ad8b83cb3046a45-0.
INFO 03-02 00:07:39 [logger.py:42] Received request cmpl-79306ccb42414f7ebba86dfc7ca14ba2-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:07:39 [async_llm.py:261] Added request cmpl-79306ccb42414f7ebba86dfc7ca14ba2-0.
INFO 03-02 00:07:40 [logger.py:42] Received request cmpl-7bc2f5f902f64ffcac43c678c4cb17df-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:07:40 [async_llm.py:261] Added request cmpl-7bc2f5f902f64ffcac43c678c4cb17df-0.
INFO 03-02 00:07:42 [logger.py:42] Received request cmpl-b9019fa98c604817b1a1a4b2de0da2ef-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:07:42 [async_llm.py:261] Added request cmpl-b9019fa98c604817b1a1a4b2de0da2ef-0.
INFO 03-02 00:07:43 [logger.py:42] Received request cmpl-40a5c6c51f55442dad4f748d181b4e69-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:07:43 [async_llm.py:261] Added request cmpl-40a5c6c51f55442dad4f748d181b4e69-0.
INFO 03-02 00:07:44 [logger.py:42] Received request cmpl-eea3722279234db08239f592b74ac371-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:07:44 [async_llm.py:261] Added request cmpl-eea3722279234db08239f592b74ac371-0.
INFO 03-02 00:07:45 [logger.py:42] Received request cmpl-dc62efae20224f3eaec9ad11f9654ed9-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:07:45 [async_llm.py:261] Added request cmpl-dc62efae20224f3eaec9ad11f9654ed9-0.
INFO 03-02 00:07:46 [logger.py:42] Received request cmpl-e4ba18e96ee249aea0a0298d0be39915-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:07:46 [async_llm.py:261] Added request cmpl-e4ba18e96ee249aea0a0298d0be39915-0.
INFO 03-02 00:07:47 [logger.py:42] Received request cmpl-ac141b3a74cf4d6abec6ab94303a17ae-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:07:47 [async_llm.py:261] Added request cmpl-ac141b3a74cf4d6abec6ab94303a17ae-0.
INFO 03-02 00:07:48 [loggers.py:116] Engine 000: Avg prompt throughput: 27.9 tokens/s, Avg generation throughput: 4.5 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.6%
INFO 03-02 00:07:49 [logger.py:42] Received request cmpl-2ed16a244beb4983b7c726adeb3d1950-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:07:49 [async_llm.py:261] Added request cmpl-2ed16a244beb4983b7c726adeb3d1950-0.
INFO 03-02 00:07:50 [logger.py:42] Received request cmpl-6c975e5a9e0d4ce496b6bd6fef0f417b-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:07:50 [async_llm.py:261] Added request cmpl-6c975e5a9e0d4ce496b6bd6fef0f417b-0.
INFO 03-02 00:07:51 [logger.py:42] Received request cmpl-fb3943d92e474917944510668accf64b-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:07:51 [async_llm.py:261] Added request cmpl-fb3943d92e474917944510668accf64b-0.
INFO 03-02 00:07:52 [logger.py:42] Received request cmpl-95bbb9ddc0bb44a38ee08f3ab9e64381-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:07:52 [async_llm.py:261] Added request cmpl-95bbb9ddc0bb44a38ee08f3ab9e64381-0.
INFO 03-02 00:07:53 [logger.py:42] Received request cmpl-b287851243ef4f01a1560edc2fd1d6ae-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:07:53 [async_llm.py:261] Added request cmpl-b287851243ef4f01a1560edc2fd1d6ae-0.
INFO 03-02 00:07:54 [logger.py:42] Received request cmpl-b7694d7b63fe4a9daf2494d6901fb6ef-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:07:54 [async_llm.py:261] Added request cmpl-b7694d7b63fe4a9daf2494d6901fb6ef-0.
INFO 03-02 00:07:55 [logger.py:42] Received request cmpl-cd0e6475dbfa435e87172a137cbc338a-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:07:55 [async_llm.py:261] Added request cmpl-cd0e6475dbfa435e87172a137cbc338a-0.
INFO 03-02 00:07:57 [logger.py:42] Received request cmpl-1f00bc4f403c47c2b33ccfba87d01ad8-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:07:57 [async_llm.py:261] Added request cmpl-1f00bc4f403c47c2b33ccfba87d01ad8-0.
INFO 03-02 00:07:58 [loggers.py:116] Engine 000: Avg prompt throughput: 24.8 tokens/s, Avg generation throughput: 4.0 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.6%
INFO 03-02 00:07:58 [logger.py:42] Received request cmpl-c53c087078b34264958f274a10f102c0-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:07:58 [async_llm.py:261] Added request cmpl-c53c087078b34264958f274a10f102c0-0.
INFO 03-02 00:07:59 [logger.py:42] Received request cmpl-b569286ef27f40a8b93f0537d18190d6-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:07:59 [async_llm.py:261] Added request cmpl-b569286ef27f40a8b93f0537d18190d6-0.
INFO 03-02 00:08:00 [logger.py:42] Received request cmpl-a39e743298b94ad58d53aaae1473df12-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:08:00 [async_llm.py:261] Added request cmpl-a39e743298b94ad58d53aaae1473df12-0.
INFO 03-02 00:08:01 [logger.py:42] Received request cmpl-e8c075ff25ce43a28179d00f94db6a9d-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:08:01 [async_llm.py:261] Added request cmpl-e8c075ff25ce43a28179d00f94db6a9d-0.
INFO 03-02 00:08:02 [logger.py:42] Received request cmpl-e24126ed75e745b0979b3971dc2bc75d-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:08:02 [async_llm.py:261] Added request cmpl-e24126ed75e745b0979b3971dc2bc75d-0.
INFO 03-02 00:08:03 [logger.py:42] Received request cmpl-b44ab0c1c6ea46c694568c422d1523fb-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:08:04 [async_llm.py:261] Added request cmpl-b44ab0c1c6ea46c694568c422d1523fb-0.
INFO 03-02 00:08:05 [logger.py:42] Received request cmpl-e40bd0d5336b4b46ad027e0f20533e0e-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:08:05 [async_llm.py:261] Added request cmpl-e40bd0d5336b4b46ad027e0f20533e0e-0.
INFO 03-02 00:08:06 [logger.py:42] Received request cmpl-4e2334990fa0462ca0fe34fc4084d55f-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:08:06 [async_llm.py:261] Added request cmpl-4e2334990fa0462ca0fe34fc4084d55f-0.
INFO 03-02 00:08:07 [logger.py:42] Received request cmpl-8d4a6f1669f04e4296251ad291abda38-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:08:07 [async_llm.py:261] Added request cmpl-8d4a6f1669f04e4296251ad291abda38-0.
INFO 03-02 00:08:08 [loggers.py:116] Engine 000: Avg prompt throughput: 27.9 tokens/s, Avg generation throughput: 4.5 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.6%
INFO 03-02 00:08:08 [logger.py:42] Received request cmpl-6ec22528120a482f8c3ecc0ea6448893-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:08:08 [async_llm.py:261] Added request cmpl-6ec22528120a482f8c3ecc0ea6448893-0.
INFO 03-02 00:08:09 [logger.py:42] Received request cmpl-e67c41a50529422fa746cea6322b5eca-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:08:09 [async_llm.py:261] Added request cmpl-e67c41a50529422fa746cea6322b5eca-0.
INFO 03-02 00:08:10 [logger.py:42] Received request cmpl-8b48a71add6c4fe6be089f438c61fd71-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:08:10 [async_llm.py:261] Added request cmpl-8b48a71add6c4fe6be089f438c61fd71-0.
INFO 03-02 00:08:12 [logger.py:42] Received request cmpl-3cc951224d4748e1a375ce6d6bab4637-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:08:12 [async_llm.py:261] Added request cmpl-3cc951224d4748e1a375ce6d6bab4637-0.
INFO 03-02 00:08:13 [logger.py:42] Received request cmpl-3f34185dc27040d58ee1f38f2533dac2-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:08:13 [async_llm.py:261] Added request cmpl-3f34185dc27040d58ee1f38f2533dac2-0.
INFO 03-02 00:08:14 [logger.py:42] Received request cmpl-7d7962318290485a89cb3f7c5395b350-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:08:14 [async_llm.py:261] Added request cmpl-7d7962318290485a89cb3f7c5395b350-0.
INFO 03-02 00:08:15 [logger.py:42] Received request cmpl-51e585bbeebb4e63b4b4ee5ba187920c-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:08:15 [async_llm.py:261] Added request cmpl-51e585bbeebb4e63b4b4ee5ba187920c-0.
INFO 03-02 00:08:16 [logger.py:42] Received request cmpl-82068c0ae53342c0abc25f2728a966a9-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:08:16 [async_llm.py:261] Added request cmpl-82068c0ae53342c0abc25f2728a966a9-0.
INFO 03-02 00:08:17 [logger.py:42] Received request cmpl-a822aea16c8b469db035c76aec277645-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:08:17 [async_llm.py:261] Added request cmpl-a822aea16c8b469db035c76aec277645-0.
INFO 03-02 00:08:18 [loggers.py:116] Engine 000: Avg prompt throughput: 27.9 tokens/s, Avg generation throughput: 4.5 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.6%
INFO 03-02 00:08:18 [logger.py:42] Received request cmpl-1a91fe5b3c1a49b4ab50a317ffafd78a-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:08:18 [async_llm.py:261] Added request cmpl-1a91fe5b3c1a49b4ab50a317ffafd78a-0.
INFO 03-02 00:08:20 [logger.py:42] Received request cmpl-28c2a55393a9496787e789b2902ea346-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:08:20 [async_llm.py:261] Added request cmpl-28c2a55393a9496787e789b2902ea346-0.
INFO 03-02 00:08:21 [logger.py:42] Received request cmpl-e21a1e365d9e496d9682cd749f40e121-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:08:21 [async_llm.py:261] Added request cmpl-e21a1e365d9e496d9682cd749f40e121-0.
INFO 03-02 00:08:22 [logger.py:42] Received request cmpl-c3139ec6bd754c7592731b4288d897f8-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:08:22 [async_llm.py:261] Added request cmpl-c3139ec6bd754c7592731b4288d897f8-0.
INFO 03-02 00:08:23 [logger.py:42] Received request cmpl-ed93345d480e4a46aa285495f1f43778-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:08:23 [async_llm.py:261] Added request cmpl-ed93345d480e4a46aa285495f1f43778-0.
INFO 03-02 00:08:24 [logger.py:42] Received request cmpl-c7d5d44a1a704794a5a82f621ed84584-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:08:24 [async_llm.py:261] Added request cmpl-c7d5d44a1a704794a5a82f621ed84584-0.
INFO 03-02 00:08:25 [logger.py:42] Received request cmpl-7fefcb34830a424daad96852d4f71135-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:08:25 [async_llm.py:261] Added request cmpl-7fefcb34830a424daad96852d4f71135-0.
INFO 03-02 00:08:27 [logger.py:42] Received request cmpl-2594ec47c6914faea0e5d50cd1e31845-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:08:27 [async_llm.py:261] Added request cmpl-2594ec47c6914faea0e5d50cd1e31845-0.
INFO 03-02 00:08:28 [loggers.py:116] Engine 000: Avg prompt throughput: 24.8 tokens/s, Avg generation throughput: 4.0 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.6%
INFO 03-02 00:08:28 [logger.py:42] Received request cmpl-202564bd3f76441e83dc129ad9f64cd1-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:08:28 [async_llm.py:261] Added request cmpl-202564bd3f76441e83dc129ad9f64cd1-0.
INFO 03-02 00:08:29 [logger.py:42] Received request cmpl-1e39a57e3eb44467a6aea938eef71e17-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:08:29 [async_llm.py:261] Added request cmpl-1e39a57e3eb44467a6aea938eef71e17-0.
INFO 03-02 00:08:30 [logger.py:42] Received request cmpl-f376f809e02a4ab8a11cd4a5ca89681c-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:08:30 [async_llm.py:261] Added request cmpl-f376f809e02a4ab8a11cd4a5ca89681c-0.
INFO 03-02 00:08:31 [logger.py:42] Received request cmpl-215988d2e03948f9aed16508e53a991a-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:08:31 [async_llm.py:261] Added request cmpl-215988d2e03948f9aed16508e53a991a-0.
INFO 03-02 00:08:32 [logger.py:42] Received request cmpl-e7741d813bd74403876df3520febbdda-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:08:32 [async_llm.py:261] Added request cmpl-e7741d813bd74403876df3520febbdda-0.
INFO 03-02 00:08:33 [logger.py:42] Received request cmpl-3a5adecb0e614e4ba1d5aaab7a52dd7f-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:08:33 [async_llm.py:261] Added request cmpl-3a5adecb0e614e4ba1d5aaab7a52dd7f-0.
INFO 03-02 00:08:35 [logger.py:42] Received request cmpl-a9464bcd281e4885b2267d53139271c0-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:08:35 [async_llm.py:261] Added request cmpl-a9464bcd281e4885b2267d53139271c0-0.
INFO 03-02 00:08:36 [logger.py:42] Received request cmpl-f3ea7b10cbc0417389a62e583cfb2fec-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:08:36 [async_llm.py:261] Added request cmpl-f3ea7b10cbc0417389a62e583cfb2fec-0.
INFO 03-02 00:08:37 [logger.py:42] Received request cmpl-7137d4acc6ec4de1b39f1507e82cb83b-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:08:37 [async_llm.py:261] Added request cmpl-7137d4acc6ec4de1b39f1507e82cb83b-0.
INFO 03-02 00:08:38 [loggers.py:116] Engine 000: Avg prompt throughput: 27.9 tokens/s, Avg generation throughput: 4.5 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.6%
INFO 03-02 00:08:38 [logger.py:42] Received request cmpl-33d300ce770441a2a879eed75dd71c4f-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:08:38 [async_llm.py:261] Added request cmpl-33d300ce770441a2a879eed75dd71c4f-0.
INFO 03-02 00:08:39 [logger.py:42] Received request cmpl-dd75ab6edd27444f976bb10163759b6b-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:08:39 [async_llm.py:261] Added request cmpl-dd75ab6edd27444f976bb10163759b6b-0.
INFO 03-02 00:08:40 [logger.py:42] Received request cmpl-7c02ff1e95e244dd8527ff8b7d90ccf4-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:08:40 [async_llm.py:261] Added request cmpl-7c02ff1e95e244dd8527ff8b7d90ccf4-0.
INFO 03-02 00:08:42 [logger.py:42] Received request cmpl-98a89d07c89543e6a9d595e164ffff49-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:08:42 [async_llm.py:261] Added request cmpl-98a89d07c89543e6a9d595e164ffff49-0.
INFO 03-02 00:08:43 [logger.py:42] Received request cmpl-556b385c5a87436084ce9ad4d3d7ea25-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:08:43 [async_llm.py:261] Added request cmpl-556b385c5a87436084ce9ad4d3d7ea25-0.
INFO 03-02 00:08:44 [logger.py:42] Received request cmpl-9fa4c17065754247a162a359a67b2046-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:08:44 [async_llm.py:261] Added request cmpl-9fa4c17065754247a162a359a67b2046-0.
INFO 03-02 00:08:45 [logger.py:42] Received request cmpl-6a520e2548e74a55a9fd8a6f0c9ee3f6-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:08:45 [async_llm.py:261] Added request cmpl-6a520e2548e74a55a9fd8a6f0c9ee3f6-0.
INFO 03-02 00:08:46 [logger.py:42] Received request cmpl-5bf5ebd8fddc428cb1c4241cdb810a14-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:08:46 [async_llm.py:261] Added request cmpl-5bf5ebd8fddc428cb1c4241cdb810a14-0.
INFO 03-02 00:08:47 [logger.py:42] Received request cmpl-8aa37895df4748acaa3577bdcc92db74-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:08:47 [async_llm.py:261] Added request cmpl-8aa37895df4748acaa3577bdcc92db74-0.
INFO 03-02 00:08:48 [loggers.py:116] Engine 000: Avg prompt throughput: 27.9 tokens/s, Avg generation throughput: 4.5 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.6%
INFO 03-02 00:08:48 [logger.py:42] Received request cmpl-9a1baf91edb24adfa41e9f7da12ccc35-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:08:48 [async_llm.py:261] Added request cmpl-9a1baf91edb24adfa41e9f7da12ccc35-0.
INFO 03-02 00:08:50 [logger.py:42] Received request cmpl-f831c6409e5d4d4187f655afdb5e30db-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:08:50 [async_llm.py:261] Added request cmpl-f831c6409e5d4d4187f655afdb5e30db-0.
INFO 03-02 00:08:51 [logger.py:42] Received request cmpl-cbf949195e6f41c392374dcf49671c90-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:08:51 [async_llm.py:261] Added request cmpl-cbf949195e6f41c392374dcf49671c90-0.
INFO 03-02 00:08:52 [logger.py:42] Received request cmpl-c8170d819dcb42f8af2f65139883b258-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:08:52 [async_llm.py:261] Added request cmpl-c8170d819dcb42f8af2f65139883b258-0.
INFO 03-02 00:08:53 [logger.py:42] Received request cmpl-9d367abedf784c87962e2359cf7d6a47-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:08:53 [async_llm.py:261] Added request cmpl-9d367abedf784c87962e2359cf7d6a47-0.
INFO 03-02 00:08:54 [logger.py:42] Received request cmpl-c97608dad32c4d6b96abd026ca439357-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:08:54 [async_llm.py:261] Added request cmpl-c97608dad32c4d6b96abd026ca439357-0.
INFO 03-02 00:08:55 [logger.py:42] Received request cmpl-a0fd57b36df34f93921cd81c210cca3c-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:08:55 [async_llm.py:261] Added request cmpl-a0fd57b36df34f93921cd81c210cca3c-0.
INFO 03-02 00:08:57 [logger.py:42] Received request cmpl-f5fb7f2de58d4d5fab480288999b2ea9-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:08:57 [async_llm.py:261] Added request cmpl-f5fb7f2de58d4d5fab480288999b2ea9-0.
INFO 03-02 00:08:58 [loggers.py:116] Engine 000: Avg prompt throughput: 24.8 tokens/s, Avg generation throughput: 4.0 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.6%
INFO 03-02 00:08:58 [logger.py:42] Received request cmpl-8dd9c92d2a4d4508a9bcb05b5848ce32-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:08:58 [async_llm.py:261] Added request cmpl-8dd9c92d2a4d4508a9bcb05b5848ce32-0.
INFO 03-02 00:08:59 [logger.py:42] Received request cmpl-165dc3c992834cc1a7ef514f0d7cf144-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:08:59 [async_llm.py:261] Added request cmpl-165dc3c992834cc1a7ef514f0d7cf144-0.
INFO 03-02 00:09:00 [logger.py:42] Received request cmpl-5f56b9a2d4f54544b1765b7a2bbd01e7-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:09:00 [async_llm.py:261] Added request cmpl-5f56b9a2d4f54544b1765b7a2bbd01e7-0.
INFO 03-02 00:09:01 [logger.py:42] Received request cmpl-8dc729685e4a403b9aa3ffd907fa6fc2-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:09:01 [async_llm.py:261] Added request cmpl-8dc729685e4a403b9aa3ffd907fa6fc2-0.
INFO 03-02 00:09:02 [logger.py:42] Received request cmpl-c29471719be14fa79db0e0050ef2811d-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:09:02 [async_llm.py:261] Added request cmpl-c29471719be14fa79db0e0050ef2811d-0.
INFO 03-02 00:09:03 [logger.py:42] Received request cmpl-8660e0e86bd2490d9dd4838df8e4645e-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:09:03 [async_llm.py:261] Added request cmpl-8660e0e86bd2490d9dd4838df8e4645e-0.
INFO 03-02 00:09:05 [logger.py:42] Received request cmpl-a1a92ea81eab450c81cec861c72fb6df-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:09:05 [async_llm.py:261] Added request cmpl-a1a92ea81eab450c81cec861c72fb6df-0.
INFO 03-02 00:09:06 [logger.py:42] Received request cmpl-c083ef61f4004f8c9923134256708664-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:09:06 [async_llm.py:261] Added request cmpl-c083ef61f4004f8c9923134256708664-0.
INFO 03-02 00:09:07 [logger.py:42] Received request cmpl-6ede03d8297149108444c5cbcbf6e0c6-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:09:07 [async_llm.py:261] Added request cmpl-6ede03d8297149108444c5cbcbf6e0c6-0.
INFO 03-02 00:09:08 [loggers.py:116] Engine 000: Avg prompt throughput: 27.9 tokens/s, Avg generation throughput: 4.5 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.6%
INFO 03-02 00:09:08 [logger.py:42] Received request cmpl-2fbbc28fda8147c29c806c60c6ac7d7c-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:09:08 [async_llm.py:261] Added request cmpl-2fbbc28fda8147c29c806c60c6ac7d7c-0.
INFO 03-02 00:09:09 [logger.py:42] Received request cmpl-6fafe86efd09440ba67a52c59316ade9-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:09:09 [async_llm.py:261] Added request cmpl-6fafe86efd09440ba67a52c59316ade9-0.
INFO 03-02 00:09:10 [logger.py:42] Received request cmpl-34989f50662749588909c31fcc717d42-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:09:10 [async_llm.py:261] Added request cmpl-34989f50662749588909c31fcc717d42-0.
INFO 03-02 00:09:12 [logger.py:42] Received request cmpl-0b716e4505f44ea6983bb0acb47e9bb4-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:09:12 [async_llm.py:261] Added request cmpl-0b716e4505f44ea6983bb0acb47e9bb4-0.
INFO 03-02 00:09:13 [logger.py:42] Received request cmpl-8c435963630a42c1b089fd27e3febc92-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:09:13 [async_llm.py:261] Added request cmpl-8c435963630a42c1b089fd27e3febc92-0.
INFO 03-02 00:09:14 [logger.py:42] Received request cmpl-08e3822e585341e3b22195a70e82e630-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:09:14 [async_llm.py:261] Added request cmpl-08e3822e585341e3b22195a70e82e630-0.
INFO 03-02 00:09:15 [logger.py:42] Received request cmpl-ca554b766d134251a452262641b96e1f-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:09:15 [async_llm.py:261] Added request cmpl-ca554b766d134251a452262641b96e1f-0.
INFO 03-02 00:09:16 [logger.py:42] Received request cmpl-058cb5a0630647858c495512222a30e4-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:09:16 [async_llm.py:261] Added request cmpl-058cb5a0630647858c495512222a30e4-0.
INFO 03-02 00:09:17 [logger.py:42] Received request cmpl-26c49ce758df41feb8d00300d7259723-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:09:17 [async_llm.py:261] Added request cmpl-26c49ce758df41feb8d00300d7259723-0.
INFO 03-02 00:09:18 [loggers.py:116] Engine 000: Avg prompt throughput: 27.9 tokens/s, Avg generation throughput: 4.5 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.6%
INFO 03-02 00:09:18 [logger.py:42] Received request cmpl-a10ae436666040b9a725ef985107ed7a-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:09:18 [async_llm.py:261] Added request cmpl-a10ae436666040b9a725ef985107ed7a-0.
INFO 03-02 00:09:20 [logger.py:42] Received request cmpl-403341e6ee8942bbb986182b72380ad8-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:09:20 [async_llm.py:261] Added request cmpl-403341e6ee8942bbb986182b72380ad8-0.
INFO 03-02 00:09:21 [logger.py:42] Received request cmpl-9030cb74e8c549568fd09b5a07f6daf4-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:09:21 [async_llm.py:261] Added request cmpl-9030cb74e8c549568fd09b5a07f6daf4-0.
INFO 03-02 00:09:22 [logger.py:42] Received request cmpl-8de3b706565d4cb8905cce406ea3082a-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:09:22 [async_llm.py:261] Added request cmpl-8de3b706565d4cb8905cce406ea3082a-0.
INFO 03-02 00:09:23 [logger.py:42] Received request cmpl-c2e6c9cc7ed74f0aa82ffc5d9601806d-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:09:23 [async_llm.py:261] Added request cmpl-c2e6c9cc7ed74f0aa82ffc5d9601806d-0.
INFO 03-02 00:09:24 [logger.py:42] Received request cmpl-8105d9b3012b4ff2b46e691fe6476c01-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:09:24 [async_llm.py:261] Added request cmpl-8105d9b3012b4ff2b46e691fe6476c01-0.
INFO 03-02 00:09:25 [logger.py:42] Received request cmpl-0c1bc1002d3c493d82cef3f1c3be7cd5-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:09:25 [async_llm.py:261] Added request cmpl-0c1bc1002d3c493d82cef3f1c3be7cd5-0.
INFO 03-02 00:09:26 [logger.py:42] Received request cmpl-7e0d37ccb92948de9714e4988489cabd-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:09:26 [async_llm.py:261] Added request cmpl-7e0d37ccb92948de9714e4988489cabd-0.
INFO 03-02 00:09:28 [loggers.py:116] Engine 000: Avg prompt throughput: 24.8 tokens/s, Avg generation throughput: 4.0 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.6%
INFO 03-02 00:09:28 [logger.py:42] Received request cmpl-ce90f7deee424cf8bb90e05e9659d9c6-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:09:28 [async_llm.py:261] Added request cmpl-ce90f7deee424cf8bb90e05e9659d9c6-0.
INFO 03-02 00:09:29 [logger.py:42] Received request cmpl-a00cf3ec43a94885a422311241bd300c-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:09:29 [async_llm.py:261] Added request cmpl-a00cf3ec43a94885a422311241bd300c-0.
INFO 03-02 00:09:30 [logger.py:42] Received request cmpl-49e754503ece4ef9a619356d5c136349-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:09:30 [async_llm.py:261] Added request cmpl-49e754503ece4ef9a619356d5c136349-0.
INFO 03-02 00:09:31 [logger.py:42] Received request cmpl-04428b364029407eae82ee52153d0288-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:09:31 [async_llm.py:261] Added request cmpl-04428b364029407eae82ee52153d0288-0.
INFO 03-02 00:09:32 [logger.py:42] Received request cmpl-5548e229763a4db78c7f8d637769f262-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:09:32 [async_llm.py:261] Added request cmpl-5548e229763a4db78c7f8d637769f262-0.
INFO 03-02 00:09:33 [logger.py:42] Received request cmpl-70a2a3aeefae4bc4b4cbeee7f97316fd-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:09:33 [async_llm.py:261] Added request cmpl-70a2a3aeefae4bc4b4cbeee7f97316fd-0.
INFO 03-02 00:09:35 [logger.py:42] Received request cmpl-d84d025af60f458e9c4956e425f28ca0-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:09:35 [async_llm.py:261] Added request cmpl-d84d025af60f458e9c4956e425f28ca0-0.
INFO 03-02 00:09:36 [logger.py:42] Received request cmpl-cc7f1aa85bf84005a0453654dd2e8c9e-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:09:36 [async_llm.py:261] Added request cmpl-cc7f1aa85bf84005a0453654dd2e8c9e-0.
INFO 03-02 00:09:37 [logger.py:42] Received request cmpl-0c645c06c0da47dda44196b08c829510-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:09:37 [async_llm.py:261] Added request cmpl-0c645c06c0da47dda44196b08c829510-0.
INFO 03-02 00:09:38 [loggers.py:116] Engine 000: Avg prompt throughput: 27.9 tokens/s, Avg generation throughput: 4.5 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.6%
INFO 03-02 00:09:38 [logger.py:42] Received request cmpl-53b5372f66664f1998269321463eee05-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:09:38 [async_llm.py:261] Added request cmpl-53b5372f66664f1998269321463eee05-0.
INFO 03-02 00:09:39 [logger.py:42] Received request cmpl-9ef1e9ca8894492da8d2f101c8ccd436-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:09:39 [async_llm.py:261] Added request cmpl-9ef1e9ca8894492da8d2f101c8ccd436-0.
INFO 03-02 00:09:40 [logger.py:42] Received request cmpl-16665839a6204b5b9c44a84798c3d7cb-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:09:40 [async_llm.py:261] Added request cmpl-16665839a6204b5b9c44a84798c3d7cb-0.
INFO 03-02 00:09:41 [logger.py:42] Received request cmpl-7a9d7dbca8c74e50866a3c6e147479d4-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:09:41 [async_llm.py:261] Added request cmpl-7a9d7dbca8c74e50866a3c6e147479d4-0.
INFO 03-02 00:09:43 [logger.py:42] Received request cmpl-760e61944b724840bc3047ad81125376-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:09:43 [async_llm.py:261] Added request cmpl-760e61944b724840bc3047ad81125376-0.
INFO 03-02 00:09:44 [logger.py:42] Received request cmpl-1319370688044db1b98dcc6fbdc94e7a-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:09:44 [async_llm.py:261] Added request cmpl-1319370688044db1b98dcc6fbdc94e7a-0.
INFO 03-02 00:09:45 [logger.py:42] Received request cmpl-b55a2945bfe746618873173d8a1b7139-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:09:45 [async_llm.py:261] Added request cmpl-b55a2945bfe746618873173d8a1b7139-0.
INFO 03-02 00:09:46 [logger.py:42] Received request cmpl-0f2b7cb5ba2848af860c6b9029b1f056-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:09:46 [async_llm.py:261] Added request cmpl-0f2b7cb5ba2848af860c6b9029b1f056-0.
INFO 03-02 00:09:47 [logger.py:42] Received request cmpl-8913fb3cd2fb4288bd90f8f512a4be81-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:09:47 [async_llm.py:261] Added request cmpl-8913fb3cd2fb4288bd90f8f512a4be81-0.
INFO 03-02 00:09:48 [loggers.py:116] Engine 000: Avg prompt throughput: 27.9 tokens/s, Avg generation throughput: 4.5 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.6%
INFO 03-02 00:09:48 [logger.py:42] Received request cmpl-d644c179e19042adb29bb1c0545a2448-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:09:48 [async_llm.py:261] Added request cmpl-d644c179e19042adb29bb1c0545a2448-0.
INFO 03-02 00:09:50 [logger.py:42] Received request cmpl-570b7182a7524f2e98376eea9f50049f-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:09:50 [async_llm.py:261] Added request cmpl-570b7182a7524f2e98376eea9f50049f-0.
INFO 03-02 00:09:51 [logger.py:42] Received request cmpl-f6495763a7664bedbc901ab98bca1029-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:09:51 [async_llm.py:261] Added request cmpl-f6495763a7664bedbc901ab98bca1029-0.
INFO 03-02 00:09:52 [logger.py:42] Received request cmpl-a7da141b7ef64568b5e94bbe5144c77a-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:09:52 [async_llm.py:261] Added request cmpl-a7da141b7ef64568b5e94bbe5144c77a-0.
INFO 03-02 00:09:53 [logger.py:42] Received request cmpl-90931540f8f54d8684caa25d93e4955e-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:09:53 [async_llm.py:261] Added request cmpl-90931540f8f54d8684caa25d93e4955e-0.
INFO 03-02 00:09:54 [logger.py:42] Received request cmpl-f2152f6f94f845148da512f4cd5a9008-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:09:54 [async_llm.py:261] Added request cmpl-f2152f6f94f845148da512f4cd5a9008-0.
INFO 03-02 00:09:55 [logger.py:42] Received request cmpl-368e35708973468ab2107dc14e7715bc-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:09:55 [async_llm.py:261] Added request cmpl-368e35708973468ab2107dc14e7715bc-0.
INFO 03-02 00:09:56 [logger.py:42] Received request cmpl-4745f666b7a342a0b9033af77e913804-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:09:56 [async_llm.py:261] Added request cmpl-4745f666b7a342a0b9033af77e913804-0.
INFO 03-02 00:09:58 [loggers.py:116] Engine 000: Avg prompt throughput: 24.8 tokens/s, Avg generation throughput: 4.0 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.6%
INFO 03-02 00:09:58 [logger.py:42] Received request cmpl-2021532789314f319862e6891c1b619d-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:09:58 [async_llm.py:261] Added request cmpl-2021532789314f319862e6891c1b619d-0.
INFO 03-02 00:09:59 [logger.py:42] Received request cmpl-4ad2664312464dc0b54891e7ab1df3fa-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:09:59 [async_llm.py:261] Added request cmpl-4ad2664312464dc0b54891e7ab1df3fa-0.
INFO 03-02 00:10:00 [logger.py:42] Received request cmpl-741859d132144e7da8f256f21e627595-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:10:00 [async_llm.py:261] Added request cmpl-741859d132144e7da8f256f21e627595-0.
INFO 03-02 00:10:01 [logger.py:42] Received request cmpl-3fc706cbaff341658afa221c4619bf43-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:10:01 [async_llm.py:261] Added request cmpl-3fc706cbaff341658afa221c4619bf43-0.
INFO 03-02 00:10:02 [logger.py:42] Received request cmpl-441a910a8c16482599c1ab9898e77ee1-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:10:02 [async_llm.py:261] Added request cmpl-441a910a8c16482599c1ab9898e77ee1-0.
INFO 03-02 00:10:03 [logger.py:42] Received request cmpl-847cab8811c74d249aad5435ee530368-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:10:03 [async_llm.py:261] Added request cmpl-847cab8811c74d249aad5435ee530368-0.
INFO 03-02 00:10:05 [logger.py:42] Received request cmpl-c60981cb04f74cd28cee65823e7bad67-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:10:05 [async_llm.py:261] Added request cmpl-c60981cb04f74cd28cee65823e7bad67-0.
INFO 03-02 00:10:06 [logger.py:42] Received request cmpl-717bd08c362b43d6bb6ab787ca55d868-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:10:06 [async_llm.py:261] Added request cmpl-717bd08c362b43d6bb6ab787ca55d868-0.
INFO 03-02 00:10:07 [logger.py:42] Received request cmpl-932ee3b8699b4137af3c5d8accf62823-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:10:07 [async_llm.py:261] Added request cmpl-932ee3b8699b4137af3c5d8accf62823-0.
INFO 03-02 00:10:08 [loggers.py:116] Engine 000: Avg prompt throughput: 27.9 tokens/s, Avg generation throughput: 4.5 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.6%
INFO 03-02 00:10:08 [logger.py:42] Received request cmpl-18e82b96bb01488c9bea436ddc09fcba-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:10:08 [async_llm.py:261] Added request cmpl-18e82b96bb01488c9bea436ddc09fcba-0.
INFO 03-02 00:10:09 [logger.py:42] Received request cmpl-03ed5ddae32a491fbfaf4c82a8a3a7df-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:10:09 [async_llm.py:261] Added request cmpl-03ed5ddae32a491fbfaf4c82a8a3a7df-0.
INFO 03-02 00:10:10 [logger.py:42] Received request cmpl-80b81274cc834ee9b9ea91c213d04c86-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:10:10 [async_llm.py:261] Added request cmpl-80b81274cc834ee9b9ea91c213d04c86-0.
INFO 03-02 00:10:11 [logger.py:42] Received request cmpl-12fb114bcb0b46c78d4f74d630d8adf7-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:10:11 [async_llm.py:261] Added request cmpl-12fb114bcb0b46c78d4f74d630d8adf7-0.
INFO 03-02 00:10:13 [logger.py:42] Received request cmpl-f4b8cc8b3f6049fe83583d924c6797a5-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:10:13 [async_llm.py:261] Added request cmpl-f4b8cc8b3f6049fe83583d924c6797a5-0.
INFO 03-02 00:10:14 [logger.py:42] Received request cmpl-1b07c33dff414b19bdd5fa3f4492eae9-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:10:14 [async_llm.py:261] Added request cmpl-1b07c33dff414b19bdd5fa3f4492eae9-0.
INFO 03-02 00:10:15 [logger.py:42] Received request cmpl-4dade655001c437f93ae4676c8113b88-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:10:15 [async_llm.py:261] Added request cmpl-4dade655001c437f93ae4676c8113b88-0.
INFO 03-02 00:10:16 [logger.py:42] Received request cmpl-20aafa7c4dde4448a9ed9ec0f46e39d2-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:10:16 [async_llm.py:261] Added request cmpl-20aafa7c4dde4448a9ed9ec0f46e39d2-0.
INFO 03-02 00:10:17 [logger.py:42] Received request cmpl-22e288f63db04a009b7cf9c1cd18bc71-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:10:17 [async_llm.py:261] Added request cmpl-22e288f63db04a009b7cf9c1cd18bc71-0.
INFO 03-02 00:10:18 [loggers.py:116] Engine 000: Avg prompt throughput: 27.9 tokens/s, Avg generation throughput: 4.5 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.6%
INFO 03-02 00:10:18 [logger.py:42] Received request cmpl-12d2e14802c54bf89b1a25e16163d524-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:10:18 [async_llm.py:261] Added request cmpl-12d2e14802c54bf89b1a25e16163d524-0.
INFO 03-02 00:10:20 [logger.py:42] Received request cmpl-b2a6936163f747f4a01fd7cd3bd7b3e7-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:10:20 [async_llm.py:261] Added request cmpl-b2a6936163f747f4a01fd7cd3bd7b3e7-0.
INFO 03-02 00:10:21 [logger.py:42] Received request cmpl-aeb77c21ae8d461da68aa23fe49d02ed-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:10:21 [async_llm.py:261] Added request cmpl-aeb77c21ae8d461da68aa23fe49d02ed-0.
INFO 03-02 00:10:22 [logger.py:42] Received request cmpl-03ebb2807a5d4ef393ac610f5fc70655-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:10:22 [async_llm.py:261] Added request cmpl-03ebb2807a5d4ef393ac610f5fc70655-0.
INFO 03-02 00:10:23 [logger.py:42] Received request cmpl-98cd64c86cd44b24871096ed5c6791c4-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:10:23 [async_llm.py:261] Added request cmpl-98cd64c86cd44b24871096ed5c6791c4-0.
INFO 03-02 00:10:24 [logger.py:42] Received request cmpl-71500c8326694b7a8871954ff429ee18-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:10:24 [async_llm.py:261] Added request cmpl-71500c8326694b7a8871954ff429ee18-0.
INFO 03-02 00:10:25 [logger.py:42] Received request cmpl-1089e979773b46388c0a295351a0611f-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:10:25 [async_llm.py:261] Added request cmpl-1089e979773b46388c0a295351a0611f-0.
INFO 03-02 00:10:26 [logger.py:42] Received request cmpl-ebd717fc2ac74469b4acb51de4ca0c81-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:10:26 [async_llm.py:261] Added request cmpl-ebd717fc2ac74469b4acb51de4ca0c81-0.
INFO 03-02 00:10:28 [loggers.py:116] Engine 000: Avg prompt throughput: 24.8 tokens/s, Avg generation throughput: 4.0 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.6%
INFO 03-02 00:10:28 [logger.py:42] Received request cmpl-ea22e969bad143c19390dec2acc870ba-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:10:28 [async_llm.py:261] Added request cmpl-ea22e969bad143c19390dec2acc870ba-0.
INFO 03-02 00:10:29 [logger.py:42] Received request cmpl-ec60344a5e684b3f8d968a3f75fbc849-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:10:29 [async_llm.py:261] Added request cmpl-ec60344a5e684b3f8d968a3f75fbc849-0.
INFO 03-02 00:10:30 [logger.py:42] Received request cmpl-554ee1394d964f85b565a02a6886dc7f-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:10:30 [async_llm.py:261] Added request cmpl-554ee1394d964f85b565a02a6886dc7f-0.
INFO 03-02 00:10:31 [logger.py:42] Received request cmpl-db870707d9fa457598fcdaf5cc08c673-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:10:31 [async_llm.py:261] Added request cmpl-db870707d9fa457598fcdaf5cc08c673-0.
INFO 03-02 00:10:32 [logger.py:42] Received request cmpl-645b7ffcfea845b39a29f0e26425bb69-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:10:32 [async_llm.py:261] Added request cmpl-645b7ffcfea845b39a29f0e26425bb69-0.
INFO 03-02 00:10:33 [logger.py:42] Received request cmpl-0a249906f6f543c1b9cb64708d06032f-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:10:33 [async_llm.py:261] Added request cmpl-0a249906f6f543c1b9cb64708d06032f-0.
INFO 03-02 00:10:35 [logger.py:42] Received request cmpl-973ede3d0fba42ed9cbf32cc15e474e6-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:10:35 [async_llm.py:261] Added request cmpl-973ede3d0fba42ed9cbf32cc15e474e6-0.
INFO 03-02 00:10:36 [logger.py:42] Received request cmpl-7df3e03c39da43c1a7fb59cf2467ff9e-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:10:36 [async_llm.py:261] Added request cmpl-7df3e03c39da43c1a7fb59cf2467ff9e-0.
INFO 03-02 00:10:37 [logger.py:42] Received request cmpl-9ba8bdd92e4e4014b988bea568d003f1-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:10:37 [async_llm.py:261] Added request cmpl-9ba8bdd92e4e4014b988bea568d003f1-0.
INFO 03-02 00:10:38 [loggers.py:116] Engine 000: Avg prompt throughput: 27.9 tokens/s, Avg generation throughput: 4.5 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.6%
INFO 03-02 00:10:38 [logger.py:42] Received request cmpl-d5d1868ded4b4c759f29065acd07030c-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:10:38 [async_llm.py:261] Added request cmpl-d5d1868ded4b4c759f29065acd07030c-0.
INFO 03-02 00:10:39 [logger.py:42] Received request cmpl-f042583019e948158813d6d550b5bdee-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:10:39 [async_llm.py:261] Added request cmpl-f042583019e948158813d6d550b5bdee-0.
INFO 03-02 00:10:40 [logger.py:42] Received request cmpl-5d800be5025d491fa2a850aa70db26e1-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:10:40 [async_llm.py:261] Added request cmpl-5d800be5025d491fa2a850aa70db26e1-0.
INFO 03-02 00:10:41 [logger.py:42] Received request cmpl-a2e73508ae4344fba2152f4a4fcac285-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:10:41 [async_llm.py:261] Added request cmpl-a2e73508ae4344fba2152f4a4fcac285-0.
INFO 03-02 00:10:43 [logger.py:42] Received request cmpl-f57d0157a7714d74a7424a85a3603369-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:10:43 [async_llm.py:261] Added request cmpl-f57d0157a7714d74a7424a85a3603369-0.
INFO 03-02 00:10:44 [logger.py:42] Received request cmpl-3518a13234424c59b2eebb62195d27d0-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:10:44 [async_llm.py:261] Added request cmpl-3518a13234424c59b2eebb62195d27d0-0.
INFO 03-02 00:10:45 [logger.py:42] Received request cmpl-2c15e3e0d7f5496a9c9511c7259d4e8d-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:10:45 [async_llm.py:261] Added request cmpl-2c15e3e0d7f5496a9c9511c7259d4e8d-0.
INFO 03-02 00:10:46 [logger.py:42] Received request cmpl-848200686d6b4d0c9b9da31190b83127-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:10:46 [async_llm.py:261] Added request cmpl-848200686d6b4d0c9b9da31190b83127-0.
INFO 03-02 00:10:47 [logger.py:42] Received request cmpl-0273ae3e18ee4dbfb032d717004e7863-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:10:47 [async_llm.py:261] Added request cmpl-0273ae3e18ee4dbfb032d717004e7863-0.
INFO 03-02 00:10:48 [loggers.py:116] Engine 000: Avg prompt throughput: 27.9 tokens/s, Avg generation throughput: 4.5 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.6%
INFO 03-02 00:10:48 [logger.py:42] Received request cmpl-75b1e6a0e8d546bc803e604f4800a1ea-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:10:48 [async_llm.py:261] Added request cmpl-75b1e6a0e8d546bc803e604f4800a1ea-0.
INFO 03-02 00:10:50 [logger.py:42] Received request cmpl-4c9eb1ffd43844569eed11d934dbbccb-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:10:50 [async_llm.py:261] Added request cmpl-4c9eb1ffd43844569eed11d934dbbccb-0.
INFO 03-02 00:10:51 [logger.py:42] Received request cmpl-5e226f4ff526401994c42619a5f286ac-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:10:51 [async_llm.py:261] Added request cmpl-5e226f4ff526401994c42619a5f286ac-0.
INFO 03-02 00:10:52 [logger.py:42] Received request cmpl-fff6a91b2ede4c43a0f2589539698b6c-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:10:52 [async_llm.py:261] Added request cmpl-fff6a91b2ede4c43a0f2589539698b6c-0.
INFO 03-02 00:10:53 [logger.py:42] Received request cmpl-361010492779491cb089cad7fdc88567-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:10:53 [async_llm.py:261] Added request cmpl-361010492779491cb089cad7fdc88567-0.
INFO 03-02 00:10:54 [logger.py:42] Received request cmpl-3a18a9a774704356a45b3fe053be9fd4-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:10:54 [async_llm.py:261] Added request cmpl-3a18a9a774704356a45b3fe053be9fd4-0.
INFO 03-02 00:10:55 [logger.py:42] Received request cmpl-5267925d0ef74277bdbb119837f6f8a8-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:10:55 [async_llm.py:261] Added request cmpl-5267925d0ef74277bdbb119837f6f8a8-0.
INFO 03-02 00:10:56 [logger.py:42] Received request cmpl-46299be1e2dd4ef49694ec35b3922d1c-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:10:56 [async_llm.py:261] Added request cmpl-46299be1e2dd4ef49694ec35b3922d1c-0.
INFO 03-02 00:10:58 [loggers.py:116] Engine 000: Avg prompt throughput: 24.8 tokens/s, Avg generation throughput: 4.0 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.6%
INFO 03-02 00:10:58 [logger.py:42] Received request cmpl-3f1c9d0c2171437699182bcd3319c6ef-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:10:58 [async_llm.py:261] Added request cmpl-3f1c9d0c2171437699182bcd3319c6ef-0.
INFO 03-02 00:10:59 [logger.py:42] Received request cmpl-a105ed0ccae64130a9143432a16c6ffd-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:10:59 [async_llm.py:261] Added request cmpl-a105ed0ccae64130a9143432a16c6ffd-0.
INFO 03-02 00:11:00 [logger.py:42] Received request cmpl-4c12364c09e54c7fbba46034fe7f153a-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:11:00 [async_llm.py:261] Added request cmpl-4c12364c09e54c7fbba46034fe7f153a-0.
INFO 03-02 00:11:01 [logger.py:42] Received request cmpl-ff2f699c5abb49409c6c6e2b5913f588-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:11:01 [async_llm.py:261] Added request cmpl-ff2f699c5abb49409c6c6e2b5913f588-0.
INFO 03-02 00:11:02 [logger.py:42] Received request cmpl-5b1f9e04e81e4b8693cd8a7b9ef6a829-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:11:02 [async_llm.py:261] Added request cmpl-5b1f9e04e81e4b8693cd8a7b9ef6a829-0.
INFO 03-02 00:11:03 [logger.py:42] Received request cmpl-01527ab89b2b48318ec3c185dc012ea8-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:11:03 [async_llm.py:261] Added request cmpl-01527ab89b2b48318ec3c185dc012ea8-0.
INFO 03-02 00:11:05 [logger.py:42] Received request cmpl-dfbeca9d218645bdaa914f0f3ed46fd6-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:11:05 [async_llm.py:261] Added request cmpl-dfbeca9d218645bdaa914f0f3ed46fd6-0.
INFO 03-02 00:11:06 [logger.py:42] Received request cmpl-d735d2c9151549a59a7e720d37280b1f-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:11:06 [async_llm.py:261] Added request cmpl-d735d2c9151549a59a7e720d37280b1f-0.
INFO 03-02 00:11:07 [logger.py:42] Received request cmpl-1d526508290a4737b06e757764e42fab-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:11:07 [async_llm.py:261] Added request cmpl-1d526508290a4737b06e757764e42fab-0.
INFO 03-02 00:11:08 [loggers.py:116] Engine 000: Avg prompt throughput: 27.9 tokens/s, Avg generation throughput: 4.5 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.6%
INFO 03-02 00:11:08 [logger.py:42] Received request cmpl-45dce2e338324ba0adce22c821a2d272-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:11:08 [async_llm.py:261] Added request cmpl-45dce2e338324ba0adce22c821a2d272-0.
INFO 03-02 00:11:09 [logger.py:42] Received request cmpl-01ab3a6d4e3d482b88b7c32539c27106-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:11:09 [async_llm.py:261] Added request cmpl-01ab3a6d4e3d482b88b7c32539c27106-0.
INFO 03-02 00:11:10 [logger.py:42] Received request cmpl-82139bdcfb2f411790092fdb4edecc20-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:11:10 [async_llm.py:261] Added request cmpl-82139bdcfb2f411790092fdb4edecc20-0.
INFO 03-02 00:11:11 [logger.py:42] Received request cmpl-9722011379ac46ada8e9fa95008cb08c-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:11:11 [async_llm.py:261] Added request cmpl-9722011379ac46ada8e9fa95008cb08c-0.
INFO 03-02 00:11:13 [logger.py:42] Received request cmpl-0c8c2b86825942569292e98a3bf4988f-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:11:13 [async_llm.py:261] Added request cmpl-0c8c2b86825942569292e98a3bf4988f-0.
INFO 03-02 00:11:14 [logger.py:42] Received request cmpl-0d53669b58804c3899fa4e3fa8fe1081-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:11:14 [async_llm.py:261] Added request cmpl-0d53669b58804c3899fa4e3fa8fe1081-0.
INFO 03-02 00:11:15 [logger.py:42] Received request cmpl-8e22d982c5d3440fbaed11aa668bf704-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:11:15 [async_llm.py:261] Added request cmpl-8e22d982c5d3440fbaed11aa668bf704-0.
INFO 03-02 00:11:16 [logger.py:42] Received request cmpl-f081d1f573454126899b49c3334fc302-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:11:16 [async_llm.py:261] Added request cmpl-f081d1f573454126899b49c3334fc302-0.
INFO 03-02 00:11:17 [logger.py:42] Received request cmpl-c393b321b90c414081d30ad92a36d28c-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:11:17 [async_llm.py:261] Added request cmpl-c393b321b90c414081d30ad92a36d28c-0.
INFO 03-02 00:11:18 [loggers.py:116] Engine 000: Avg prompt throughput: 27.9 tokens/s, Avg generation throughput: 4.5 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.6%
INFO 03-02 00:11:18 [logger.py:42] Received request cmpl-d88cf56eaaf1471b82228b3a144aaee7-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:11:18 [async_llm.py:261] Added request cmpl-d88cf56eaaf1471b82228b3a144aaee7-0.
INFO 03-02 00:11:20 [logger.py:42] Received request cmpl-75ee71f8dde34c558da767e3f213007a-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:11:20 [async_llm.py:261] Added request cmpl-75ee71f8dde34c558da767e3f213007a-0.
INFO 03-02 00:11:21 [logger.py:42] Received request cmpl-0085aa010c074e008acc52f2dec87450-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:11:21 [async_llm.py:261] Added request cmpl-0085aa010c074e008acc52f2dec87450-0.
INFO 03-02 00:11:22 [logger.py:42] Received request cmpl-9d89915d217541d38344d0a8103cf5b4-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:11:22 [async_llm.py:261] Added request cmpl-9d89915d217541d38344d0a8103cf5b4-0.
INFO 03-02 00:11:23 [logger.py:42] Received request cmpl-f1b96fe976434d55b7077545c17c3316-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:11:23 [async_llm.py:261] Added request cmpl-f1b96fe976434d55b7077545c17c3316-0.
INFO 03-02 00:11:24 [logger.py:42] Received request cmpl-26201a17087740639b3c4534527bf09e-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:11:24 [async_llm.py:261] Added request cmpl-26201a17087740639b3c4534527bf09e-0.
INFO 03-02 00:11:25 [logger.py:42] Received request cmpl-85b9f204868c408a8c767bdcbde8ba98-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:11:25 [async_llm.py:261] Added request cmpl-85b9f204868c408a8c767bdcbde8ba98-0.
INFO 03-02 00:11:26 [logger.py:42] Received request cmpl-8f66f9780cab4c219508ab880b5ff382-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:11:26 [async_llm.py:261] Added request cmpl-8f66f9780cab4c219508ab880b5ff382-0.
INFO 03-02 00:11:28 [loggers.py:116] Engine 000: Avg prompt throughput: 24.8 tokens/s, Avg generation throughput: 4.0 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.6%
INFO 03-02 00:11:28 [logger.py:42] Received request cmpl-c0c877c4771040499ccaa93e307b802e-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:11:28 [async_llm.py:261] Added request cmpl-c0c877c4771040499ccaa93e307b802e-0.
INFO 03-02 00:11:29 [logger.py:42] Received request cmpl-eecacec2e698491fa6bbd65c9787e6cd-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:11:29 [async_llm.py:261] Added request cmpl-eecacec2e698491fa6bbd65c9787e6cd-0.
INFO 03-02 00:11:30 [logger.py:42] Received request cmpl-cb46a13648374ce693a182094e6bdc6e-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:11:30 [async_llm.py:261] Added request cmpl-cb46a13648374ce693a182094e6bdc6e-0.
INFO 03-02 00:11:31 [logger.py:42] Received request cmpl-153936c3c37245d79ac128238b773a54-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:11:31 [async_llm.py:261] Added request cmpl-153936c3c37245d79ac128238b773a54-0.
INFO 03-02 00:11:32 [logger.py:42] Received request cmpl-8803cc06122249cda224e5f9b60a5d7e-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:11:32 [async_llm.py:261] Added request cmpl-8803cc06122249cda224e5f9b60a5d7e-0.
INFO 03-02 00:11:33 [logger.py:42] Received request cmpl-ce0dff35e7274b5ca028e62d307ed6d2-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:11:33 [async_llm.py:261] Added request cmpl-ce0dff35e7274b5ca028e62d307ed6d2-0.
INFO 03-02 00:11:35 [logger.py:42] Received request cmpl-8dafe8df28114438a6791c646e570ccd-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:11:35 [async_llm.py:261] Added request cmpl-8dafe8df28114438a6791c646e570ccd-0.
INFO 03-02 00:11:36 [logger.py:42] Received request cmpl-c2662c00d53c4033ac6355cbdb6cf056-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:11:36 [async_llm.py:261] Added request cmpl-c2662c00d53c4033ac6355cbdb6cf056-0.
INFO 03-02 00:11:37 [logger.py:42] Received request cmpl-4605710a6dd1457980d15266975702fe-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:11:37 [async_llm.py:261] Added request cmpl-4605710a6dd1457980d15266975702fe-0.
INFO 03-02 00:11:38 [loggers.py:116] Engine 000: Avg prompt throughput: 27.9 tokens/s, Avg generation throughput: 4.5 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.6%
INFO 03-02 00:11:38 [logger.py:42] Received request cmpl-21a4cf17cd204e458ec5968b55ca3b30-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:11:38 [async_llm.py:261] Added request cmpl-21a4cf17cd204e458ec5968b55ca3b30-0.
INFO 03-02 00:11:39 [logger.py:42] Received request cmpl-53c49e56e2a2478b9c0dd0cd4bef561d-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:11:39 [async_llm.py:261] Added request cmpl-53c49e56e2a2478b9c0dd0cd4bef561d-0.
INFO 03-02 00:11:40 [logger.py:42] Received request cmpl-771185d18a474dc0bb1dca127cca4afb-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:11:40 [async_llm.py:261] Added request cmpl-771185d18a474dc0bb1dca127cca4afb-0.
INFO 03-02 00:11:41 [logger.py:42] Received request cmpl-e1f9e0cdc9cc4f07b0797dd310c97a32-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:11:41 [async_llm.py:261] Added request cmpl-e1f9e0cdc9cc4f07b0797dd310c97a32-0.
INFO 03-02 00:11:43 [logger.py:42] Received request cmpl-2b72b2a875de42c8939b4f59dea08604-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:11:43 [async_llm.py:261] Added request cmpl-2b72b2a875de42c8939b4f59dea08604-0.
INFO 03-02 00:11:44 [logger.py:42] Received request cmpl-6f6bf53d4e1349ebb570e74132d8d898-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:11:44 [async_llm.py:261] Added request cmpl-6f6bf53d4e1349ebb570e74132d8d898-0.
INFO 03-02 00:11:45 [logger.py:42] Received request cmpl-e009514594554e28b26c53bf67894033-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:11:45 [async_llm.py:261] Added request cmpl-e009514594554e28b26c53bf67894033-0.
INFO 03-02 00:11:46 [logger.py:42] Received request cmpl-c54ce6e6cccc4963928a7b3af936d9a6-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:11:46 [async_llm.py:261] Added request cmpl-c54ce6e6cccc4963928a7b3af936d9a6-0.
INFO 03-02 00:11:47 [logger.py:42] Received request cmpl-25b1318937984a26ab9acb535991e6e2-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:11:47 [async_llm.py:261] Added request cmpl-25b1318937984a26ab9acb535991e6e2-0.
INFO 03-02 00:11:48 [loggers.py:116] Engine 000: Avg prompt throughput: 27.9 tokens/s, Avg generation throughput: 4.5 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.6%
INFO 03-02 00:11:48 [logger.py:42] Received request cmpl-f86a1e0d92e142248a2238fb3e9d989e-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:11:48 [async_llm.py:261] Added request cmpl-f86a1e0d92e142248a2238fb3e9d989e-0.
INFO 03-02 00:11:49 [logger.py:42] Received request cmpl-f9e3d772273e4e438609e86f45fc858b-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:11:49 [async_llm.py:261] Added request cmpl-f9e3d772273e4e438609e86f45fc858b-0.
INFO 03-02 00:11:51 [logger.py:42] Received request cmpl-4bfa97d6f3e344e0abde6cce0968afb9-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:11:51 [async_llm.py:261] Added request cmpl-4bfa97d6f3e344e0abde6cce0968afb9-0.
INFO 03-02 00:11:52 [logger.py:42] Received request cmpl-43b2c2a729b84432b7d872ebe8b645f0-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:11:52 [async_llm.py:261] Added request cmpl-43b2c2a729b84432b7d872ebe8b645f0-0.
INFO 03-02 00:11:53 [logger.py:42] Received request cmpl-f1cd213f7cdd4763b5c6338edfc49d82-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:11:53 [async_llm.py:261] Added request cmpl-f1cd213f7cdd4763b5c6338edfc49d82-0.
INFO 03-02 00:11:54 [logger.py:42] Received request cmpl-eed8d4e3adce443db991e3f07a351770-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:11:54 [async_llm.py:261] Added request cmpl-eed8d4e3adce443db991e3f07a351770-0.
INFO 03-02 00:11:55 [logger.py:42] Received request cmpl-b72bd436afad412b867cd385ca155137-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:11:55 [async_llm.py:261] Added request cmpl-b72bd436afad412b867cd385ca155137-0.
INFO 03-02 00:11:56 [logger.py:42] Received request cmpl-f4a6db65bd704b52a2ad22882b5c4739-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:11:56 [async_llm.py:261] Added request cmpl-f4a6db65bd704b52a2ad22882b5c4739-0.
INFO 03-02 00:11:58 [loggers.py:116] Engine 000: Avg prompt throughput: 24.8 tokens/s, Avg generation throughput: 4.0 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.6%
INFO 03-02 00:11:58 [logger.py:42] Received request cmpl-bbc37aebf2ae42bf97d1bf2dbd8610db-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:11:58 [async_llm.py:261] Added request cmpl-bbc37aebf2ae42bf97d1bf2dbd8610db-0.
INFO 03-02 00:11:59 [logger.py:42] Received request cmpl-a944ca2bf442403193fe0e50cfb12f9c-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:11:59 [async_llm.py:261] Added request cmpl-a944ca2bf442403193fe0e50cfb12f9c-0.
INFO 03-02 00:12:00 [logger.py:42] Received request cmpl-1e39ed099aa24554a57f5871f41d8980-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:12:00 [async_llm.py:261] Added request cmpl-1e39ed099aa24554a57f5871f41d8980-0.
INFO 03-02 00:12:01 [logger.py:42] Received request cmpl-16bc4c9668c2496bb9ee8bb77f06c6da-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:12:01 [async_llm.py:261] Added request cmpl-16bc4c9668c2496bb9ee8bb77f06c6da-0.
INFO 03-02 00:12:02 [logger.py:42] Received request cmpl-139da85708cf42749bd1eb6bf837afd7-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:12:02 [async_llm.py:261] Added request cmpl-139da85708cf42749bd1eb6bf837afd7-0.
INFO 03-02 00:12:03 [logger.py:42] Received request cmpl-01c855bc2eb9491db814c2981e6084ed-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:12:03 [async_llm.py:261] Added request cmpl-01c855bc2eb9491db814c2981e6084ed-0.
INFO 03-02 00:12:04 [logger.py:42] Received request cmpl-74e3333290de45e49acc047495a8fb20-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:12:04 [async_llm.py:261] Added request cmpl-74e3333290de45e49acc047495a8fb20-0.
INFO 03-02 00:12:06 [logger.py:42] Received request cmpl-045af268dade44cbba684b27efb4f593-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:12:06 [async_llm.py:261] Added request cmpl-045af268dade44cbba684b27efb4f593-0.
INFO 03-02 00:12:07 [logger.py:42] Received request cmpl-46943b9952424f21b89f3133e886aeaa-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:12:07 [async_llm.py:261] Added request cmpl-46943b9952424f21b89f3133e886aeaa-0.
INFO 03-02 00:12:08 [loggers.py:116] Engine 000: Avg prompt throughput: 27.9 tokens/s, Avg generation throughput: 4.5 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.6%
INFO 03-02 00:12:08 [logger.py:42] Received request cmpl-f0c9d32d3bb34458b8ba6be02b235066-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:12:08 [async_llm.py:261] Added request cmpl-f0c9d32d3bb34458b8ba6be02b235066-0.
INFO 03-02 00:12:09 [logger.py:42] Received request cmpl-415dd46fb7644654ac04a953349262ff-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:12:09 [async_llm.py:261] Added request cmpl-415dd46fb7644654ac04a953349262ff-0.
INFO 03-02 00:12:10 [logger.py:42] Received request cmpl-cfa31bfbde0f4350a41ab9f026b95b0f-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:12:10 [async_llm.py:261] Added request cmpl-cfa31bfbde0f4350a41ab9f026b95b0f-0.
INFO 03-02 00:12:11 [logger.py:42] Received request cmpl-a0efaab480e64a1f9a1ee7f903c61128-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:12:11 [async_llm.py:261] Added request cmpl-a0efaab480e64a1f9a1ee7f903c61128-0.
INFO 03-02 00:12:13 [logger.py:42] Received request cmpl-76d54d1c94fe43e596bbfa3e5931ab7c-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:12:13 [async_llm.py:261] Added request cmpl-76d54d1c94fe43e596bbfa3e5931ab7c-0.
INFO 03-02 00:12:14 [logger.py:42] Received request cmpl-bfab47a30399421ba4430e932f04b47a-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:12:14 [async_llm.py:261] Added request cmpl-bfab47a30399421ba4430e932f04b47a-0.
INFO 03-02 00:12:15 [logger.py:42] Received request cmpl-eefcf310f66348349c958b968c9961ae-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:12:15 [async_llm.py:261] Added request cmpl-eefcf310f66348349c958b968c9961ae-0.
INFO 03-02 00:12:16 [logger.py:42] Received request cmpl-583ce507e6c54905aae4f4616a78bfa9-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:12:16 [async_llm.py:261] Added request cmpl-583ce507e6c54905aae4f4616a78bfa9-0.
INFO 03-02 00:12:17 [logger.py:42] Received request cmpl-a09f3b3e7aaf4d59b2950c1c6c57e3f0-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:12:17 [async_llm.py:261] Added request cmpl-a09f3b3e7aaf4d59b2950c1c6c57e3f0-0.
INFO 03-02 00:12:18 [loggers.py:116] Engine 000: Avg prompt throughput: 27.9 tokens/s, Avg generation throughput: 4.5 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.6%
INFO 03-02 00:12:18 [logger.py:42] Received request cmpl-c3bbc27580424de19b584a3f82d29d03-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:12:18 [async_llm.py:261] Added request cmpl-c3bbc27580424de19b584a3f82d29d03-0.
INFO 03-02 00:12:19 [logger.py:42] Received request cmpl-18def89cebe14aa2bed865db4447e634-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:12:19 [async_llm.py:261] Added request cmpl-18def89cebe14aa2bed865db4447e634-0.
INFO 03-02 00:12:21 [logger.py:42] Received request cmpl-0fca5775100c4b3ba8bc929b4a371b8d-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:12:21 [async_llm.py:261] Added request cmpl-0fca5775100c4b3ba8bc929b4a371b8d-0.
INFO 03-02 00:12:22 [logger.py:42] Received request cmpl-7763ad7b189d44f68a94cfe877e56ad7-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:12:22 [async_llm.py:261] Added request cmpl-7763ad7b189d44f68a94cfe877e56ad7-0.
INFO 03-02 00:12:23 [logger.py:42] Received request cmpl-f29902b91f864eeea44a2d4af957a360-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:12:23 [async_llm.py:261] Added request cmpl-f29902b91f864eeea44a2d4af957a360-0.
INFO 03-02 00:12:24 [logger.py:42] Received request cmpl-2e1969ca1e6c4462a4f4508e93c3cc5d-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:12:24 [async_llm.py:261] Added request cmpl-2e1969ca1e6c4462a4f4508e93c3cc5d-0.
INFO 03-02 00:12:25 [logger.py:42] Received request cmpl-374f787e141e412c955d8ee75cd6e9f8-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:12:25 [async_llm.py:261] Added request cmpl-374f787e141e412c955d8ee75cd6e9f8-0.
INFO 03-02 00:12:26 [logger.py:42] Received request cmpl-5e8cc747580a4221b9c0f338e5d84b33-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:12:26 [async_llm.py:261] Added request cmpl-5e8cc747580a4221b9c0f338e5d84b33-0.
INFO 03-02 00:12:28 [logger.py:42] Received request cmpl-ebdd8870bcd94bcd9cb474ae06361fdb-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:12:28 [async_llm.py:261] Added request cmpl-ebdd8870bcd94bcd9cb474ae06361fdb-0.
INFO 03-02 00:12:28 [loggers.py:116] Engine 000: Avg prompt throughput: 24.8 tokens/s, Avg generation throughput: 4.0 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.6%
INFO 03-02 00:12:29 [logger.py:42] Received request cmpl-dc27bac16d9247848956488d81da08c7-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:12:29 [async_llm.py:261] Added request cmpl-dc27bac16d9247848956488d81da08c7-0.
INFO 03-02 00:12:30 [logger.py:42] Received request cmpl-f1d1dbdea4034af08a133bc5c10352e2-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:12:30 [async_llm.py:261] Added request cmpl-f1d1dbdea4034af08a133bc5c10352e2-0.
INFO 03-02 00:12:31 [logger.py:42] Received request cmpl-530186b0360c47e8afb3daa48ae009a0-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:12:31 [async_llm.py:261] Added request cmpl-530186b0360c47e8afb3daa48ae009a0-0.
INFO 03-02 00:12:32 [logger.py:42] Received request cmpl-20fea33b927147ca919a9e738508d720-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:12:32 [async_llm.py:261] Added request cmpl-20fea33b927147ca919a9e738508d720-0.
INFO 03-02 00:12:33 [logger.py:42] Received request cmpl-a4267dd85aa646b58b43fee1bbce029f-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:12:33 [async_llm.py:261] Added request cmpl-a4267dd85aa646b58b43fee1bbce029f-0.
INFO 03-02 00:12:34 [logger.py:42] Received request cmpl-4a09d834c5a942e3ae42196343eed8d5-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:12:34 [async_llm.py:261] Added request cmpl-4a09d834c5a942e3ae42196343eed8d5-0.
INFO 03-02 00:12:36 [logger.py:42] Received request cmpl-7013588d2db84a3c80fe37760dd98e6e-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:12:36 [async_llm.py:261] Added request cmpl-7013588d2db84a3c80fe37760dd98e6e-0.
INFO 03-02 00:12:37 [logger.py:42] Received request cmpl-ca0ae4162a374676975e75be624a1915-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:12:37 [async_llm.py:261] Added request cmpl-ca0ae4162a374676975e75be624a1915-0.
INFO 03-02 00:12:38 [loggers.py:116] Engine 000: Avg prompt throughput: 27.9 tokens/s, Avg generation throughput: 4.5 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.6%
INFO 03-02 00:12:38 [logger.py:42] Received request cmpl-08bbe0b6ed6e4e7ca25d1417db9d9a6c-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:12:38 [async_llm.py:261] Added request cmpl-08bbe0b6ed6e4e7ca25d1417db9d9a6c-0.
INFO 03-02 00:12:39 [logger.py:42] Received request cmpl-6972ab7dba3b44bb8d3485a9038634c7-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:12:39 [async_llm.py:261] Added request cmpl-6972ab7dba3b44bb8d3485a9038634c7-0.
INFO 03-02 00:12:40 [logger.py:42] Received request cmpl-ab962d301f70444497f261ed3c71f53c-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:12:40 [async_llm.py:261] Added request cmpl-ab962d301f70444497f261ed3c71f53c-0.
INFO 03-02 00:12:41 [logger.py:42] Received request cmpl-5ba5abd63e8a4281bbd6749389062a78-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:12:41 [async_llm.py:261] Added request cmpl-5ba5abd63e8a4281bbd6749389062a78-0.
INFO 03-02 00:12:43 [logger.py:42] Received request cmpl-f411069f08ba441b90ab8938740a1096-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:12:43 [async_llm.py:261] Added request cmpl-f411069f08ba441b90ab8938740a1096-0.
INFO 03-02 00:12:44 [logger.py:42] Received request cmpl-0bd1dd80657f4e69bf1606ef337965f8-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:12:44 [async_llm.py:261] Added request cmpl-0bd1dd80657f4e69bf1606ef337965f8-0.
INFO 03-02 00:12:45 [logger.py:42] Received request cmpl-8464eb4fd5b54d13b5036c337834bd01-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:12:45 [async_llm.py:261] Added request cmpl-8464eb4fd5b54d13b5036c337834bd01-0.
INFO 03-02 00:12:46 [logger.py:42] Received request cmpl-e9eada5ac7bc41508098451ce89c1eec-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:12:46 [async_llm.py:261] Added request cmpl-e9eada5ac7bc41508098451ce89c1eec-0.
INFO 03-02 00:12:47 [logger.py:42] Received request cmpl-086e15d445784fd683575b08c1c1c1d2-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:12:47 [async_llm.py:261] Added request cmpl-086e15d445784fd683575b08c1c1c1d2-0.
INFO 03-02 00:12:48 [loggers.py:116] Engine 000: Avg prompt throughput: 27.9 tokens/s, Avg generation throughput: 4.5 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.6%
INFO 03-02 00:12:48 [logger.py:42] Received request cmpl-87832ffea23e4f8e9707ba799fb0e2cc-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:12:48 [async_llm.py:261] Added request cmpl-87832ffea23e4f8e9707ba799fb0e2cc-0.
INFO 03-02 00:12:49 [logger.py:42] Received request cmpl-25c8453dfb454ec88b8d1c9d4dd2b489-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:12:49 [async_llm.py:261] Added request cmpl-25c8453dfb454ec88b8d1c9d4dd2b489-0.
INFO 03-02 00:12:51 [logger.py:42] Received request cmpl-6178c5c7ca57465f9bf727788b4f7bdf-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:12:51 [async_llm.py:261] Added request cmpl-6178c5c7ca57465f9bf727788b4f7bdf-0.
INFO 03-02 00:12:52 [logger.py:42] Received request cmpl-7b2c3b5b5a4340edb72c5aed5b270a20-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:12:52 [async_llm.py:261] Added request cmpl-7b2c3b5b5a4340edb72c5aed5b270a20-0.
INFO 03-02 00:12:53 [logger.py:42] Received request cmpl-0a054bc9af404565a95389536b419976-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:12:53 [async_llm.py:261] Added request cmpl-0a054bc9af404565a95389536b419976-0.
INFO 03-02 00:12:54 [logger.py:42] Received request cmpl-2bf8a4779ebc47e9907c81c9c07a45e7-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:12:54 [async_llm.py:261] Added request cmpl-2bf8a4779ebc47e9907c81c9c07a45e7-0.
INFO 03-02 00:12:55 [logger.py:42] Received request cmpl-f1c57e0166a140b7b3f06688833fdf09-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:12:55 [async_llm.py:261] Added request cmpl-f1c57e0166a140b7b3f06688833fdf09-0.
INFO 03-02 00:12:56 [logger.py:42] Received request cmpl-fad8af4d49ba4657970339115e41e10f-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:12:56 [async_llm.py:261] Added request cmpl-fad8af4d49ba4657970339115e41e10f-0.
INFO 03-02 00:12:58 [logger.py:42] Received request cmpl-c6c674a185ab425fa6ae93a5f8f90343-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:12:58 [async_llm.py:261] Added request cmpl-c6c674a185ab425fa6ae93a5f8f90343-0.
INFO 03-02 00:12:58 [loggers.py:116] Engine 000: Avg prompt throughput: 27.9 tokens/s, Avg generation throughput: 4.1 tokens/s, Running: 1 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.3%, Prefix cache hit rate: 51.6%
INFO 03-02 00:12:59 [logger.py:42] Received request cmpl-8cc0bc9aa48a4e7a96f66098e1efbe63-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:12:59 [async_llm.py:261] Added request cmpl-8cc0bc9aa48a4e7a96f66098e1efbe63-0.
INFO 03-02 00:13:00 [logger.py:42] Received request cmpl-874f1503e22b41c4b8ec2c25322dfc8b-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:13:00 [async_llm.py:261] Added request cmpl-874f1503e22b41c4b8ec2c25322dfc8b-0.
INFO 03-02 00:13:01 [logger.py:42] Received request cmpl-ca17a707d6914388ae4a4724414d2da9-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:13:01 [async_llm.py:261] Added request cmpl-ca17a707d6914388ae4a4724414d2da9-0.
INFO 03-02 00:13:02 [logger.py:42] Received request cmpl-c3ec97e56bea4d96b5e530b9f74d6999-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:13:02 [async_llm.py:261] Added request cmpl-c3ec97e56bea4d96b5e530b9f74d6999-0.
INFO 03-02 00:13:03 [logger.py:42] Received request cmpl-741f650538384ea2bf8d4c47ece42d35-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:13:03 [async_llm.py:261] Added request cmpl-741f650538384ea2bf8d4c47ece42d35-0.
INFO 03-02 00:13:04 [logger.py:42] Received request cmpl-5c6ba965cced4f9a98685b1c180d599e-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:13:04 [async_llm.py:261] Added request cmpl-5c6ba965cced4f9a98685b1c180d599e-0.
INFO 03-02 00:13:06 [logger.py:42] Received request cmpl-0a627771f0b24046a82ff82e2d081055-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:13:06 [async_llm.py:261] Added request cmpl-0a627771f0b24046a82ff82e2d081055-0.
INFO 03-02 00:13:07 [logger.py:42] Received request cmpl-96b0790acf594532ad811dab691edb41-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:13:07 [async_llm.py:261] Added request cmpl-96b0790acf594532ad811dab691edb41-0.
INFO 03-02 00:13:08 [loggers.py:116] Engine 000: Avg prompt throughput: 24.8 tokens/s, Avg generation throughput: 4.4 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.6%
INFO 03-02 00:13:08 [logger.py:42] Received request cmpl-17e6bd5ff15f4bf6ad3f28623d177f7b-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:13:08 [async_llm.py:261] Added request cmpl-17e6bd5ff15f4bf6ad3f28623d177f7b-0.
INFO 03-02 00:13:09 [logger.py:42] Received request cmpl-636433aff9df4a48ada6e5ce012ebb6b-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:13:09 [async_llm.py:261] Added request cmpl-636433aff9df4a48ada6e5ce012ebb6b-0.
INFO 03-02 00:13:10 [logger.py:42] Received request cmpl-6f2841a874ae40059be5a95611437c65-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:13:10 [async_llm.py:261] Added request cmpl-6f2841a874ae40059be5a95611437c65-0.
INFO 03-02 00:13:11 [logger.py:42] Received request cmpl-a2dd28f916bb470197159085060d712b-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:13:11 [async_llm.py:261] Added request cmpl-a2dd28f916bb470197159085060d712b-0.
INFO 03-02 00:13:12 [logger.py:42] Received request cmpl-60f8aa369bca4331b1644effa5ef7a74-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:13:12 [async_llm.py:261] Added request cmpl-60f8aa369bca4331b1644effa5ef7a74-0.
INFO 03-02 00:13:14 [logger.py:42] Received request cmpl-9792465ea0bd4c5e97c46d4f60d4f17a-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:13:14 [async_llm.py:261] Added request cmpl-9792465ea0bd4c5e97c46d4f60d4f17a-0.
INFO 03-02 00:13:15 [logger.py:42] Received request cmpl-ce48ae238a3c4375a9459794f9be14f5-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:13:15 [async_llm.py:261] Added request cmpl-ce48ae238a3c4375a9459794f9be14f5-0.
INFO 03-02 00:13:16 [logger.py:42] Received request cmpl-7b7dd9552e264dcfa95d04d048684c8a-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:13:16 [async_llm.py:261] Added request cmpl-7b7dd9552e264dcfa95d04d048684c8a-0.
INFO 03-02 00:13:17 [logger.py:42] Received request cmpl-e4e4a1305214410a84d0cad56c467ba7-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:13:17 [async_llm.py:261] Added request cmpl-e4e4a1305214410a84d0cad56c467ba7-0.
INFO 03-02 00:13:18 [loggers.py:116] Engine 000: Avg prompt throughput: 27.9 tokens/s, Avg generation throughput: 4.5 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.6%
INFO 03-02 00:13:18 [logger.py:42] Received request cmpl-34b85e42acc8440f9733e3103a61d6e3-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:13:18 [async_llm.py:261] Added request cmpl-34b85e42acc8440f9733e3103a61d6e3-0.
INFO 03-02 00:13:19 [logger.py:42] Received request cmpl-a587b062175644589dd435523ad02c43-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:13:19 [async_llm.py:261] Added request cmpl-a587b062175644589dd435523ad02c43-0.
INFO 03-02 00:13:21 [logger.py:42] Received request cmpl-311748c38cbe451e9f19b05101057739-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:13:21 [async_llm.py:261] Added request cmpl-311748c38cbe451e9f19b05101057739-0.
INFO 03-02 00:13:22 [logger.py:42] Received request cmpl-4b9a4db3947b406d8e502c8d41c83c24-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:13:22 [async_llm.py:261] Added request cmpl-4b9a4db3947b406d8e502c8d41c83c24-0.
INFO 03-02 00:13:23 [logger.py:42] Received request cmpl-a55b2cd2420d454e90d95bed91459d8f-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:13:23 [async_llm.py:261] Added request cmpl-a55b2cd2420d454e90d95bed91459d8f-0.
INFO 03-02 00:13:24 [logger.py:42] Received request cmpl-c2965d88a3584392a0882c487766bcbd-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:13:24 [async_llm.py:261] Added request cmpl-c2965d88a3584392a0882c487766bcbd-0.
INFO 03-02 00:13:25 [logger.py:42] Received request cmpl-a62dbd0522e94e75996d8c39ae2c3be7-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:13:25 [async_llm.py:261] Added request cmpl-a62dbd0522e94e75996d8c39ae2c3be7-0.
INFO 03-02 00:13:26 [logger.py:42] Received request cmpl-5192b0fcd4e54f2a89834289b2459221-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:13:26 [async_llm.py:261] Added request cmpl-5192b0fcd4e54f2a89834289b2459221-0.
INFO 03-02 00:13:27 [logger.py:42] Received request cmpl-73f4087f611a45798aefa6ff8ffa4bbc-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:13:27 [async_llm.py:261] Added request cmpl-73f4087f611a45798aefa6ff8ffa4bbc-0.
INFO 03-02 00:13:28 [loggers.py:116] Engine 000: Avg prompt throughput: 27.9 tokens/s, Avg generation throughput: 4.2 tokens/s, Running: 1 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.3%, Prefix cache hit rate: 51.6%
INFO 03-02 00:13:29 [logger.py:42] Received request cmpl-09e342630c9a413897a79305db37676f-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:13:29 [async_llm.py:261] Added request cmpl-09e342630c9a413897a79305db37676f-0.
INFO 03-02 00:13:30 [logger.py:42] Received request cmpl-a40e10057cd94dabb454186ef1efd63e-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:13:30 [async_llm.py:261] Added request cmpl-a40e10057cd94dabb454186ef1efd63e-0.
INFO 03-02 00:13:31 [logger.py:42] Received request cmpl-75c1f4917ef04deba8d815ff0da8fb8f-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:13:31 [async_llm.py:261] Added request cmpl-75c1f4917ef04deba8d815ff0da8fb8f-0.
INFO 03-02 00:13:32 [logger.py:42] Received request cmpl-d3132404d7244322af5f93909a6e1914-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:13:32 [async_llm.py:261] Added request cmpl-d3132404d7244322af5f93909a6e1914-0.
INFO 03-02 00:13:33 [logger.py:42] Received request cmpl-6e2d2ad2cccc46ecb59b9a5ade2db282-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:13:33 [async_llm.py:261] Added request cmpl-6e2d2ad2cccc46ecb59b9a5ade2db282-0.
INFO 03-02 00:13:34 [logger.py:42] Received request cmpl-22866217cc2e4e088f7a8200048d92a9-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:13:34 [async_llm.py:261] Added request cmpl-22866217cc2e4e088f7a8200048d92a9-0.
INFO 03-02 00:13:36 [logger.py:42] Received request cmpl-6d944ce751df4247b797834de52115a7-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:13:36 [async_llm.py:261] Added request cmpl-6d944ce751df4247b797834de52115a7-0.
INFO 03-02 00:13:37 [logger.py:42] Received request cmpl-8441b14293f642c9911aad8f34693440-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:13:37 [async_llm.py:261] Added request cmpl-8441b14293f642c9911aad8f34693440-0.
INFO 03-02 00:13:38 [loggers.py:116] Engine 000: Avg prompt throughput: 24.8 tokens/s, Avg generation throughput: 4.3 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.6%
INFO 03-02 00:13:38 [logger.py:42] Received request cmpl-c38c9910b5b34e4bbce2bfeba761259d-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:13:38 [async_llm.py:261] Added request cmpl-c38c9910b5b34e4bbce2bfeba761259d-0.
INFO 03-02 00:13:39 [logger.py:42] Received request cmpl-84c4a3e37b1e46f7b6c4da218cb52811-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:13:39 [async_llm.py:261] Added request cmpl-84c4a3e37b1e46f7b6c4da218cb52811-0.
INFO 03-02 00:13:40 [logger.py:42] Received request cmpl-f0a89bf2e8854c418d44fef46b9305f5-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:13:40 [async_llm.py:261] Added request cmpl-f0a89bf2e8854c418d44fef46b9305f5-0.
INFO 03-02 00:13:41 [logger.py:42] Received request cmpl-b45bbc53d4bb422da2499a67497baf90-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:13:41 [async_llm.py:261] Added request cmpl-b45bbc53d4bb422da2499a67497baf90-0.
INFO 03-02 00:13:42 [logger.py:42] Received request cmpl-932a0aab8ad0481885c58f40b6624afb-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:13:42 [async_llm.py:261] Added request cmpl-932a0aab8ad0481885c58f40b6624afb-0.
INFO 03-02 00:13:44 [logger.py:42] Received request cmpl-7fe6587f0dce4bebae4edd711a16c529-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:13:44 [async_llm.py:261] Added request cmpl-7fe6587f0dce4bebae4edd711a16c529-0.
INFO 03-02 00:13:45 [logger.py:42] Received request cmpl-36472e786aef4511b0fb077c51c9d5be-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:13:45 [async_llm.py:261] Added request cmpl-36472e786aef4511b0fb077c51c9d5be-0.
INFO 03-02 00:13:46 [logger.py:42] Received request cmpl-ed261f95478f49139e3070525d635579-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:13:46 [async_llm.py:261] Added request cmpl-ed261f95478f49139e3070525d635579-0.
INFO 03-02 00:13:47 [logger.py:42] Received request cmpl-8a2dc120d92d4e7e8d3b019d96c7548a-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:13:47 [async_llm.py:261] Added request cmpl-8a2dc120d92d4e7e8d3b019d96c7548a-0.
INFO 03-02 00:13:48 [loggers.py:116] Engine 000: Avg prompt throughput: 27.9 tokens/s, Avg generation throughput: 4.5 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.6%
INFO 03-02 00:13:48 [logger.py:42] Received request cmpl-844af2fcfeea4c0d86ba7dfdee0ec65e-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:13:48 [async_llm.py:261] Added request cmpl-844af2fcfeea4c0d86ba7dfdee0ec65e-0.
INFO 03-02 00:13:49 [logger.py:42] Received request cmpl-f529f79828204314b48dc36f34f40181-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:13:49 [async_llm.py:261] Added request cmpl-f529f79828204314b48dc36f34f40181-0.
INFO 03-02 00:13:51 [logger.py:42] Received request cmpl-8e8b9e49015c4d66b0d80c9185015c3b-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:13:51 [async_llm.py:261] Added request cmpl-8e8b9e49015c4d66b0d80c9185015c3b-0.
INFO 03-02 00:13:52 [logger.py:42] Received request cmpl-2204e8fb661740e1a714018d64cee042-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:13:52 [async_llm.py:261] Added request cmpl-2204e8fb661740e1a714018d64cee042-0.
INFO 03-02 00:13:53 [logger.py:42] Received request cmpl-4e43b6b783b042bcafd4605a445fa328-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:13:53 [async_llm.py:261] Added request cmpl-4e43b6b783b042bcafd4605a445fa328-0.
INFO 03-02 00:13:54 [logger.py:42] Received request cmpl-9edbc8e4e8fc41edb011792a9087eff8-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:13:54 [async_llm.py:261] Added request cmpl-9edbc8e4e8fc41edb011792a9087eff8-0.
INFO 03-02 00:13:55 [logger.py:42] Received request cmpl-8cf5f022b69446878f80b766d6cf1b61-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:13:55 [async_llm.py:261] Added request cmpl-8cf5f022b69446878f80b766d6cf1b61-0.
INFO 03-02 00:13:56 [logger.py:42] Received request cmpl-fe4184da355c4445b07fec585b3edb2f-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:13:56 [async_llm.py:261] Added request cmpl-fe4184da355c4445b07fec585b3edb2f-0.
INFO 03-02 00:13:57 [logger.py:42] Received request cmpl-cfeb8d50c93747c29873757931c687a6-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:13:57 [async_llm.py:261] Added request cmpl-cfeb8d50c93747c29873757931c687a6-0.
INFO 03-02 00:13:58 [loggers.py:116] Engine 000: Avg prompt throughput: 27.9 tokens/s, Avg generation throughput: 4.3 tokens/s, Running: 1 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.4%, Prefix cache hit rate: 51.6%
INFO 03-02 00:13:59 [logger.py:42] Received request cmpl-5792c996752944eba67a05cb59692c8e-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:13:59 [async_llm.py:261] Added request cmpl-5792c996752944eba67a05cb59692c8e-0.
INFO 03-02 00:14:00 [logger.py:42] Received request cmpl-b59ebeb097ec473e8d95e4559a3d96dd-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:14:00 [async_llm.py:261] Added request cmpl-b59ebeb097ec473e8d95e4559a3d96dd-0.
INFO 03-02 00:14:01 [logger.py:42] Received request cmpl-991d9b6713374737837825bad39f63a5-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:14:01 [async_llm.py:261] Added request cmpl-991d9b6713374737837825bad39f63a5-0.
INFO 03-02 00:14:02 [logger.py:42] Received request cmpl-de396739058b4286baf9357ee44f0767-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:14:02 [async_llm.py:261] Added request cmpl-de396739058b4286baf9357ee44f0767-0.
INFO 03-02 00:14:03 [logger.py:42] Received request cmpl-8ebb76fdd0d347b28424d692fe66f8c9-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:14:03 [async_llm.py:261] Added request cmpl-8ebb76fdd0d347b28424d692fe66f8c9-0.
INFO 03-02 00:14:04 [logger.py:42] Received request cmpl-bd7ee61840e449748ead61ddea6efc61-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:14:04 [async_llm.py:261] Added request cmpl-bd7ee61840e449748ead61ddea6efc61-0.
INFO 03-02 00:14:06 [logger.py:42] Received request cmpl-d06797e30f754821b8d363cc9bd38e52-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:14:06 [async_llm.py:261] Added request cmpl-d06797e30f754821b8d363cc9bd38e52-0.
INFO 03-02 00:14:07 [logger.py:42] Received request cmpl-6212444699d04f5ca2ef50844fe400ee-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:14:07 [async_llm.py:261] Added request cmpl-6212444699d04f5ca2ef50844fe400ee-0.
INFO 03-02 00:14:08 [loggers.py:116] Engine 000: Avg prompt throughput: 24.8 tokens/s, Avg generation throughput: 4.2 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.6%
INFO 03-02 00:14:08 [logger.py:42] Received request cmpl-2011d2c0725d430abb2f5a25ae2dd9fa-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:14:08 [async_llm.py:261] Added request cmpl-2011d2c0725d430abb2f5a25ae2dd9fa-0.
INFO 03-02 00:14:09 [logger.py:42] Received request cmpl-23f1075faadc418886ee7b51ffa32211-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:14:09 [async_llm.py:261] Added request cmpl-23f1075faadc418886ee7b51ffa32211-0.
INFO 03-02 00:14:10 [logger.py:42] Received request cmpl-3ef86e53c7c04da9aaecf953c7d2d34d-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:14:10 [async_llm.py:261] Added request cmpl-3ef86e53c7c04da9aaecf953c7d2d34d-0.
INFO 03-02 00:14:11 [logger.py:42] Received request cmpl-6fbe7e3c2c444fda9d0467f215da7c09-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:14:11 [async_llm.py:261] Added request cmpl-6fbe7e3c2c444fda9d0467f215da7c09-0.
INFO 03-02 00:14:12 [logger.py:42] Received request cmpl-050c00f49e3d437d88a3675041e91fcc-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:14:12 [async_llm.py:261] Added request cmpl-050c00f49e3d437d88a3675041e91fcc-0.
INFO 03-02 00:14:14 [logger.py:42] Received request cmpl-f111491f9ced4f06bf700fe55cf35eb8-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:14:14 [async_llm.py:261] Added request cmpl-f111491f9ced4f06bf700fe55cf35eb8-0.
INFO 03-02 00:14:15 [logger.py:42] Received request cmpl-6ecbae36530d4aa78f7e22143f14d40b-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:14:15 [async_llm.py:261] Added request cmpl-6ecbae36530d4aa78f7e22143f14d40b-0.
INFO 03-02 00:14:16 [logger.py:42] Received request cmpl-6b0554973c5b4f31bac4561e7f03f4f4-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:14:16 [async_llm.py:261] Added request cmpl-6b0554973c5b4f31bac4561e7f03f4f4-0.
INFO 03-02 00:14:17 [logger.py:42] Received request cmpl-194da37ded82424dbc379c67c63b059b-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:14:17 [async_llm.py:261] Added request cmpl-194da37ded82424dbc379c67c63b059b-0.
INFO 03-02 00:14:18 [loggers.py:116] Engine 000: Avg prompt throughput: 27.9 tokens/s, Avg generation throughput: 4.5 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.6%
INFO 03-02 00:14:18 [logger.py:42] Received request cmpl-b11a5c7fbf6649c5b2d9909bf9f8b8aa-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:14:18 [async_llm.py:261] Added request cmpl-b11a5c7fbf6649c5b2d9909bf9f8b8aa-0.
INFO 03-02 00:14:19 [logger.py:42] Received request cmpl-77404b305dac41b2a32ea32bf261a7ae-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:14:19 [async_llm.py:261] Added request cmpl-77404b305dac41b2a32ea32bf261a7ae-0.
INFO 03-02 00:14:21 [logger.py:42] Received request cmpl-e7680880f43548e889326b8eb83544d2-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:14:21 [async_llm.py:261] Added request cmpl-e7680880f43548e889326b8eb83544d2-0.
INFO 03-02 00:14:22 [logger.py:42] Received request cmpl-1764df9c0a454befadc17be26c4ca75b-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:14:22 [async_llm.py:261] Added request cmpl-1764df9c0a454befadc17be26c4ca75b-0.
INFO 03-02 00:14:23 [logger.py:42] Received request cmpl-d19799461f104c808354bed0b87b8fba-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:14:23 [async_llm.py:261] Added request cmpl-d19799461f104c808354bed0b87b8fba-0.
INFO 03-02 00:14:24 [logger.py:42] Received request cmpl-68bd62ed223549b28c0cf8159106fd8c-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:14:24 [async_llm.py:261] Added request cmpl-68bd62ed223549b28c0cf8159106fd8c-0.
INFO 03-02 00:14:25 [logger.py:42] Received request cmpl-aaec1f264f114e9d80a3ef0ad1d9dabb-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:14:25 [async_llm.py:261] Added request cmpl-aaec1f264f114e9d80a3ef0ad1d9dabb-0.
INFO 03-02 00:14:26 [logger.py:42] Received request cmpl-79c88e7a9b60443393d898c4b01df085-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:14:26 [async_llm.py:261] Added request cmpl-79c88e7a9b60443393d898c4b01df085-0.
INFO 03-02 00:14:27 [logger.py:42] Received request cmpl-fe6f6f0cde564107ba6b093bfc18f16f-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:14:27 [async_llm.py:261] Added request cmpl-fe6f6f0cde564107ba6b093bfc18f16f-0.
INFO 03-02 00:14:28 [loggers.py:116] Engine 000: Avg prompt throughput: 27.9 tokens/s, Avg generation throughput: 4.4 tokens/s, Running: 1 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.4%, Prefix cache hit rate: 51.6%
INFO 03-02 00:14:29 [logger.py:42] Received request cmpl-4ff7811fbdcc41e9b24b3e5005db5408-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:14:29 [async_llm.py:261] Added request cmpl-4ff7811fbdcc41e9b24b3e5005db5408-0.
INFO 03-02 00:14:30 [logger.py:42] Received request cmpl-04694409df194cf1968a63fdb7297799-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:14:30 [async_llm.py:261] Added request cmpl-04694409df194cf1968a63fdb7297799-0.
INFO 03-02 00:14:31 [logger.py:42] Received request cmpl-c359af96a58947498cbb0db1348c187d-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:14:31 [async_llm.py:261] Added request cmpl-c359af96a58947498cbb0db1348c187d-0.
INFO 03-02 00:14:32 [logger.py:42] Received request cmpl-41a32c035dd54f6ba43603ac1bdc9314-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:14:32 [async_llm.py:261] Added request cmpl-41a32c035dd54f6ba43603ac1bdc9314-0.
INFO 03-02 00:14:33 [logger.py:42] Received request cmpl-8a8aab5b4b2149508082c147ecaec71a-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:14:33 [async_llm.py:261] Added request cmpl-8a8aab5b4b2149508082c147ecaec71a-0.
INFO 03-02 00:14:34 [logger.py:42] Received request cmpl-dbdafb87f48a4d1e8523ec288ae7ee7b-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:14:34 [async_llm.py:261] Added request cmpl-dbdafb87f48a4d1e8523ec288ae7ee7b-0.
INFO 03-02 00:14:35 [logger.py:42] Received request cmpl-cf510fc5bfe141a79ce589514cee6ae9-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:14:35 [async_llm.py:261] Added request cmpl-cf510fc5bfe141a79ce589514cee6ae9-0.
INFO 03-02 00:14:37 [logger.py:42] Received request cmpl-1a5c838e91454460bebdf667657d12b2-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:14:37 [async_llm.py:261] Added request cmpl-1a5c838e91454460bebdf667657d12b2-0.
INFO 03-02 00:14:38 [loggers.py:116] Engine 000: Avg prompt throughput: 24.8 tokens/s, Avg generation throughput: 4.1 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.6%
INFO 03-02 00:14:38 [logger.py:42] Received request cmpl-f19a5b9627514c5f94a4e1988176166c-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:14:38 [async_llm.py:261] Added request cmpl-f19a5b9627514c5f94a4e1988176166c-0.
INFO 03-02 00:14:39 [logger.py:42] Received request cmpl-071d1130c5224091a2077d1b0d476fe1-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:14:39 [async_llm.py:261] Added request cmpl-071d1130c5224091a2077d1b0d476fe1-0.
INFO 03-02 00:14:40 [logger.py:42] Received request cmpl-14f71ddfb3ff425da9be50be5662b43e-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:14:40 [async_llm.py:261] Added request cmpl-14f71ddfb3ff425da9be50be5662b43e-0.
INFO 03-02 00:14:41 [logger.py:42] Received request cmpl-faf831c1e90f4153b2aca9a88c0fd339-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:14:41 [async_llm.py:261] Added request cmpl-faf831c1e90f4153b2aca9a88c0fd339-0.
INFO 03-02 00:14:42 [logger.py:42] Received request cmpl-5936e3fcb60048e89490a3c0e285a415-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:14:42 [async_llm.py:261] Added request cmpl-5936e3fcb60048e89490a3c0e285a415-0.
INFO 03-02 00:14:44 [logger.py:42] Received request cmpl-a1b9829d8dd6406da30df4d70344977d-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:14:44 [async_llm.py:261] Added request cmpl-a1b9829d8dd6406da30df4d70344977d-0.
INFO 03-02 00:14:45 [logger.py:42] Received request cmpl-9c9c0eb33313473783613dcf35ddbf8a-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:14:45 [async_llm.py:261] Added request cmpl-9c9c0eb33313473783613dcf35ddbf8a-0.
INFO 03-02 00:14:46 [logger.py:42] Received request cmpl-2f0495165a7b49e7b48271f402304060-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:14:46 [async_llm.py:261] Added request cmpl-2f0495165a7b49e7b48271f402304060-0.
INFO 03-02 00:14:47 [logger.py:42] Received request cmpl-e635e5e32650406eb015c31b657e4e4e-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:14:47 [async_llm.py:261] Added request cmpl-e635e5e32650406eb015c31b657e4e4e-0.
INFO 03-02 00:14:48 [loggers.py:116] Engine 000: Avg prompt throughput: 27.9 tokens/s, Avg generation throughput: 4.5 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.6%
INFO 03-02 00:14:48 [logger.py:42] Received request cmpl-835be607de974357a1c7f84afa8d5ad1-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:14:48 [async_llm.py:261] Added request cmpl-835be607de974357a1c7f84afa8d5ad1-0.
INFO 03-02 00:14:49 [logger.py:42] Received request cmpl-e75328d67a3347f085fead05714509f4-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:14:49 [async_llm.py:261] Added request cmpl-e75328d67a3347f085fead05714509f4-0.
INFO 03-02 00:14:50 [logger.py:42] Received request cmpl-1fccd947dd6c4647a55f24c3b56609bd-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:14:50 [async_llm.py:261] Added request cmpl-1fccd947dd6c4647a55f24c3b56609bd-0.
INFO 03-02 00:14:52 [logger.py:42] Received request cmpl-c5ab90016b0c4b579b803c67f773d97e-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:14:52 [async_llm.py:261] Added request cmpl-c5ab90016b0c4b579b803c67f773d97e-0.
INFO 03-02 00:14:53 [logger.py:42] Received request cmpl-3741c612367b4a1d9743ec411a39483e-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:14:53 [async_llm.py:261] Added request cmpl-3741c612367b4a1d9743ec411a39483e-0.
INFO 03-02 00:14:54 [logger.py:42] Received request cmpl-b65b7719b1334581ba33e1ae7fd0b2bd-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:14:54 [async_llm.py:261] Added request cmpl-b65b7719b1334581ba33e1ae7fd0b2bd-0.
INFO 03-02 00:14:55 [logger.py:42] Received request cmpl-e140050a4786461c9603228f78ccb6c0-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:14:55 [async_llm.py:261] Added request cmpl-e140050a4786461c9603228f78ccb6c0-0.
INFO 03-02 00:14:56 [logger.py:42] Received request cmpl-a3af3aa53f3d4b04ba5149a5d45942d3-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:14:56 [async_llm.py:261] Added request cmpl-a3af3aa53f3d4b04ba5149a5d45942d3-0.
INFO 03-02 00:14:57 [logger.py:42] Received request cmpl-6f8aa3c59c1b4f3bae5bd8fef74e5f85-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:14:57 [async_llm.py:261] Added request cmpl-6f8aa3c59c1b4f3bae5bd8fef74e5f85-0.
INFO 03-02 00:14:58 [loggers.py:116] Engine 000: Avg prompt throughput: 27.9 tokens/s, Avg generation throughput: 4.5 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.6%
INFO 03-02 00:14:59 [logger.py:42] Received request cmpl-71a4691280584e4baf19d2ad8f48f043-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:14:59 [async_llm.py:261] Added request cmpl-71a4691280584e4baf19d2ad8f48f043-0.
INFO 03-02 00:15:00 [logger.py:42] Received request cmpl-fdbece2178f84dc8a54a5577239e8375-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:15:00 [async_llm.py:261] Added request cmpl-fdbece2178f84dc8a54a5577239e8375-0.
INFO 03-02 00:15:01 [logger.py:42] Received request cmpl-6461a32479d84d2d90bccad62d423700-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:15:01 [async_llm.py:261] Added request cmpl-6461a32479d84d2d90bccad62d423700-0.
INFO 03-02 00:15:02 [logger.py:42] Received request cmpl-c8578713e1334e86afa5c1ed5f3b3b48-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:15:02 [async_llm.py:261] Added request cmpl-c8578713e1334e86afa5c1ed5f3b3b48-0.
INFO 03-02 00:15:03 [logger.py:42] Received request cmpl-a6379b6b788a4c6996cda1167f81a767-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:15:03 [async_llm.py:261] Added request cmpl-a6379b6b788a4c6996cda1167f81a767-0.
INFO 03-02 00:15:04 [logger.py:42] Received request cmpl-48d416569d6d4de0bd0a2aa924445ae4-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:15:04 [async_llm.py:261] Added request cmpl-48d416569d6d4de0bd0a2aa924445ae4-0.
INFO 03-02 00:15:05 [logger.py:42] Received request cmpl-63ecc46654364971bc8a441c215ebcc7-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:15:05 [async_llm.py:261] Added request cmpl-63ecc46654364971bc8a441c215ebcc7-0.
INFO 03-02 00:15:07 [logger.py:42] Received request cmpl-44543e809ca242e18eee85646a1268d2-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:15:07 [async_llm.py:261] Added request cmpl-44543e809ca242e18eee85646a1268d2-0.
INFO 03-02 00:15:08 [loggers.py:116] Engine 000: Avg prompt throughput: 24.8 tokens/s, Avg generation throughput: 4.0 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.6%
INFO 03-02 00:15:08 [logger.py:42] Received request cmpl-f3f16f325362458087732d1ce00b7364-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:15:08 [async_llm.py:261] Added request cmpl-f3f16f325362458087732d1ce00b7364-0.
INFO 03-02 00:15:09 [logger.py:42] Received request cmpl-1f7348f8c6db4e14b53afdcad16888cc-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:15:09 [async_llm.py:261] Added request cmpl-1f7348f8c6db4e14b53afdcad16888cc-0.
INFO 03-02 00:15:10 [logger.py:42] Received request cmpl-9dbb7bbda454402aa4d3966abeebb56a-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:15:10 [async_llm.py:261] Added request cmpl-9dbb7bbda454402aa4d3966abeebb56a-0.
INFO 03-02 00:15:11 [logger.py:42] Received request cmpl-4b63a83b1c5748b5aa9ed3c1d5d03dd6-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:15:11 [async_llm.py:261] Added request cmpl-4b63a83b1c5748b5aa9ed3c1d5d03dd6-0.
INFO 03-02 00:15:12 [logger.py:42] Received request cmpl-5f6d739aa0a1463082e3af8ef03776a7-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:15:12 [async_llm.py:261] Added request cmpl-5f6d739aa0a1463082e3af8ef03776a7-0.
INFO 03-02 00:15:14 [logger.py:42] Received request cmpl-6f91cb76cfd84bf89925fb979b5460e4-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:15:14 [async_llm.py:261] Added request cmpl-6f91cb76cfd84bf89925fb979b5460e4-0.
INFO 03-02 00:15:15 [logger.py:42] Received request cmpl-a24d26d6b66e45c793ee547647244b52-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:15:15 [async_llm.py:261] Added request cmpl-a24d26d6b66e45c793ee547647244b52-0.
INFO 03-02 00:15:16 [logger.py:42] Received request cmpl-e741df79cd9d43eb9b31650afaf731bb-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:15:16 [async_llm.py:261] Added request cmpl-e741df79cd9d43eb9b31650afaf731bb-0.
INFO 03-02 00:15:17 [logger.py:42] Received request cmpl-a9c6b42c06b046518b6da5a2b864e645-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:15:17 [async_llm.py:261] Added request cmpl-a9c6b42c06b046518b6da5a2b864e645-0.
INFO 03-02 00:15:18 [loggers.py:116] Engine 000: Avg prompt throughput: 27.9 tokens/s, Avg generation throughput: 4.5 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.6%
INFO 03-02 00:15:18 [logger.py:42] Received request cmpl-1cd049a3fd864d4592599f27c6a41e03-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:15:18 [async_llm.py:261] Added request cmpl-1cd049a3fd864d4592599f27c6a41e03-0.
INFO 03-02 00:15:19 [logger.py:42] Received request cmpl-545e062e92fb488dac8fca90eebd5e2d-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:15:19 [async_llm.py:261] Added request cmpl-545e062e92fb488dac8fca90eebd5e2d-0.
INFO 03-02 00:15:20 [logger.py:42] Received request cmpl-a2393d87a9504347864c84bb5774436b-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:15:20 [async_llm.py:261] Added request cmpl-a2393d87a9504347864c84bb5774436b-0.
INFO 03-02 00:15:22 [logger.py:42] Received request cmpl-7f2ca73cbafd403792d30ba8415cdb21-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:15:22 [async_llm.py:261] Added request cmpl-7f2ca73cbafd403792d30ba8415cdb21-0.
INFO 03-02 00:15:23 [logger.py:42] Received request cmpl-c4128f76f17e4363b5cd3afcf92024d4-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:15:23 [async_llm.py:261] Added request cmpl-c4128f76f17e4363b5cd3afcf92024d4-0.
INFO 03-02 00:15:24 [logger.py:42] Received request cmpl-1b3a107b70b64f3c8b4ac3bb61697fe3-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:15:24 [async_llm.py:261] Added request cmpl-1b3a107b70b64f3c8b4ac3bb61697fe3-0.
INFO 03-02 00:15:25 [logger.py:42] Received request cmpl-f48fbeadf5c748129e5e1265dacfaade-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:15:25 [async_llm.py:261] Added request cmpl-f48fbeadf5c748129e5e1265dacfaade-0.
INFO 03-02 00:15:26 [logger.py:42] Received request cmpl-6e31adf02d9840d18821ae4cc3de5dde-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:15:26 [async_llm.py:261] Added request cmpl-6e31adf02d9840d18821ae4cc3de5dde-0.
INFO 03-02 00:15:27 [logger.py:42] Received request cmpl-81268c0250d9403c9a0ec19b82b165c6-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:15:27 [async_llm.py:261] Added request cmpl-81268c0250d9403c9a0ec19b82b165c6-0.
INFO 03-02 00:15:28 [loggers.py:116] Engine 000: Avg prompt throughput: 27.9 tokens/s, Avg generation throughput: 4.5 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.6%
INFO 03-02 00:15:29 [logger.py:42] Received request cmpl-65a9a5bebea5408382aa5f3c4d44b017-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:15:29 [async_llm.py:261] Added request cmpl-65a9a5bebea5408382aa5f3c4d44b017-0.
INFO 03-02 00:15:30 [logger.py:42] Received request cmpl-cd5e031014e94abf802480eb47120603-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:15:30 [async_llm.py:261] Added request cmpl-cd5e031014e94abf802480eb47120603-0.
INFO 03-02 00:15:31 [logger.py:42] Received request cmpl-821c990a19844db2af3c4f56b32324a7-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:15:31 [async_llm.py:261] Added request cmpl-821c990a19844db2af3c4f56b32324a7-0.
INFO 03-02 00:15:32 [logger.py:42] Received request cmpl-bbbce159bfc9471d80a3fffd48a786cd-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:15:32 [async_llm.py:261] Added request cmpl-bbbce159bfc9471d80a3fffd48a786cd-0.
INFO 03-02 00:15:33 [logger.py:42] Received request cmpl-956acb5327da4504b09b63fec1e9e28e-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:15:33 [async_llm.py:261] Added request cmpl-956acb5327da4504b09b63fec1e9e28e-0.
INFO 03-02 00:15:34 [logger.py:42] Received request cmpl-b0afb04c486543e2916466c8fe05d402-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:15:34 [async_llm.py:261] Added request cmpl-b0afb04c486543e2916466c8fe05d402-0.
INFO 03-02 00:15:35 [logger.py:42] Received request cmpl-3ef54724e10047c49d250e746b2e09b5-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:15:35 [async_llm.py:261] Added request cmpl-3ef54724e10047c49d250e746b2e09b5-0.
INFO 03-02 00:15:37 [logger.py:42] Received request cmpl-c72c48c9ec8d49ffa5fd01dc96a2119d-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:15:37 [async_llm.py:261] Added request cmpl-c72c48c9ec8d49ffa5fd01dc96a2119d-0.
INFO 03-02 00:15:38 [loggers.py:116] Engine 000: Avg prompt throughput: 24.8 tokens/s, Avg generation throughput: 4.0 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.6%
INFO 03-02 00:15:38 [logger.py:42] Received request cmpl-eb0b139830a44f20bbd5b5edaab8f437-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:15:38 [async_llm.py:261] Added request cmpl-eb0b139830a44f20bbd5b5edaab8f437-0.
INFO 03-02 00:15:39 [logger.py:42] Received request cmpl-d9d228f57fbf4a61b2bcfbe2271f35d3-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:15:39 [async_llm.py:261] Added request cmpl-d9d228f57fbf4a61b2bcfbe2271f35d3-0.
INFO 03-02 00:15:40 [logger.py:42] Received request cmpl-9a632ec25bb44ecab83807d4b8725267-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:15:40 [async_llm.py:261] Added request cmpl-9a632ec25bb44ecab83807d4b8725267-0.
INFO 03-02 00:15:41 [logger.py:42] Received request cmpl-4d045c31b75741c491080e1741e42cb5-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:15:41 [async_llm.py:261] Added request cmpl-4d045c31b75741c491080e1741e42cb5-0.
INFO 03-02 00:15:42 [logger.py:42] Received request cmpl-1f8bd5a01c0e4e218c04fea0fa93ffcc-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:15:42 [async_llm.py:261] Added request cmpl-1f8bd5a01c0e4e218c04fea0fa93ffcc-0.
INFO 03-02 00:15:44 [logger.py:42] Received request cmpl-12618ec20cb14e51ba5ccd2466b59bc3-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:15:44 [async_llm.py:261] Added request cmpl-12618ec20cb14e51ba5ccd2466b59bc3-0.
INFO 03-02 00:15:45 [logger.py:42] Received request cmpl-79a486b101e04fc09ffb62def71b476d-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:15:45 [async_llm.py:261] Added request cmpl-79a486b101e04fc09ffb62def71b476d-0.
INFO 03-02 00:15:46 [logger.py:42] Received request cmpl-ed9d3d09bb2c49d6a03d7b88391dc3c7-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:15:46 [async_llm.py:261] Added request cmpl-ed9d3d09bb2c49d6a03d7b88391dc3c7-0.
INFO 03-02 00:15:47 [logger.py:42] Received request cmpl-3f5ec93a87134d118043e218dc7ee9d0-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:15:47 [async_llm.py:261] Added request cmpl-3f5ec93a87134d118043e218dc7ee9d0-0.
INFO 03-02 00:15:48 [loggers.py:116] Engine 000: Avg prompt throughput: 27.9 tokens/s, Avg generation throughput: 4.5 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.6%
INFO 03-02 00:15:48 [logger.py:42] Received request cmpl-c345d38dc51c44e3a9c927d9844139d7-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:15:48 [async_llm.py:261] Added request cmpl-c345d38dc51c44e3a9c927d9844139d7-0.
INFO 03-02 00:15:49 [logger.py:42] Received request cmpl-471ef74f14d8494a822fb27c937f4877-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:15:49 [async_llm.py:261] Added request cmpl-471ef74f14d8494a822fb27c937f4877-0.
INFO 03-02 00:15:50 [logger.py:42] Received request cmpl-984d7e9f62404b7cb13b42bbb1416e8e-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:15:50 [async_llm.py:261] Added request cmpl-984d7e9f62404b7cb13b42bbb1416e8e-0.
INFO 03-02 00:15:52 [logger.py:42] Received request cmpl-62e56f3ee2104720a42eaac463220b25-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:15:52 [async_llm.py:261] Added request cmpl-62e56f3ee2104720a42eaac463220b25-0.
INFO 03-02 00:15:53 [logger.py:42] Received request cmpl-9dfd8dedbb3042f899c32ce233396883-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:15:53 [async_llm.py:261] Added request cmpl-9dfd8dedbb3042f899c32ce233396883-0.
INFO 03-02 00:15:54 [logger.py:42] Received request cmpl-d3c7271be6234ac8bf504de98bd4589a-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:15:54 [async_llm.py:261] Added request cmpl-d3c7271be6234ac8bf504de98bd4589a-0.
INFO 03-02 00:15:55 [logger.py:42] Received request cmpl-5d765bdb37074c9f96c084624067ae6d-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:15:55 [async_llm.py:261] Added request cmpl-5d765bdb37074c9f96c084624067ae6d-0.
INFO 03-02 00:15:56 [logger.py:42] Received request cmpl-b8bff56afa04451f982f92a5ffaaabed-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:15:56 [async_llm.py:261] Added request cmpl-b8bff56afa04451f982f92a5ffaaabed-0.
INFO 03-02 00:15:57 [logger.py:42] Received request cmpl-a9efac1914334af2b6aeb6cc2a38cdd7-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:15:57 [async_llm.py:261] Added request cmpl-a9efac1914334af2b6aeb6cc2a38cdd7-0.
INFO 03-02 00:15:58 [loggers.py:116] Engine 000: Avg prompt throughput: 27.9 tokens/s, Avg generation throughput: 4.5 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.6%
INFO 03-02 00:15:59 [logger.py:42] Received request cmpl-2864e2b2cab441b49ea40b42defdcc92-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:15:59 [async_llm.py:261] Added request cmpl-2864e2b2cab441b49ea40b42defdcc92-0.
INFO 03-02 00:16:00 [logger.py:42] Received request cmpl-194c2a3f6aa341ac90a9631ccaee40d3-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:16:00 [async_llm.py:261] Added request cmpl-194c2a3f6aa341ac90a9631ccaee40d3-0.
INFO 03-02 00:16:01 [logger.py:42] Received request cmpl-4b89092f63c740c985e0a486ba36e0e2-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:16:01 [async_llm.py:261] Added request cmpl-4b89092f63c740c985e0a486ba36e0e2-0.
INFO 03-02 00:16:02 [logger.py:42] Received request cmpl-85f40d7f135641ff9f3acea790ca03f8-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:16:02 [async_llm.py:261] Added request cmpl-85f40d7f135641ff9f3acea790ca03f8-0.
INFO 03-02 00:16:03 [logger.py:42] Received request cmpl-29b77d85153e4fc0b6207549d7b8a903-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:16:03 [async_llm.py:261] Added request cmpl-29b77d85153e4fc0b6207549d7b8a903-0.
INFO 03-02 00:16:04 [logger.py:42] Received request cmpl-81f93267fe2b4be09b61bfd32f77b3f4-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:16:04 [async_llm.py:261] Added request cmpl-81f93267fe2b4be09b61bfd32f77b3f4-0.
INFO 03-02 00:16:05 [logger.py:42] Received request cmpl-e297be46030247b8a42c885148950557-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:16:05 [async_llm.py:261] Added request cmpl-e297be46030247b8a42c885148950557-0.
INFO 03-02 00:16:07 [logger.py:42] Received request cmpl-a03ec11104a54cd6b18af54b9896913b-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:16:07 [async_llm.py:261] Added request cmpl-a03ec11104a54cd6b18af54b9896913b-0.
INFO 03-02 00:16:08 [loggers.py:116] Engine 000: Avg prompt throughput: 24.8 tokens/s, Avg generation throughput: 4.0 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.6%
INFO 03-02 00:16:08 [logger.py:42] Received request cmpl-4c9a42dca8c7430b9494b93782eb6d21-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:16:08 [async_llm.py:261] Added request cmpl-4c9a42dca8c7430b9494b93782eb6d21-0.
INFO 03-02 00:16:09 [logger.py:42] Received request cmpl-70ac032b27704edcba64542de6f1769e-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:16:09 [async_llm.py:261] Added request cmpl-70ac032b27704edcba64542de6f1769e-0.
INFO 03-02 00:16:10 [logger.py:42] Received request cmpl-50761a66e1c34420bdb1cd826612f9ed-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:16:10 [async_llm.py:261] Added request cmpl-50761a66e1c34420bdb1cd826612f9ed-0.
INFO 03-02 00:16:11 [logger.py:42] Received request cmpl-828c83bd3bda4e059914fca064793641-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:16:11 [async_llm.py:261] Added request cmpl-828c83bd3bda4e059914fca064793641-0.
INFO 03-02 00:16:12 [logger.py:42] Received request cmpl-f0628d351bbe40a2976a273d0556b6ee-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:16:12 [async_llm.py:261] Added request cmpl-f0628d351bbe40a2976a273d0556b6ee-0.
INFO 03-02 00:16:13 [logger.py:42] Received request cmpl-27c82ed2846648d486960b23e6829b87-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:16:14 [async_llm.py:261] Added request cmpl-27c82ed2846648d486960b23e6829b87-0.
INFO 03-02 00:16:15 [logger.py:42] Received request cmpl-412acec8248c4e3bab4ac0ae52fc1acd-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:16:15 [async_llm.py:261] Added request cmpl-412acec8248c4e3bab4ac0ae52fc1acd-0.
INFO 03-02 00:16:16 [logger.py:42] Received request cmpl-4c012a7a6f1f4b90a4ed2a893e0f3e07-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:16:16 [async_llm.py:261] Added request cmpl-4c012a7a6f1f4b90a4ed2a893e0f3e07-0.
INFO 03-02 00:16:17 [logger.py:42] Received request cmpl-51ea3ad894b6416bb7c7ffa307757fc9-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:16:17 [async_llm.py:261] Added request cmpl-51ea3ad894b6416bb7c7ffa307757fc9-0.
INFO 03-02 00:16:18 [loggers.py:116] Engine 000: Avg prompt throughput: 27.9 tokens/s, Avg generation throughput: 4.5 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.6%
INFO 03-02 00:16:18 [logger.py:42] Received request cmpl-4a7b0c2e77294125b6c3c687ed700f41-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:16:18 [async_llm.py:261] Added request cmpl-4a7b0c2e77294125b6c3c687ed700f41-0.
INFO 03-02 00:16:19 [logger.py:42] Received request cmpl-805e10a037104b00a9cf4d0103143516-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:16:19 [async_llm.py:261] Added request cmpl-805e10a037104b00a9cf4d0103143516-0.
INFO 03-02 00:16:20 [logger.py:42] Received request cmpl-ef979c98dcd043abb24799957f77438a-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:16:20 [async_llm.py:261] Added request cmpl-ef979c98dcd043abb24799957f77438a-0.
INFO 03-02 00:16:22 [logger.py:42] Received request cmpl-8812a262270e4b818242c8aefe884b63-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:16:22 [async_llm.py:261] Added request cmpl-8812a262270e4b818242c8aefe884b63-0.
INFO 03-02 00:16:23 [logger.py:42] Received request cmpl-4eac032dd5f6452db2f8520a778ea1db-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:16:23 [async_llm.py:261] Added request cmpl-4eac032dd5f6452db2f8520a778ea1db-0.
INFO 03-02 00:16:24 [logger.py:42] Received request cmpl-fce48cee1e624bc7af356ffe9eb0e448-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:16:24 [async_llm.py:261] Added request cmpl-fce48cee1e624bc7af356ffe9eb0e448-0.
INFO 03-02 00:16:25 [logger.py:42] Received request cmpl-9230351f16754a48bdefdf87c817a987-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:16:25 [async_llm.py:261] Added request cmpl-9230351f16754a48bdefdf87c817a987-0.
INFO 03-02 00:16:26 [logger.py:42] Received request cmpl-d5717c0d14b74916aee26e9c0e403830-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:16:26 [async_llm.py:261] Added request cmpl-d5717c0d14b74916aee26e9c0e403830-0.
INFO 03-02 00:16:27 [logger.py:42] Received request cmpl-dfc5b7c7cd8a489fa8bec61ccefea4fc-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:16:27 [async_llm.py:261] Added request cmpl-dfc5b7c7cd8a489fa8bec61ccefea4fc-0.
INFO 03-02 00:16:28 [loggers.py:116] Engine 000: Avg prompt throughput: 27.9 tokens/s, Avg generation throughput: 4.5 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.6%
INFO 03-02 00:16:28 [logger.py:42] Received request cmpl-3829745b4af3415081004d3e442d6633-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:16:28 [async_llm.py:261] Added request cmpl-3829745b4af3415081004d3e442d6633-0.
INFO 03-02 00:16:30 [logger.py:42] Received request cmpl-177b1c4e2c424a988456eee849c623e0-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:16:30 [async_llm.py:261] Added request cmpl-177b1c4e2c424a988456eee849c623e0-0.
INFO 03-02 00:16:31 [logger.py:42] Received request cmpl-763833674dc04a6f8cba14e5e4bb4161-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:16:31 [async_llm.py:261] Added request cmpl-763833674dc04a6f8cba14e5e4bb4161-0.
INFO 03-02 00:16:32 [logger.py:42] Received request cmpl-fd75802970784ff5abf430795585ef31-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:16:32 [async_llm.py:261] Added request cmpl-fd75802970784ff5abf430795585ef31-0.
INFO 03-02 00:16:33 [logger.py:42] Received request cmpl-f276a6ea4c714d089d73cd09cdaa1d77-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:16:33 [async_llm.py:261] Added request cmpl-f276a6ea4c714d089d73cd09cdaa1d77-0.
INFO 03-02 00:16:34 [logger.py:42] Received request cmpl-901fb0d6d9e8483ba92cf1dc618fc064-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:16:34 [async_llm.py:261] Added request cmpl-901fb0d6d9e8483ba92cf1dc618fc064-0.
INFO 03-02 00:16:35 [logger.py:42] Received request cmpl-1957f8deef77435f8e1b53e4a15e0a9b-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:16:35 [async_llm.py:261] Added request cmpl-1957f8deef77435f8e1b53e4a15e0a9b-0.
INFO 03-02 00:16:37 [logger.py:42] Received request cmpl-39bbd7a4d69a44aba3aa87a34d7c767a-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:16:37 [async_llm.py:261] Added request cmpl-39bbd7a4d69a44aba3aa87a34d7c767a-0.
INFO 03-02 00:16:38 [loggers.py:116] Engine 000: Avg prompt throughput: 24.8 tokens/s, Avg generation throughput: 4.0 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.6%
INFO 03-02 00:16:38 [logger.py:42] Received request cmpl-e6911af660ca4f1393557b26e61ee167-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:16:38 [async_llm.py:261] Added request cmpl-e6911af660ca4f1393557b26e61ee167-0.
INFO 03-02 00:16:39 [logger.py:42] Received request cmpl-d996e120405a4d61aceadd9a5cf3c9b5-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:16:39 [async_llm.py:261] Added request cmpl-d996e120405a4d61aceadd9a5cf3c9b5-0.
INFO 03-02 00:16:40 [logger.py:42] Received request cmpl-c79dc0dee99048738dddf5feaee92760-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:16:40 [async_llm.py:261] Added request cmpl-c79dc0dee99048738dddf5feaee92760-0.
INFO 03-02 00:16:41 [logger.py:42] Received request cmpl-30c5629848f6454694d940fcb243bd8e-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:16:41 [async_llm.py:261] Added request cmpl-30c5629848f6454694d940fcb243bd8e-0.
INFO 03-02 00:16:42 [logger.py:42] Received request cmpl-0d261a4ed81e416f89fff5b63f914403-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:16:42 [async_llm.py:261] Added request cmpl-0d261a4ed81e416f89fff5b63f914403-0.
INFO 03-02 00:16:43 [logger.py:42] Received request cmpl-57ae2f7630a9460dbbd5bf2ae4506dd6-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:16:43 [async_llm.py:261] Added request cmpl-57ae2f7630a9460dbbd5bf2ae4506dd6-0.
INFO 03-02 00:16:45 [logger.py:42] Received request cmpl-fb8d48256481405b830a76bd48d93bc1-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:16:45 [async_llm.py:261] Added request cmpl-fb8d48256481405b830a76bd48d93bc1-0.
INFO 03-02 00:16:46 [logger.py:42] Received request cmpl-a72a76b5cf58432a8bf741355220b788-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:16:46 [async_llm.py:261] Added request cmpl-a72a76b5cf58432a8bf741355220b788-0.
INFO 03-02 00:16:47 [logger.py:42] Received request cmpl-591106cc67284c5f8640ec3a9b87784d-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:16:47 [async_llm.py:261] Added request cmpl-591106cc67284c5f8640ec3a9b87784d-0.
INFO 03-02 00:16:48 [loggers.py:116] Engine 000: Avg prompt throughput: 27.9 tokens/s, Avg generation throughput: 4.5 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.6%
INFO 03-02 00:16:48 [logger.py:42] Received request cmpl-3b8934430b7a4cc783d91009b1966f64-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:16:48 [async_llm.py:261] Added request cmpl-3b8934430b7a4cc783d91009b1966f64-0.
INFO 03-02 00:16:49 [logger.py:42] Received request cmpl-ccfbd551ad6b4b7b8226c077c0710326-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:16:49 [async_llm.py:261] Added request cmpl-ccfbd551ad6b4b7b8226c077c0710326-0.
INFO 03-02 00:16:50 [logger.py:42] Received request cmpl-65bda587d4f544fcad324a2d97497cb2-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:16:50 [async_llm.py:261] Added request cmpl-65bda587d4f544fcad324a2d97497cb2-0.
INFO 03-02 00:16:52 [logger.py:42] Received request cmpl-5dd05afadff34b479815a457fb604d46-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:16:52 [async_llm.py:261] Added request cmpl-5dd05afadff34b479815a457fb604d46-0.
INFO 03-02 00:16:53 [logger.py:42] Received request cmpl-11ed6f9fb1e7447c97b20d01da096792-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:16:53 [async_llm.py:261] Added request cmpl-11ed6f9fb1e7447c97b20d01da096792-0.
INFO 03-02 00:16:54 [logger.py:42] Received request cmpl-8e6eaf818d3d45ffb9f2ae42bc157fe1-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:16:54 [async_llm.py:261] Added request cmpl-8e6eaf818d3d45ffb9f2ae42bc157fe1-0.
INFO 03-02 00:16:55 [logger.py:42] Received request cmpl-4605cdaa2975464da70015f9e1718789-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:16:55 [async_llm.py:261] Added request cmpl-4605cdaa2975464da70015f9e1718789-0.
INFO 03-02 00:16:56 [logger.py:42] Received request cmpl-c23957c69d8e42af8ef2fc57e85031b3-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:16:56 [async_llm.py:261] Added request cmpl-c23957c69d8e42af8ef2fc57e85031b3-0.
INFO 03-02 00:16:57 [logger.py:42] Received request cmpl-a7487c134a8b43d7a9adea3fc27d6c8e-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:16:57 [async_llm.py:261] Added request cmpl-a7487c134a8b43d7a9adea3fc27d6c8e-0.
INFO 03-02 00:16:58 [loggers.py:116] Engine 000: Avg prompt throughput: 27.9 tokens/s, Avg generation throughput: 4.5 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.6%
INFO 03-02 00:16:58 [logger.py:42] Received request cmpl-3e20a911bac44771a7a8e917e8e270d7-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:16:58 [async_llm.py:261] Added request cmpl-3e20a911bac44771a7a8e917e8e270d7-0.
INFO 03-02 00:17:00 [logger.py:42] Received request cmpl-4e4caff66cb44400bcb354b1060060cc-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:17:00 [async_llm.py:261] Added request cmpl-4e4caff66cb44400bcb354b1060060cc-0.
INFO 03-02 00:17:01 [logger.py:42] Received request cmpl-ed665b79b2c1450998995c32c713f933-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:17:01 [async_llm.py:261] Added request cmpl-ed665b79b2c1450998995c32c713f933-0.
INFO 03-02 00:17:02 [logger.py:42] Received request cmpl-59999d62565e409487f06bb13b18de79-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:17:02 [async_llm.py:261] Added request cmpl-59999d62565e409487f06bb13b18de79-0.
INFO 03-02 00:17:03 [logger.py:42] Received request cmpl-467d6474b9ca4937bb59fc34aea33f33-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:17:03 [async_llm.py:261] Added request cmpl-467d6474b9ca4937bb59fc34aea33f33-0.
INFO 03-02 00:17:04 [logger.py:42] Received request cmpl-9c9c8589a4ca47c2a8ed4715302b044b-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:17:04 [async_llm.py:261] Added request cmpl-9c9c8589a4ca47c2a8ed4715302b044b-0.
INFO 03-02 00:17:05 [logger.py:42] Received request cmpl-bea6b7665663417a983db18907d906f2-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:17:05 [async_llm.py:261] Added request cmpl-bea6b7665663417a983db18907d906f2-0.
INFO 03-02 00:17:07 [logger.py:42] Received request cmpl-bea170c81d2640af823efefb9b815f7e-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:17:07 [async_llm.py:261] Added request cmpl-bea170c81d2640af823efefb9b815f7e-0.
INFO 03-02 00:17:08 [loggers.py:116] Engine 000: Avg prompt throughput: 24.8 tokens/s, Avg generation throughput: 4.0 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.6%
INFO 03-02 00:17:08 [logger.py:42] Received request cmpl-f3bd0dda23204535b40409be9f31e472-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:17:08 [async_llm.py:261] Added request cmpl-f3bd0dda23204535b40409be9f31e472-0.
INFO 03-02 00:17:09 [logger.py:42] Received request cmpl-632a9aca7317498d81c409ef8e91f81d-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:17:09 [async_llm.py:261] Added request cmpl-632a9aca7317498d81c409ef8e91f81d-0.
INFO 03-02 00:17:10 [logger.py:42] Received request cmpl-895648a0820844b5a7de964214518731-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:17:10 [async_llm.py:261] Added request cmpl-895648a0820844b5a7de964214518731-0.
INFO 03-02 00:17:11 [logger.py:42] Received request cmpl-ce2c9bcc15e3423b9ffe1e9ea9b470f4-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:17:11 [async_llm.py:261] Added request cmpl-ce2c9bcc15e3423b9ffe1e9ea9b470f4-0.
INFO 03-02 00:17:12 [logger.py:42] Received request cmpl-1687c8f965444bc692c737fefbdef85c-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:17:12 [async_llm.py:261] Added request cmpl-1687c8f965444bc692c737fefbdef85c-0.
INFO 03-02 00:17:13 [logger.py:42] Received request cmpl-30e6e6323a3c476c9d30bc2da8fb2000-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:17:13 [async_llm.py:261] Added request cmpl-30e6e6323a3c476c9d30bc2da8fb2000-0.
INFO 03-02 00:17:15 [logger.py:42] Received request cmpl-cb66aefea9644f8c83133998026c3e5a-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:17:15 [async_llm.py:261] Added request cmpl-cb66aefea9644f8c83133998026c3e5a-0.
INFO 03-02 00:17:16 [logger.py:42] Received request cmpl-9ff9c6621bcb458186d3e4b5de47dacf-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:17:16 [async_llm.py:261] Added request cmpl-9ff9c6621bcb458186d3e4b5de47dacf-0.
INFO 03-02 00:17:17 [logger.py:42] Received request cmpl-60751db05ffb4ab385d002d7336b7e0e-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:17:17 [async_llm.py:261] Added request cmpl-60751db05ffb4ab385d002d7336b7e0e-0.
INFO 03-02 00:17:18 [loggers.py:116] Engine 000: Avg prompt throughput: 27.9 tokens/s, Avg generation throughput: 4.5 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.6%
INFO 03-02 00:17:18 [logger.py:42] Received request cmpl-6806453f5f5e4fa0ae97b248067492aa-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:17:18 [async_llm.py:261] Added request cmpl-6806453f5f5e4fa0ae97b248067492aa-0.
INFO 03-02 00:17:19 [logger.py:42] Received request cmpl-40756c4ef55f4ba5b9e3b0a711f3c486-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:17:19 [async_llm.py:261] Added request cmpl-40756c4ef55f4ba5b9e3b0a711f3c486-0.
INFO 03-02 00:17:20 [logger.py:42] Received request cmpl-dc51c83173894a6d81dd318fe14efb3c-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:17:20 [async_llm.py:261] Added request cmpl-dc51c83173894a6d81dd318fe14efb3c-0.
INFO 03-02 00:17:22 [logger.py:42] Received request cmpl-4275c1b42b0241a5be2af10e807157d8-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:17:22 [async_llm.py:261] Added request cmpl-4275c1b42b0241a5be2af10e807157d8-0.
INFO 03-02 00:17:23 [logger.py:42] Received request cmpl-ef644378f55640c49d2959293747a241-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:17:23 [async_llm.py:261] Added request cmpl-ef644378f55640c49d2959293747a241-0.
INFO 03-02 00:17:24 [logger.py:42] Received request cmpl-6278f8b00d45465b8240e096ee5666fb-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:17:24 [async_llm.py:261] Added request cmpl-6278f8b00d45465b8240e096ee5666fb-0.
INFO 03-02 00:17:25 [logger.py:42] Received request cmpl-675db3e1641e4c56a61922a676e0db12-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:17:25 [async_llm.py:261] Added request cmpl-675db3e1641e4c56a61922a676e0db12-0.
INFO 03-02 00:17:26 [logger.py:42] Received request cmpl-7557ac4446be48439ad3ff6a02edbd3a-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:17:26 [async_llm.py:261] Added request cmpl-7557ac4446be48439ad3ff6a02edbd3a-0.
INFO 03-02 00:17:27 [logger.py:42] Received request cmpl-3dea295b068845cba3cf99e9cc48b1a0-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:17:27 [async_llm.py:261] Added request cmpl-3dea295b068845cba3cf99e9cc48b1a0-0.
INFO 03-02 00:17:28 [loggers.py:116] Engine 000: Avg prompt throughput: 27.9 tokens/s, Avg generation throughput: 4.5 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.6%
INFO 03-02 00:17:28 [logger.py:42] Received request cmpl-b081d4a0ac6f43afb51010d584afa90a-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:17:28 [async_llm.py:261] Added request cmpl-b081d4a0ac6f43afb51010d584afa90a-0.
INFO 03-02 00:17:30 [logger.py:42] Received request cmpl-75dc11e6d8e649849028e492475de530-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:17:30 [async_llm.py:261] Added request cmpl-75dc11e6d8e649849028e492475de530-0.
INFO 03-02 00:17:31 [logger.py:42] Received request cmpl-2e52edb2b29a48c98785bff6b265b8ce-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:17:31 [async_llm.py:261] Added request cmpl-2e52edb2b29a48c98785bff6b265b8ce-0.
INFO 03-02 00:17:32 [logger.py:42] Received request cmpl-6f54d16ef95b4b838c7ded5236553d86-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:17:32 [async_llm.py:261] Added request cmpl-6f54d16ef95b4b838c7ded5236553d86-0.
INFO 03-02 00:17:33 [logger.py:42] Received request cmpl-43441210da6f4ffa87f614a4322e2999-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:17:33 [async_llm.py:261] Added request cmpl-43441210da6f4ffa87f614a4322e2999-0.
INFO 03-02 00:17:34 [logger.py:42] Received request cmpl-10af07d425ac4f63bc777bbebb915357-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:17:34 [async_llm.py:261] Added request cmpl-10af07d425ac4f63bc777bbebb915357-0.
INFO 03-02 00:17:35 [logger.py:42] Received request cmpl-bfdd44bcf1b9424b942612938ce20e06-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:17:35 [async_llm.py:261] Added request cmpl-bfdd44bcf1b9424b942612938ce20e06-0.
INFO 03-02 00:17:37 [logger.py:42] Received request cmpl-f42397ba4eb442629ec9e35a619577c6-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:17:37 [async_llm.py:261] Added request cmpl-f42397ba4eb442629ec9e35a619577c6-0.
INFO 03-02 00:17:38 [loggers.py:116] Engine 000: Avg prompt throughput: 24.8 tokens/s, Avg generation throughput: 4.0 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.6%
INFO 03-02 00:17:38 [logger.py:42] Received request cmpl-e893e3d148bb44ebaed9bb122571dd9d-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:17:38 [async_llm.py:261] Added request cmpl-e893e3d148bb44ebaed9bb122571dd9d-0.
INFO 03-02 00:17:39 [logger.py:42] Received request cmpl-76d64c87dc3d4e109c6321fb3e6d6e04-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:17:39 [async_llm.py:261] Added request cmpl-76d64c87dc3d4e109c6321fb3e6d6e04-0.
INFO 03-02 00:17:40 [logger.py:42] Received request cmpl-13371dca49af4a9ebcd5b4fab10f2453-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:17:40 [async_llm.py:261] Added request cmpl-13371dca49af4a9ebcd5b4fab10f2453-0.
INFO 03-02 00:17:41 [logger.py:42] Received request cmpl-47879606c128444a8cecb37390a93018-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:17:41 [async_llm.py:261] Added request cmpl-47879606c128444a8cecb37390a93018-0.
INFO 03-02 00:17:42 [logger.py:42] Received request cmpl-673735b550a347c48b255a404af2458a-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:17:42 [async_llm.py:261] Added request cmpl-673735b550a347c48b255a404af2458a-0.
INFO 03-02 00:17:43 [logger.py:42] Received request cmpl-c2bf849e1fff4f1b81363b35f88529d5-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:17:43 [async_llm.py:261] Added request cmpl-c2bf849e1fff4f1b81363b35f88529d5-0.
INFO 03-02 00:17:45 [logger.py:42] Received request cmpl-4c8aefc051464e389068e1ea710d096b-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:17:45 [async_llm.py:261] Added request cmpl-4c8aefc051464e389068e1ea710d096b-0.
INFO 03-02 00:17:46 [logger.py:42] Received request cmpl-c51413228e8742ca855070335a5586f9-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:17:46 [async_llm.py:261] Added request cmpl-c51413228e8742ca855070335a5586f9-0.
INFO 03-02 00:17:47 [logger.py:42] Received request cmpl-916484f067334fb68728b217caea2497-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:17:47 [async_llm.py:261] Added request cmpl-916484f067334fb68728b217caea2497-0.
INFO 03-02 00:17:48 [loggers.py:116] Engine 000: Avg prompt throughput: 27.9 tokens/s, Avg generation throughput: 4.5 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.6%
INFO 03-02 00:17:48 [logger.py:42] Received request cmpl-7d516ec546ea4b62baf3c8e168e37dda-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:17:48 [async_llm.py:261] Added request cmpl-7d516ec546ea4b62baf3c8e168e37dda-0.
INFO 03-02 00:17:49 [logger.py:42] Received request cmpl-5512a66dedf349bfbf0d2c3c597ecf5c-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:17:49 [async_llm.py:261] Added request cmpl-5512a66dedf349bfbf0d2c3c597ecf5c-0.
INFO 03-02 00:17:50 [logger.py:42] Received request cmpl-7dc07ad0e0f2459fa1eca1d073d83847-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:17:50 [async_llm.py:261] Added request cmpl-7dc07ad0e0f2459fa1eca1d073d83847-0.
INFO 03-02 00:17:52 [logger.py:42] Received request cmpl-4079e3bca4d045b4a8bbd317db7f9983-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:17:52 [async_llm.py:261] Added request cmpl-4079e3bca4d045b4a8bbd317db7f9983-0.
INFO 03-02 00:17:53 [logger.py:42] Received request cmpl-bff17dddf0c1463bbd14383fbbd14c54-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:17:53 [async_llm.py:261] Added request cmpl-bff17dddf0c1463bbd14383fbbd14c54-0.
INFO 03-02 00:17:54 [logger.py:42] Received request cmpl-2d8ec4ef67374a63b1ba4ce7fcb7d32e-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:17:54 [async_llm.py:261] Added request cmpl-2d8ec4ef67374a63b1ba4ce7fcb7d32e-0.
INFO 03-02 00:17:55 [logger.py:42] Received request cmpl-e2f9e8f329a54c72bb7184e70d77d2f1-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:17:55 [async_llm.py:261] Added request cmpl-e2f9e8f329a54c72bb7184e70d77d2f1-0.
INFO 03-02 00:17:56 [logger.py:42] Received request cmpl-1d88d86486004761878e23db22fbf97b-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:17:56 [async_llm.py:261] Added request cmpl-1d88d86486004761878e23db22fbf97b-0.
INFO 03-02 00:17:57 [logger.py:42] Received request cmpl-30478c820ef242c4ac586b046f4d7d9a-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:17:57 [async_llm.py:261] Added request cmpl-30478c820ef242c4ac586b046f4d7d9a-0.
INFO 03-02 00:17:58 [loggers.py:116] Engine 000: Avg prompt throughput: 27.9 tokens/s, Avg generation throughput: 4.5 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.6%
INFO 03-02 00:17:58 [logger.py:42] Received request cmpl-ad2bdf03507a4a9ea4aac2d4c80bf625-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:17:58 [async_llm.py:261] Added request cmpl-ad2bdf03507a4a9ea4aac2d4c80bf625-0.
INFO 03-02 00:18:00 [logger.py:42] Received request cmpl-8c1ddbcdd853455593f0f0f3a92aaf1c-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:18:00 [async_llm.py:261] Added request cmpl-8c1ddbcdd853455593f0f0f3a92aaf1c-0.
INFO 03-02 00:18:01 [logger.py:42] Received request cmpl-48822686795040a7b10b90d866fda545-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:18:01 [async_llm.py:261] Added request cmpl-48822686795040a7b10b90d866fda545-0.
INFO 03-02 00:18:02 [logger.py:42] Received request cmpl-02c3ad4627ab498cb481291c59ca8d1b-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:18:02 [async_llm.py:261] Added request cmpl-02c3ad4627ab498cb481291c59ca8d1b-0.
INFO 03-02 00:18:03 [logger.py:42] Received request cmpl-3f98cf906d73495ab0bc08b0691f4687-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:18:03 [async_llm.py:261] Added request cmpl-3f98cf906d73495ab0bc08b0691f4687-0.
INFO 03-02 00:18:04 [logger.py:42] Received request cmpl-e812c429e0a54cfa8f32e76e2eb1c19a-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:18:04 [async_llm.py:261] Added request cmpl-e812c429e0a54cfa8f32e76e2eb1c19a-0.
INFO 03-02 00:18:05 [logger.py:42] Received request cmpl-f624bdd9d1e74281a7c792c18479d7c6-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:18:05 [async_llm.py:261] Added request cmpl-f624bdd9d1e74281a7c792c18479d7c6-0.
INFO 03-02 00:18:06 [logger.py:42] Received request cmpl-62d58e69852748289465e9571c5ea12f-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:18:06 [async_llm.py:261] Added request cmpl-62d58e69852748289465e9571c5ea12f-0.
INFO 03-02 00:18:08 [loggers.py:116] Engine 000: Avg prompt throughput: 24.8 tokens/s, Avg generation throughput: 4.0 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.6%
INFO 03-02 00:18:08 [logger.py:42] Received request cmpl-dd17773e9d76451d8ad8cf7e2a4c11a9-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:18:08 [async_llm.py:261] Added request cmpl-dd17773e9d76451d8ad8cf7e2a4c11a9-0.
INFO 03-02 00:18:09 [logger.py:42] Received request cmpl-d9f8d43b0ce3492cb796f0d84db263ab-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:18:09 [async_llm.py:261] Added request cmpl-d9f8d43b0ce3492cb796f0d84db263ab-0.
INFO 03-02 00:18:10 [logger.py:42] Received request cmpl-bf090dc5b7a64aa2ba46227e5c48ec58-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:18:10 [async_llm.py:261] Added request cmpl-bf090dc5b7a64aa2ba46227e5c48ec58-0.
INFO 03-02 00:18:11 [logger.py:42] Received request cmpl-b20fb996dc7d4a94b0cac5b25a898538-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:18:11 [async_llm.py:261] Added request cmpl-b20fb996dc7d4a94b0cac5b25a898538-0.
INFO 03-02 00:18:12 [logger.py:42] Received request cmpl-e7e365e3008c40bda67af43abc655397-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:18:12 [async_llm.py:261] Added request cmpl-e7e365e3008c40bda67af43abc655397-0.
INFO 03-02 00:18:13 [logger.py:42] Received request cmpl-6c7fa47dab4e4ca393eeed32dc136183-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:18:13 [async_llm.py:261] Added request cmpl-6c7fa47dab4e4ca393eeed32dc136183-0.
INFO 03-02 00:18:15 [logger.py:42] Received request cmpl-c274845f06f043d28a41d99d2c0a6513-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:18:15 [async_llm.py:261] Added request cmpl-c274845f06f043d28a41d99d2c0a6513-0.
INFO 03-02 00:18:16 [logger.py:42] Received request cmpl-499a754a4d09426d90e92c2c08149df7-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:18:16 [async_llm.py:261] Added request cmpl-499a754a4d09426d90e92c2c08149df7-0.
INFO 03-02 00:18:17 [logger.py:42] Received request cmpl-d68c2caefb054dd0a1ee38345f098246-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:18:17 [async_llm.py:261] Added request cmpl-d68c2caefb054dd0a1ee38345f098246-0.
INFO 03-02 00:18:18 [loggers.py:116] Engine 000: Avg prompt throughput: 27.9 tokens/s, Avg generation throughput: 4.5 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.6%
INFO 03-02 00:18:18 [logger.py:42] Received request cmpl-f1608a63eb9241a5933e9637d16bfeea-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:18:18 [async_llm.py:261] Added request cmpl-f1608a63eb9241a5933e9637d16bfeea-0.
INFO 03-02 00:18:19 [logger.py:42] Received request cmpl-28832aae26f04605992aeda8d3c7eb55-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:18:19 [async_llm.py:261] Added request cmpl-28832aae26f04605992aeda8d3c7eb55-0.
INFO 03-02 00:18:20 [logger.py:42] Received request cmpl-facb5938d8804db284d7e606ef96b5ee-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:18:20 [async_llm.py:261] Added request cmpl-facb5938d8804db284d7e606ef96b5ee-0.
INFO 03-02 00:18:21 [logger.py:42] Received request cmpl-a99c490ec268401291693b580c83c720-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:18:21 [async_llm.py:261] Added request cmpl-a99c490ec268401291693b580c83c720-0.
INFO 03-02 00:18:23 [logger.py:42] Received request cmpl-cf9e131ff4514870bde7954b200da3ef-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:18:23 [async_llm.py:261] Added request cmpl-cf9e131ff4514870bde7954b200da3ef-0.
INFO 03-02 00:18:24 [logger.py:42] Received request cmpl-deeeef01b1fd4b4f82b0ce81058980e6-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:18:24 [async_llm.py:261] Added request cmpl-deeeef01b1fd4b4f82b0ce81058980e6-0.
INFO 03-02 00:18:25 [logger.py:42] Received request cmpl-9afe37fd5d024214aafe9efe494d64c0-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:18:25 [async_llm.py:261] Added request cmpl-9afe37fd5d024214aafe9efe494d64c0-0.
INFO 03-02 00:18:26 [logger.py:42] Received request cmpl-0029707fd1ef4e739aedb9f4ab255918-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:18:26 [async_llm.py:261] Added request cmpl-0029707fd1ef4e739aedb9f4ab255918-0.
INFO 03-02 00:18:27 [logger.py:42] Received request cmpl-106b5117b8da4683a8f04bce871b46ad-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:18:27 [async_llm.py:261] Added request cmpl-106b5117b8da4683a8f04bce871b46ad-0.
INFO 03-02 00:18:28 [loggers.py:116] Engine 000: Avg prompt throughput: 27.9 tokens/s, Avg generation throughput: 4.5 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.6%
INFO 03-02 00:18:28 [logger.py:42] Received request cmpl-a311bb3ec782404a975f2eaae8dcebf8-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:18:28 [async_llm.py:261] Added request cmpl-a311bb3ec782404a975f2eaae8dcebf8-0.
INFO 03-02 00:18:30 [logger.py:42] Received request cmpl-ef1f7d5564dd46eea81ca36081bdedcd-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:18:30 [async_llm.py:261] Added request cmpl-ef1f7d5564dd46eea81ca36081bdedcd-0.
INFO 03-02 00:18:31 [logger.py:42] Received request cmpl-0c4e788443da4d20a1ee1b3e98b7e12f-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:18:31 [async_llm.py:261] Added request cmpl-0c4e788443da4d20a1ee1b3e98b7e12f-0.
INFO 03-02 00:18:32 [logger.py:42] Received request cmpl-73fe0ebe88644b7fb62383fea4b1db70-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:18:32 [async_llm.py:261] Added request cmpl-73fe0ebe88644b7fb62383fea4b1db70-0.
INFO 03-02 00:18:33 [logger.py:42] Received request cmpl-a779c2714e4c48849acc1b751aefbff4-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:18:33 [async_llm.py:261] Added request cmpl-a779c2714e4c48849acc1b751aefbff4-0.
INFO 03-02 00:18:34 [logger.py:42] Received request cmpl-5181e525221041e3a4d2fa2af2a1d879-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:18:34 [async_llm.py:261] Added request cmpl-5181e525221041e3a4d2fa2af2a1d879-0.
INFO 03-02 00:18:35 [logger.py:42] Received request cmpl-3c303118d9ff4a7eb7a3236732a63b76-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:18:35 [async_llm.py:261] Added request cmpl-3c303118d9ff4a7eb7a3236732a63b76-0.
INFO 03-02 00:18:36 [logger.py:42] Received request cmpl-74d093779b684bfc87186c8cddb2d125-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:18:36 [async_llm.py:261] Added request cmpl-74d093779b684bfc87186c8cddb2d125-0.
INFO 03-02 00:18:38 [loggers.py:116] Engine 000: Avg prompt throughput: 24.8 tokens/s, Avg generation throughput: 4.0 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.6%
INFO 03-02 00:18:38 [logger.py:42] Received request cmpl-8aba35ae24c64f4c8a361449e0969d2b-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:18:38 [async_llm.py:261] Added request cmpl-8aba35ae24c64f4c8a361449e0969d2b-0.
INFO 03-02 00:18:39 [logger.py:42] Received request cmpl-8930e915492d49f8abc71d7138a946f2-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:18:39 [async_llm.py:261] Added request cmpl-8930e915492d49f8abc71d7138a946f2-0.
INFO 03-02 00:18:40 [logger.py:42] Received request cmpl-c8ad0218d3bf473ca4875d1c57e6f736-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:18:40 [async_llm.py:261] Added request cmpl-c8ad0218d3bf473ca4875d1c57e6f736-0.
INFO 03-02 00:18:41 [logger.py:42] Received request cmpl-1e1c73423b33438e9df411a9854f61f7-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:18:41 [async_llm.py:261] Added request cmpl-1e1c73423b33438e9df411a9854f61f7-0.
INFO 03-02 00:18:42 [logger.py:42] Received request cmpl-7188bbef8a8a4bc0af423bee66928ef6-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:18:42 [async_llm.py:261] Added request cmpl-7188bbef8a8a4bc0af423bee66928ef6-0.
INFO 03-02 00:18:43 [logger.py:42] Received request cmpl-077b684404e14b4d8ea1967f4afd8c71-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:18:43 [async_llm.py:261] Added request cmpl-077b684404e14b4d8ea1967f4afd8c71-0.
INFO 03-02 00:18:45 [logger.py:42] Received request cmpl-50eea5561fb64b71800c4a727aa7f3fb-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:18:45 [async_llm.py:261] Added request cmpl-50eea5561fb64b71800c4a727aa7f3fb-0.
INFO 03-02 00:18:46 [logger.py:42] Received request cmpl-645269ebe2364d34899e134b17013f22-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:18:46 [async_llm.py:261] Added request cmpl-645269ebe2364d34899e134b17013f22-0.
INFO 03-02 00:18:47 [logger.py:42] Received request cmpl-d2aae235af404f859e2f641595fb50db-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:18:47 [async_llm.py:261] Added request cmpl-d2aae235af404f859e2f641595fb50db-0.
INFO 03-02 00:18:48 [loggers.py:116] Engine 000: Avg prompt throughput: 27.9 tokens/s, Avg generation throughput: 4.5 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.6%
INFO 03-02 00:18:48 [logger.py:42] Received request cmpl-86709674820149ee8ddbf4f62264e1ba-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:18:48 [async_llm.py:261] Added request cmpl-86709674820149ee8ddbf4f62264e1ba-0.
INFO 03-02 00:18:49 [logger.py:42] Received request cmpl-b4614224c6374e3fbbdbd94d76284004-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:18:49 [async_llm.py:261] Added request cmpl-b4614224c6374e3fbbdbd94d76284004-0.
INFO 03-02 00:18:50 [logger.py:42] Received request cmpl-4a351deae5c6447ca704dadb915528ea-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:18:50 [async_llm.py:261] Added request cmpl-4a351deae5c6447ca704dadb915528ea-0.
INFO 03-02 00:18:51 [logger.py:42] Received request cmpl-50694e161b8947f5827e36cd91ea8d84-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:18:51 [async_llm.py:261] Added request cmpl-50694e161b8947f5827e36cd91ea8d84-0.
INFO 03-02 00:18:53 [logger.py:42] Received request cmpl-cc565f6a8fad47dbb9aed71790aa49bb-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:18:53 [async_llm.py:261] Added request cmpl-cc565f6a8fad47dbb9aed71790aa49bb-0.
INFO 03-02 00:18:54 [logger.py:42] Received request cmpl-c03d05db503f455d9ea589c78f7592c9-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:18:54 [async_llm.py:261] Added request cmpl-c03d05db503f455d9ea589c78f7592c9-0.
INFO 03-02 00:18:55 [logger.py:42] Received request cmpl-ec98a365fe9748abaa6402d7f2d41fea-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:18:55 [async_llm.py:261] Added request cmpl-ec98a365fe9748abaa6402d7f2d41fea-0.
INFO 03-02 00:18:56 [logger.py:42] Received request cmpl-6e733a5bd32b422ebfc7c7ec9fbead4b-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:18:56 [async_llm.py:261] Added request cmpl-6e733a5bd32b422ebfc7c7ec9fbead4b-0.
INFO 03-02 00:18:57 [logger.py:42] Received request cmpl-0f8da5c86a2948cbab28e0daa24f7b79-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:18:57 [async_llm.py:261] Added request cmpl-0f8da5c86a2948cbab28e0daa24f7b79-0.
INFO 03-02 00:18:58 [loggers.py:116] Engine 000: Avg prompt throughput: 27.9 tokens/s, Avg generation throughput: 4.5 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.6%
INFO 03-02 00:18:58 [logger.py:42] Received request cmpl-c4328c53831341bbadb26f6cfa28791c-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:18:58 [async_llm.py:261] Added request cmpl-c4328c53831341bbadb26f6cfa28791c-0.
INFO 03-02 00:19:00 [logger.py:42] Received request cmpl-81696adecd774949a905d255fe0fc1e2-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:19:00 [async_llm.py:261] Added request cmpl-81696adecd774949a905d255fe0fc1e2-0.
INFO 03-02 00:19:01 [logger.py:42] Received request cmpl-b1cee17c611b4e9d9a3ecbca18ea9ee1-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:19:01 [async_llm.py:261] Added request cmpl-b1cee17c611b4e9d9a3ecbca18ea9ee1-0.
INFO 03-02 00:19:02 [logger.py:42] Received request cmpl-cfe5fbb11be9441c9cf287bdfb347f0f-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:19:02 [async_llm.py:261] Added request cmpl-cfe5fbb11be9441c9cf287bdfb347f0f-0.
INFO 03-02 00:19:03 [logger.py:42] Received request cmpl-67df3421ef174240871474f4a3e39e07-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:19:03 [async_llm.py:261] Added request cmpl-67df3421ef174240871474f4a3e39e07-0.
INFO 03-02 00:19:04 [logger.py:42] Received request cmpl-354c5598bf4843f289aae8a8e798050f-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:19:04 [async_llm.py:261] Added request cmpl-354c5598bf4843f289aae8a8e798050f-0.
INFO 03-02 00:19:05 [logger.py:42] Received request cmpl-ace7a063775f48aab38c4d1af06f6d6e-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:19:05 [async_llm.py:261] Added request cmpl-ace7a063775f48aab38c4d1af06f6d6e-0.
INFO 03-02 00:19:06 [logger.py:42] Received request cmpl-2e48c095ea4045fba6b3aee738363a0a-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:19:06 [async_llm.py:261] Added request cmpl-2e48c095ea4045fba6b3aee738363a0a-0.
INFO 03-02 00:19:08 [loggers.py:116] Engine 000: Avg prompt throughput: 24.8 tokens/s, Avg generation throughput: 4.0 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.6%
INFO 03-02 00:19:08 [logger.py:42] Received request cmpl-91474e5ef4204e3591cb9e335b704aa1-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:19:08 [async_llm.py:261] Added request cmpl-91474e5ef4204e3591cb9e335b704aa1-0.
INFO 03-02 00:19:09 [logger.py:42] Received request cmpl-105044b027f24a28a76e812b5fc60300-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:19:09 [async_llm.py:261] Added request cmpl-105044b027f24a28a76e812b5fc60300-0.
INFO 03-02 00:19:10 [logger.py:42] Received request cmpl-29d0cbc1ff814e0e99aff83cd088a896-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:19:10 [async_llm.py:261] Added request cmpl-29d0cbc1ff814e0e99aff83cd088a896-0.
INFO 03-02 00:19:11 [logger.py:42] Received request cmpl-4ac6e89a60694d5bbaae1a9f31b750ca-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:19:11 [async_llm.py:261] Added request cmpl-4ac6e89a60694d5bbaae1a9f31b750ca-0.
INFO 03-02 00:19:12 [logger.py:42] Received request cmpl-fdc90249edd8430085c90fabd7762b41-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:19:12 [async_llm.py:261] Added request cmpl-fdc90249edd8430085c90fabd7762b41-0.
INFO 03-02 00:19:13 [logger.py:42] Received request cmpl-4e7e02171f8849f3a127483f712fb23e-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:19:13 [async_llm.py:261] Added request cmpl-4e7e02171f8849f3a127483f712fb23e-0.
INFO 03-02 00:19:15 [logger.py:42] Received request cmpl-592d570dcfd543aba6799c63271b980e-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:19:15 [async_llm.py:261] Added request cmpl-592d570dcfd543aba6799c63271b980e-0.
INFO 03-02 00:19:16 [logger.py:42] Received request cmpl-73d303e5ede94d4e85aec9ffb6a8796a-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:19:16 [async_llm.py:261] Added request cmpl-73d303e5ede94d4e85aec9ffb6a8796a-0.
INFO 03-02 00:19:17 [logger.py:42] Received request cmpl-238136d585b94912b38d0fffc6512a48-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:19:17 [async_llm.py:261] Added request cmpl-238136d585b94912b38d0fffc6512a48-0.
INFO 03-02 00:19:18 [loggers.py:116] Engine 000: Avg prompt throughput: 27.9 tokens/s, Avg generation throughput: 4.5 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.6%
INFO 03-02 00:19:18 [logger.py:42] Received request cmpl-92de0ada6bc44fed911c07f5c7018500-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:19:18 [async_llm.py:261] Added request cmpl-92de0ada6bc44fed911c07f5c7018500-0.
INFO 03-02 00:19:19 [logger.py:42] Received request cmpl-3d24fc21f38b457a8f3ed8dd290fd030-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:19:19 [async_llm.py:261] Added request cmpl-3d24fc21f38b457a8f3ed8dd290fd030-0.
INFO 03-02 00:19:20 [logger.py:42] Received request cmpl-7748193b7fee4314a69742985e67c710-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:19:20 [async_llm.py:261] Added request cmpl-7748193b7fee4314a69742985e67c710-0.
INFO 03-02 00:19:21 [logger.py:42] Received request cmpl-a0b99cda70754d6880323ba6dba1c496-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:19:21 [async_llm.py:261] Added request cmpl-a0b99cda70754d6880323ba6dba1c496-0.
INFO 03-02 00:19:23 [logger.py:42] Received request cmpl-5107d7401e6f4c3589b6478626cb3034-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:19:23 [async_llm.py:261] Added request cmpl-5107d7401e6f4c3589b6478626cb3034-0.
INFO 03-02 00:19:24 [logger.py:42] Received request cmpl-9fc1ddc6aa104577af237eba6e13cc72-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:19:24 [async_llm.py:261] Added request cmpl-9fc1ddc6aa104577af237eba6e13cc72-0.
INFO 03-02 00:19:25 [logger.py:42] Received request cmpl-42e95395796142e78cdd68bb0d0e42eb-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:19:25 [async_llm.py:261] Added request cmpl-42e95395796142e78cdd68bb0d0e42eb-0.
INFO 03-02 00:19:26 [logger.py:42] Received request cmpl-6ccbfb00e0af400880131b3480d25f3e-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:19:26 [async_llm.py:261] Added request cmpl-6ccbfb00e0af400880131b3480d25f3e-0.
INFO 03-02 00:19:27 [logger.py:42] Received request cmpl-7bdd8ba395164fc89c568267861c7158-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:19:27 [async_llm.py:261] Added request cmpl-7bdd8ba395164fc89c568267861c7158-0.
INFO 03-02 00:19:28 [loggers.py:116] Engine 000: Avg prompt throughput: 27.9 tokens/s, Avg generation throughput: 4.5 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.6%
INFO 03-02 00:19:28 [logger.py:42] Received request cmpl-532293816b7a4be6addebb41d1af7748-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:19:28 [async_llm.py:261] Added request cmpl-532293816b7a4be6addebb41d1af7748-0.
INFO 03-02 00:19:29 [logger.py:42] Received request cmpl-6b850048717d492db7f2f9423343f4c7-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:19:29 [async_llm.py:261] Added request cmpl-6b850048717d492db7f2f9423343f4c7-0.
INFO 03-02 00:19:31 [logger.py:42] Received request cmpl-9b314913d21d4374b99a8d3adccaea75-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:19:31 [async_llm.py:261] Added request cmpl-9b314913d21d4374b99a8d3adccaea75-0.
INFO 03-02 00:19:32 [logger.py:42] Received request cmpl-1efe46fa3c1f4176b46325d6c2a62529-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:19:32 [async_llm.py:261] Added request cmpl-1efe46fa3c1f4176b46325d6c2a62529-0.
INFO 03-02 00:19:33 [logger.py:42] Received request cmpl-75f837adb4dc4f399cd9190f744c7ef2-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:19:33 [async_llm.py:261] Added request cmpl-75f837adb4dc4f399cd9190f744c7ef2-0.
INFO 03-02 00:19:34 [logger.py:42] Received request cmpl-7106daf701c54b069a8312f807f4370f-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:19:34 [async_llm.py:261] Added request cmpl-7106daf701c54b069a8312f807f4370f-0.
INFO 03-02 00:19:35 [logger.py:42] Received request cmpl-b4fceedcddaa481189ca4a469272f7ad-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:19:35 [async_llm.py:261] Added request cmpl-b4fceedcddaa481189ca4a469272f7ad-0.
INFO 03-02 00:19:36 [logger.py:42] Received request cmpl-b779c535b7c44b498a85c3d4be8906d8-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:19:36 [async_llm.py:261] Added request cmpl-b779c535b7c44b498a85c3d4be8906d8-0.
INFO 03-02 00:19:38 [logger.py:42] Received request cmpl-c4e45d4c57ca427cb50ba6dd8b2c92b7-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:19:38 [async_llm.py:261] Added request cmpl-c4e45d4c57ca427cb50ba6dd8b2c92b7-0.
INFO 03-02 00:19:38 [loggers.py:116] Engine 000: Avg prompt throughput: 24.8 tokens/s, Avg generation throughput: 4.0 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.6%
INFO 03-02 00:19:39 [logger.py:42] Received request cmpl-35ae2aa42237462881ef0264909976e7-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:19:39 [async_llm.py:261] Added request cmpl-35ae2aa42237462881ef0264909976e7-0.
INFO 03-02 00:19:40 [logger.py:42] Received request cmpl-a336040a10a148c18a8b8b839649d892-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:19:40 [async_llm.py:261] Added request cmpl-a336040a10a148c18a8b8b839649d892-0.
INFO 03-02 00:19:41 [logger.py:42] Received request cmpl-dfac5458dca64317924703df910c65e1-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:19:41 [async_llm.py:261] Added request cmpl-dfac5458dca64317924703df910c65e1-0.
INFO 03-02 00:19:42 [logger.py:42] Received request cmpl-017f7d535b93417098cc9ee2e4b3e16e-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:19:42 [async_llm.py:261] Added request cmpl-017f7d535b93417098cc9ee2e4b3e16e-0.
INFO 03-02 00:19:43 [logger.py:42] Received request cmpl-126ad399285741fb98527bbd0fee813d-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:19:43 [async_llm.py:261] Added request cmpl-126ad399285741fb98527bbd0fee813d-0.
INFO 03-02 00:19:44 [logger.py:42] Received request cmpl-e6ce403f549c4d4ca91def2d91adc8e8-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:19:44 [async_llm.py:261] Added request cmpl-e6ce403f549c4d4ca91def2d91adc8e8-0.
INFO 03-02 00:19:46 [logger.py:42] Received request cmpl-e6820f182cf4441f844453999b52ee4b-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:19:46 [async_llm.py:261] Added request cmpl-e6820f182cf4441f844453999b52ee4b-0.
INFO 03-02 00:19:47 [logger.py:42] Received request cmpl-f969d98778714c85ae5d951df076ae27-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:19:47 [async_llm.py:261] Added request cmpl-f969d98778714c85ae5d951df076ae27-0.
INFO 03-02 00:19:48 [loggers.py:116] Engine 000: Avg prompt throughput: 27.9 tokens/s, Avg generation throughput: 4.5 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.6%
INFO 03-02 00:19:48 [logger.py:42] Received request cmpl-14ef272f4d44429eaed2cb01c34e9d51-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:19:48 [async_llm.py:261] Added request cmpl-14ef272f4d44429eaed2cb01c34e9d51-0.
INFO 03-02 00:19:49 [logger.py:42] Received request cmpl-54d44bcab75f4a80aae639eee218998b-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:19:49 [async_llm.py:261] Added request cmpl-54d44bcab75f4a80aae639eee218998b-0.
INFO 03-02 00:19:50 [logger.py:42] Received request cmpl-0f060caf7d5e4bcfa0f85d9a3ca5808f-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:19:50 [async_llm.py:261] Added request cmpl-0f060caf7d5e4bcfa0f85d9a3ca5808f-0.
INFO 03-02 00:19:51 [logger.py:42] Received request cmpl-fedcbc5718b746929827e2d9ba77c6ec-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:19:51 [async_llm.py:261] Added request cmpl-fedcbc5718b746929827e2d9ba77c6ec-0.
INFO 03-02 00:19:53 [logger.py:42] Received request cmpl-9157889f4fd4486ab769cf964572a9f1-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:19:53 [async_llm.py:261] Added request cmpl-9157889f4fd4486ab769cf964572a9f1-0.
INFO 03-02 00:19:54 [logger.py:42] Received request cmpl-01d6df955c8544fd9d1b1f492f4bf090-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:19:54 [async_llm.py:261] Added request cmpl-01d6df955c8544fd9d1b1f492f4bf090-0.
INFO 03-02 00:19:55 [logger.py:42] Received request cmpl-e05090112b634909ac190e607401a5f1-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:19:55 [async_llm.py:261] Added request cmpl-e05090112b634909ac190e607401a5f1-0.
INFO 03-02 00:19:56 [logger.py:42] Received request cmpl-786e0fec3ebd41d09eef1aa159734230-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:19:56 [async_llm.py:261] Added request cmpl-786e0fec3ebd41d09eef1aa159734230-0.
INFO 03-02 00:19:57 [logger.py:42] Received request cmpl-c99374b95eed4b16a858076114c9f253-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:19:57 [async_llm.py:261] Added request cmpl-c99374b95eed4b16a858076114c9f253-0.
INFO 03-02 00:19:58 [loggers.py:116] Engine 000: Avg prompt throughput: 27.9 tokens/s, Avg generation throughput: 4.5 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.6%
INFO 03-02 00:19:58 [logger.py:42] Received request cmpl-382ad13154904fe59fc600673c5f16b1-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:19:58 [async_llm.py:261] Added request cmpl-382ad13154904fe59fc600673c5f16b1-0.
INFO 03-02 00:19:59 [logger.py:42] Received request cmpl-758090f1bc0b4973925f8dccc94416a8-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:19:59 [async_llm.py:261] Added request cmpl-758090f1bc0b4973925f8dccc94416a8-0.
INFO 03-02 00:20:01 [logger.py:42] Received request cmpl-0a0fc42ea229441eb51eedfacd57184d-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:20:01 [async_llm.py:261] Added request cmpl-0a0fc42ea229441eb51eedfacd57184d-0.
INFO 03-02 00:20:02 [logger.py:42] Received request cmpl-38cf3fd334a74695a61cc14d3baafeec-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:20:02 [async_llm.py:261] Added request cmpl-38cf3fd334a74695a61cc14d3baafeec-0.
INFO 03-02 00:20:03 [logger.py:42] Received request cmpl-ec7cbebcd21f4b26a9d6a1253e47c4e0-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:20:03 [async_llm.py:261] Added request cmpl-ec7cbebcd21f4b26a9d6a1253e47c4e0-0.
INFO 03-02 00:20:04 [logger.py:42] Received request cmpl-e5c769c5252d43ddba311ae34d76b232-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:20:04 [async_llm.py:261] Added request cmpl-e5c769c5252d43ddba311ae34d76b232-0.
INFO 03-02 00:20:05 [logger.py:42] Received request cmpl-9e7352be0dde4558bc8d81e8a1be2d6a-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:20:05 [async_llm.py:261] Added request cmpl-9e7352be0dde4558bc8d81e8a1be2d6a-0.
INFO 03-02 00:20:06 [logger.py:42] Received request cmpl-de0af974dcea4d33a2519c8c73049bd9-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:20:06 [async_llm.py:261] Added request cmpl-de0af974dcea4d33a2519c8c73049bd9-0.
INFO 03-02 00:20:08 [logger.py:42] Received request cmpl-30c7591e86754d33b4b77e35a23f7f52-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:20:08 [async_llm.py:261] Added request cmpl-30c7591e86754d33b4b77e35a23f7f52-0.
INFO 03-02 00:20:08 [loggers.py:116] Engine 000: Avg prompt throughput: 27.9 tokens/s, Avg generation throughput: 4.1 tokens/s, Running: 1 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.3%, Prefix cache hit rate: 51.6%
INFO 03-02 00:20:09 [logger.py:42] Received request cmpl-3a8ed64017894b27b4038d355f129d41-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:20:09 [async_llm.py:261] Added request cmpl-3a8ed64017894b27b4038d355f129d41-0.
INFO 03-02 00:20:10 [logger.py:42] Received request cmpl-9d05d4e23824449289561cb87ab1c849-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:20:10 [async_llm.py:261] Added request cmpl-9d05d4e23824449289561cb87ab1c849-0.
INFO 03-02 00:20:11 [logger.py:42] Received request cmpl-2cc554c800bf41d5a2e23265af894efc-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:20:11 [async_llm.py:261] Added request cmpl-2cc554c800bf41d5a2e23265af894efc-0.
INFO 03-02 00:20:12 [logger.py:42] Received request cmpl-aa6f22f687854e84ac6709c339be5458-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:20:12 [async_llm.py:261] Added request cmpl-aa6f22f687854e84ac6709c339be5458-0.
INFO 03-02 00:20:13 [logger.py:42] Received request cmpl-fa67881ffd8d4eaba8ad2d46035f9784-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:20:13 [async_llm.py:261] Added request cmpl-fa67881ffd8d4eaba8ad2d46035f9784-0.
INFO 03-02 00:20:14 [logger.py:42] Received request cmpl-d6a5c05f7edc481eac61fac629b0f7e8-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:20:14 [async_llm.py:261] Added request cmpl-d6a5c05f7edc481eac61fac629b0f7e8-0.
INFO 03-02 00:20:16 [logger.py:42] Received request cmpl-9f93d699a7824ba8acecd2bdeb049f69-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:20:16 [async_llm.py:261] Added request cmpl-9f93d699a7824ba8acecd2bdeb049f69-0.
INFO 03-02 00:20:17 [logger.py:42] Received request cmpl-4c40d553e2334b55b829c5dbda977da5-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:20:17 [async_llm.py:261] Added request cmpl-4c40d553e2334b55b829c5dbda977da5-0.
INFO 03-02 00:20:18 [loggers.py:116] Engine 000: Avg prompt throughput: 24.8 tokens/s, Avg generation throughput: 4.4 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.6%
INFO 03-02 00:20:18 [logger.py:42] Received request cmpl-2caf886d248748ae946cc0fbd137122c-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:20:18 [async_llm.py:261] Added request cmpl-2caf886d248748ae946cc0fbd137122c-0.
INFO 03-02 00:20:19 [logger.py:42] Received request cmpl-b3abf6026392498184ea95f191bf36ef-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:20:19 [async_llm.py:261] Added request cmpl-b3abf6026392498184ea95f191bf36ef-0.
INFO 03-02 00:20:20 [logger.py:42] Received request cmpl-1cac584060f94770ad73f2e78628465e-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:20:20 [async_llm.py:261] Added request cmpl-1cac584060f94770ad73f2e78628465e-0.
INFO 03-02 00:20:21 [logger.py:42] Received request cmpl-a9601aa86b9e42fd9cb21029719caf4d-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:20:21 [async_llm.py:261] Added request cmpl-a9601aa86b9e42fd9cb21029719caf4d-0.
INFO 03-02 00:20:23 [logger.py:42] Received request cmpl-69c9a88ff5b4454096cec1adffc5abfe-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:20:23 [async_llm.py:261] Added request cmpl-69c9a88ff5b4454096cec1adffc5abfe-0.
INFO 03-02 00:20:24 [logger.py:42] Received request cmpl-84823e7ebc2741048c24e5485607835c-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:20:24 [async_llm.py:261] Added request cmpl-84823e7ebc2741048c24e5485607835c-0.
INFO 03-02 00:20:25 [logger.py:42] Received request cmpl-52eec99cfb5d43ef8372ba4121760089-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:20:25 [async_llm.py:261] Added request cmpl-52eec99cfb5d43ef8372ba4121760089-0.
INFO 03-02 00:20:26 [logger.py:42] Received request cmpl-bac42f18175341259a7a4617d2e60b05-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:20:26 [async_llm.py:261] Added request cmpl-bac42f18175341259a7a4617d2e60b05-0.
INFO 03-02 00:20:27 [logger.py:42] Received request cmpl-7c3f38718c95481e9df06e71b85d25b7-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:20:27 [async_llm.py:261] Added request cmpl-7c3f38718c95481e9df06e71b85d25b7-0.
INFO 03-02 00:20:28 [loggers.py:116] Engine 000: Avg prompt throughput: 27.9 tokens/s, Avg generation throughput: 4.5 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.6%
INFO 03-02 00:20:28 [logger.py:42] Received request cmpl-587bf58b4a75400aa54d774c238af225-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:20:28 [async_llm.py:261] Added request cmpl-587bf58b4a75400aa54d774c238af225-0.
INFO 03-02 00:20:29 [logger.py:42] Received request cmpl-4c9d904b35b346a385c7726558ba2142-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:20:29 [async_llm.py:261] Added request cmpl-4c9d904b35b346a385c7726558ba2142-0.
INFO 03-02 00:20:31 [logger.py:42] Received request cmpl-ed34d9d2a08c47038bc38f7dc4dee5da-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:20:31 [async_llm.py:261] Added request cmpl-ed34d9d2a08c47038bc38f7dc4dee5da-0.
INFO 03-02 00:20:32 [logger.py:42] Received request cmpl-4ff348c110a34e4d996171736e116293-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:20:32 [async_llm.py:261] Added request cmpl-4ff348c110a34e4d996171736e116293-0.
INFO 03-02 00:20:33 [logger.py:42] Received request cmpl-b5f308a78b9f41a6833727dc8aa08c9e-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:20:33 [async_llm.py:261] Added request cmpl-b5f308a78b9f41a6833727dc8aa08c9e-0.
INFO 03-02 00:20:34 [logger.py:42] Received request cmpl-5e5226956dba4c3a90df101b2a37b8c7-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:20:34 [async_llm.py:261] Added request cmpl-5e5226956dba4c3a90df101b2a37b8c7-0.
INFO 03-02 00:20:35 [logger.py:42] Received request cmpl-5df6a338cdde47abaffe73f83b8889ac-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:20:35 [async_llm.py:261] Added request cmpl-5df6a338cdde47abaffe73f83b8889ac-0.
INFO 03-02 00:20:36 [logger.py:42] Received request cmpl-6ad0160e06e64e34a0ac67145f46902b-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:20:36 [async_llm.py:261] Added request cmpl-6ad0160e06e64e34a0ac67145f46902b-0.
INFO 03-02 00:20:38 [logger.py:42] Received request cmpl-d8acabc394b14f8e89f28d514c1f002c-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:20:38 [async_llm.py:261] Added request cmpl-d8acabc394b14f8e89f28d514c1f002c-0.
INFO 03-02 00:20:38 [loggers.py:116] Engine 000: Avg prompt throughput: 27.9 tokens/s, Avg generation throughput: 4.2 tokens/s, Running: 1 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.3%, Prefix cache hit rate: 51.6%
INFO 03-02 00:20:39 [logger.py:42] Received request cmpl-b3578aa6fe554586ac2d0c51582edcd6-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:20:39 [async_llm.py:261] Added request cmpl-b3578aa6fe554586ac2d0c51582edcd6-0.
INFO 03-02 00:20:40 [logger.py:42] Received request cmpl-56024776f18b42cd925e273a94ada46b-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:20:40 [async_llm.py:261] Added request cmpl-56024776f18b42cd925e273a94ada46b-0.
INFO 03-02 00:20:41 [logger.py:42] Received request cmpl-4390f96611484d039d981deadb285711-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:20:41 [async_llm.py:261] Added request cmpl-4390f96611484d039d981deadb285711-0.
INFO 03-02 00:20:42 [logger.py:42] Received request cmpl-8a621061f5fa487594654827cb42866e-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:20:42 [async_llm.py:261] Added request cmpl-8a621061f5fa487594654827cb42866e-0.
INFO 03-02 00:20:43 [logger.py:42] Received request cmpl-dd207147c1d44de2b7f5228cce7b9c58-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:20:43 [async_llm.py:261] Added request cmpl-dd207147c1d44de2b7f5228cce7b9c58-0.
INFO 03-02 00:20:44 [logger.py:42] Received request cmpl-cbbb0e7615e0437c8202f9f199b549c1-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:20:44 [async_llm.py:261] Added request cmpl-cbbb0e7615e0437c8202f9f199b549c1-0.
INFO 03-02 00:20:46 [logger.py:42] Received request cmpl-c8667403ebc04c2096cc4e24054d20ec-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:20:46 [async_llm.py:261] Added request cmpl-c8667403ebc04c2096cc4e24054d20ec-0.
INFO 03-02 00:20:47 [logger.py:42] Received request cmpl-12c8fab9860f457db8ccc8ade256b068-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:20:47 [async_llm.py:261] Added request cmpl-12c8fab9860f457db8ccc8ade256b068-0.
INFO 03-02 00:20:48 [loggers.py:116] Engine 000: Avg prompt throughput: 24.8 tokens/s, Avg generation throughput: 4.3 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.6%
INFO 03-02 00:20:48 [logger.py:42] Received request cmpl-3044930296f54e42a9d2ca227927a9d6-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:20:48 [async_llm.py:261] Added request cmpl-3044930296f54e42a9d2ca227927a9d6-0.
INFO 03-02 00:20:49 [logger.py:42] Received request cmpl-0d587210a92c4dab94283dc5252d23a4-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:20:49 [async_llm.py:261] Added request cmpl-0d587210a92c4dab94283dc5252d23a4-0.
INFO 03-02 00:20:50 [logger.py:42] Received request cmpl-349706efa94b4de392e7554eb8f7c176-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:20:50 [async_llm.py:261] Added request cmpl-349706efa94b4de392e7554eb8f7c176-0.
INFO 03-02 00:20:51 [logger.py:42] Received request cmpl-121d2ce4d58f4118b1ce26cf88e6a91c-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:20:51 [async_llm.py:261] Added request cmpl-121d2ce4d58f4118b1ce26cf88e6a91c-0.
INFO 03-02 00:20:53 [logger.py:42] Received request cmpl-9e01819e94ba486c9ac170ea37eab88c-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:20:53 [async_llm.py:261] Added request cmpl-9e01819e94ba486c9ac170ea37eab88c-0.
INFO 03-02 00:20:54 [logger.py:42] Received request cmpl-b7fa2890a6a6431492de1bfdc40dcb51-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:20:54 [async_llm.py:261] Added request cmpl-b7fa2890a6a6431492de1bfdc40dcb51-0.
INFO 03-02 00:20:55 [logger.py:42] Received request cmpl-364c279ab9814bca8e2da819d5ab825d-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:20:55 [async_llm.py:261] Added request cmpl-364c279ab9814bca8e2da819d5ab825d-0.
INFO 03-02 00:20:56 [logger.py:42] Received request cmpl-b2d2d85f60d54385ad9cd9e094e0a6af-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:20:56 [async_llm.py:261] Added request cmpl-b2d2d85f60d54385ad9cd9e094e0a6af-0.
INFO 03-02 00:20:57 [logger.py:42] Received request cmpl-2a203bd528814ab4afc1d8f9875bed9e-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:20:57 [async_llm.py:261] Added request cmpl-2a203bd528814ab4afc1d8f9875bed9e-0.
INFO 03-02 00:20:58 [loggers.py:116] Engine 000: Avg prompt throughput: 27.9 tokens/s, Avg generation throughput: 4.5 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.6%
INFO 03-02 00:20:58 [logger.py:42] Received request cmpl-34a898f76a09473e8b1396fcb2693689-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:20:58 [async_llm.py:261] Added request cmpl-34a898f76a09473e8b1396fcb2693689-0.
INFO 03-02 00:20:59 [logger.py:42] Received request cmpl-14cf203ebebf44e486fa31cbf12b462b-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:20:59 [async_llm.py:261] Added request cmpl-14cf203ebebf44e486fa31cbf12b462b-0.
INFO 03-02 00:21:01 [logger.py:42] Received request cmpl-6ba137e202be4820bbe80eef12ce4306-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:21:01 [async_llm.py:261] Added request cmpl-6ba137e202be4820bbe80eef12ce4306-0.
INFO 03-02 00:21:02 [logger.py:42] Received request cmpl-3879b845b4444516b1872aee8966b472-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:21:02 [async_llm.py:261] Added request cmpl-3879b845b4444516b1872aee8966b472-0.
INFO 03-02 00:21:03 [logger.py:42] Received request cmpl-dc499d7407204fc694f08cfa8206fa03-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:21:03 [async_llm.py:261] Added request cmpl-dc499d7407204fc694f08cfa8206fa03-0.
INFO 03-02 00:21:04 [logger.py:42] Received request cmpl-2eaa4ab664814a8eb5c746f2578ecd9a-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:21:04 [async_llm.py:261] Added request cmpl-2eaa4ab664814a8eb5c746f2578ecd9a-0.
INFO 03-02 00:21:05 [logger.py:42] Received request cmpl-3bc662ad9e754bb49059107830eaf266-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:21:05 [async_llm.py:261] Added request cmpl-3bc662ad9e754bb49059107830eaf266-0.
INFO 03-02 00:21:06 [logger.py:42] Received request cmpl-1499188861bc4d379ade7be1da734141-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:21:06 [async_llm.py:261] Added request cmpl-1499188861bc4d379ade7be1da734141-0.
INFO 03-02 00:21:08 [logger.py:42] Received request cmpl-a68144505ef54d81bae329a3e3883a78-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:21:08 [async_llm.py:261] Added request cmpl-a68144505ef54d81bae329a3e3883a78-0.
INFO 03-02 00:21:08 [loggers.py:116] Engine 000: Avg prompt throughput: 27.9 tokens/s, Avg generation throughput: 4.2 tokens/s, Running: 1 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.3%, Prefix cache hit rate: 51.6%
INFO 03-02 00:21:09 [logger.py:42] Received request cmpl-ae652ee4a9874f4d884206e7ef4c32d1-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:21:09 [async_llm.py:261] Added request cmpl-ae652ee4a9874f4d884206e7ef4c32d1-0.
INFO 03-02 00:21:10 [logger.py:42] Received request cmpl-b1d2ae1925c9452a87078c9a893384c5-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:21:10 [async_llm.py:261] Added request cmpl-b1d2ae1925c9452a87078c9a893384c5-0.
INFO 03-02 00:21:11 [logger.py:42] Received request cmpl-e6d2ff07888148bdafd7a7fddf2d75ce-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:21:11 [async_llm.py:261] Added request cmpl-e6d2ff07888148bdafd7a7fddf2d75ce-0.
INFO 03-02 00:21:12 [logger.py:42] Received request cmpl-8f8b71b551b64027b52e9da8c3f7772c-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:21:12 [async_llm.py:261] Added request cmpl-8f8b71b551b64027b52e9da8c3f7772c-0.
INFO 03-02 00:21:13 [logger.py:42] Received request cmpl-369aae51b0ce4360be812ef45859501a-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:21:13 [async_llm.py:261] Added request cmpl-369aae51b0ce4360be812ef45859501a-0.
INFO 03-02 00:21:14 [logger.py:42] Received request cmpl-44bb7bc9af7d4595af5248cdb1627e5e-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:21:14 [async_llm.py:261] Added request cmpl-44bb7bc9af7d4595af5248cdb1627e5e-0.
INFO 03-02 00:21:16 [logger.py:42] Received request cmpl-a2db5c9f0e974687a0cd821b3e299e32-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:21:16 [async_llm.py:261] Added request cmpl-a2db5c9f0e974687a0cd821b3e299e32-0.
INFO 03-02 00:21:17 [logger.py:42] Received request cmpl-131c20b4bb624be0ae8947d44122e834-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:21:17 [async_llm.py:261] Added request cmpl-131c20b4bb624be0ae8947d44122e834-0.
INFO 03-02 00:21:18 [loggers.py:116] Engine 000: Avg prompt throughput: 24.8 tokens/s, Avg generation throughput: 4.3 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.6%
INFO 03-02 00:21:18 [logger.py:42] Received request cmpl-110815defe2e4dbdbf54836d4e1ee9fd-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:21:18 [async_llm.py:261] Added request cmpl-110815defe2e4dbdbf54836d4e1ee9fd-0.
INFO 03-02 00:21:19 [logger.py:42] Received request cmpl-be1d0b9f95b4408b8923ddc741eb44e8-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:21:19 [async_llm.py:261] Added request cmpl-be1d0b9f95b4408b8923ddc741eb44e8-0.
INFO 03-02 00:21:20 [logger.py:42] Received request cmpl-b3a5dcbedaba4ef7a7a23a40302c0dfa-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:21:20 [async_llm.py:261] Added request cmpl-b3a5dcbedaba4ef7a7a23a40302c0dfa-0.
INFO 03-02 00:21:21 [logger.py:42] Received request cmpl-94ade42e656a4e289f8b850b39060951-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:21:21 [async_llm.py:261] Added request cmpl-94ade42e656a4e289f8b850b39060951-0.
INFO 03-02 00:21:22 [logger.py:42] Received request cmpl-d084fc763a224a0e80a2720139a87154-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:21:22 [async_llm.py:261] Added request cmpl-d084fc763a224a0e80a2720139a87154-0.
INFO 03-02 00:21:24 [logger.py:42] Received request cmpl-d45d3d685a90403495e4bf3a4e5ce4cc-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:21:24 [async_llm.py:261] Added request cmpl-d45d3d685a90403495e4bf3a4e5ce4cc-0.
INFO 03-02 00:21:25 [logger.py:42] Received request cmpl-d931534f89914abd80c7028f29044e25-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:21:25 [async_llm.py:261] Added request cmpl-d931534f89914abd80c7028f29044e25-0.
INFO 03-02 00:21:26 [logger.py:42] Received request cmpl-a666f0625d954967856f528e8ba5d9ac-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:21:26 [async_llm.py:261] Added request cmpl-a666f0625d954967856f528e8ba5d9ac-0.
INFO 03-02 00:21:27 [logger.py:42] Received request cmpl-174daf386e564b1f8a7c8c0d1322f94b-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:21:27 [async_llm.py:261] Added request cmpl-174daf386e564b1f8a7c8c0d1322f94b-0.
INFO 03-02 00:21:28 [loggers.py:116] Engine 000: Avg prompt throughput: 27.9 tokens/s, Avg generation throughput: 4.5 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.6%
INFO 03-02 00:21:28 [logger.py:42] Received request cmpl-f675b97eb75e4ced9f99f0e81b32737a-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:21:28 [async_llm.py:261] Added request cmpl-f675b97eb75e4ced9f99f0e81b32737a-0.
INFO 03-02 00:21:29 [logger.py:42] Received request cmpl-6dc1614858b9420db8b96da17c9cbf14-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:21:29 [async_llm.py:261] Added request cmpl-6dc1614858b9420db8b96da17c9cbf14-0.
INFO 03-02 00:21:31 [logger.py:42] Received request cmpl-4e668d2a9f894fbcac7686aa968d4c26-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:21:31 [async_llm.py:261] Added request cmpl-4e668d2a9f894fbcac7686aa968d4c26-0.
INFO 03-02 00:21:32 [logger.py:42] Received request cmpl-fd721503fd9e4ff7982d954d6d805834-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:21:32 [async_llm.py:261] Added request cmpl-fd721503fd9e4ff7982d954d6d805834-0.
INFO 03-02 00:21:33 [logger.py:42] Received request cmpl-9c68a9d753e247029ba467e6a5fea3dd-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:21:33 [async_llm.py:261] Added request cmpl-9c68a9d753e247029ba467e6a5fea3dd-0.
INFO 03-02 00:21:34 [logger.py:42] Received request cmpl-bc1f79310922496e9b0167ad730159b5-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:21:34 [async_llm.py:261] Added request cmpl-bc1f79310922496e9b0167ad730159b5-0.
INFO 03-02 00:21:35 [logger.py:42] Received request cmpl-6b884a91f51b413ea87a7b634addf66c-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:21:35 [async_llm.py:261] Added request cmpl-6b884a91f51b413ea87a7b634addf66c-0.
INFO 03-02 00:21:36 [logger.py:42] Received request cmpl-ba04fe72a258404c98b5272ce85e7ee7-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:21:36 [async_llm.py:261] Added request cmpl-ba04fe72a258404c98b5272ce85e7ee7-0.
INFO 03-02 00:21:37 [logger.py:42] Received request cmpl-fbf0ec93e4ea4cad96b62f09749bba3f-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:21:37 [async_llm.py:261] Added request cmpl-fbf0ec93e4ea4cad96b62f09749bba3f-0.
INFO 03-02 00:21:38 [loggers.py:116] Engine 000: Avg prompt throughput: 27.9 tokens/s, Avg generation throughput: 4.3 tokens/s, Running: 1 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.4%, Prefix cache hit rate: 51.6%
INFO 03-02 00:21:39 [logger.py:42] Received request cmpl-42db36a93d814cc6a3380054752b9e20-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:21:39 [async_llm.py:261] Added request cmpl-42db36a93d814cc6a3380054752b9e20-0.
INFO 03-02 00:21:40 [logger.py:42] Received request cmpl-4fcdaf1cb6c740358786ab227d9e415d-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:21:40 [async_llm.py:261] Added request cmpl-4fcdaf1cb6c740358786ab227d9e415d-0.
INFO 03-02 00:21:41 [logger.py:42] Received request cmpl-1da770e09bc64ec4b080e788ad356ba9-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:21:41 [async_llm.py:261] Added request cmpl-1da770e09bc64ec4b080e788ad356ba9-0.
INFO 03-02 00:21:42 [logger.py:42] Received request cmpl-f8135b0b571c425e8db26be028ceb841-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:21:42 [async_llm.py:261] Added request cmpl-f8135b0b571c425e8db26be028ceb841-0.
INFO 03-02 00:21:43 [logger.py:42] Received request cmpl-f127fae9a772449ca6a62d043a51852a-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:21:43 [async_llm.py:261] Added request cmpl-f127fae9a772449ca6a62d043a51852a-0.
INFO 03-02 00:21:44 [logger.py:42] Received request cmpl-063c98d6971f4550b9add51987b37289-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:21:44 [async_llm.py:261] Added request cmpl-063c98d6971f4550b9add51987b37289-0.
INFO 03-02 00:21:46 [logger.py:42] Received request cmpl-9ddf2fe1824a43358260b71c0a1066c7-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:21:46 [async_llm.py:261] Added request cmpl-9ddf2fe1824a43358260b71c0a1066c7-0.
INFO 03-02 00:21:47 [logger.py:42] Received request cmpl-c702fa513f12480495a0078af15a26e1-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:21:47 [async_llm.py:261] Added request cmpl-c702fa513f12480495a0078af15a26e1-0.
INFO 03-02 00:21:48 [loggers.py:116] Engine 000: Avg prompt throughput: 24.8 tokens/s, Avg generation throughput: 4.2 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.6%
INFO 03-02 00:21:48 [logger.py:42] Received request cmpl-a902cdf89d434a4b9200a8db7db788c0-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:21:48 [async_llm.py:261] Added request cmpl-a902cdf89d434a4b9200a8db7db788c0-0.
INFO 03-02 00:21:49 [logger.py:42] Received request cmpl-52f80850855447139f9bb508f382d590-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:21:49 [async_llm.py:261] Added request cmpl-52f80850855447139f9bb508f382d590-0.
INFO 03-02 00:21:50 [logger.py:42] Received request cmpl-13382bfa213943c1a315d3bc6f828caa-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:21:50 [async_llm.py:261] Added request cmpl-13382bfa213943c1a315d3bc6f828caa-0.
INFO 03-02 00:21:51 [logger.py:42] Received request cmpl-6e0afd718f6a4aebb4aca60e5ee06e4a-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:21:51 [async_llm.py:261] Added request cmpl-6e0afd718f6a4aebb4aca60e5ee06e4a-0.
INFO 03-02 00:21:52 [logger.py:42] Received request cmpl-192001d12ea740cebc5e332991061b3c-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:21:52 [async_llm.py:261] Added request cmpl-192001d12ea740cebc5e332991061b3c-0.
INFO 03-02 00:21:54 [logger.py:42] Received request cmpl-933c402d29d94835a6176c5871f306cf-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:21:54 [async_llm.py:261] Added request cmpl-933c402d29d94835a6176c5871f306cf-0.
INFO 03-02 00:21:55 [logger.py:42] Received request cmpl-f1ebe72c91e3436ab009dada1607bd9e-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:21:55 [async_llm.py:261] Added request cmpl-f1ebe72c91e3436ab009dada1607bd9e-0.
INFO 03-02 00:21:56 [logger.py:42] Received request cmpl-e0f3e9dcb0e943c3bda97744c5c1a7a8-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:21:56 [async_llm.py:261] Added request cmpl-e0f3e9dcb0e943c3bda97744c5c1a7a8-0.
INFO 03-02 00:21:57 [logger.py:42] Received request cmpl-2608a15a865f4617b5d8906e2706d4c6-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:21:57 [async_llm.py:261] Added request cmpl-2608a15a865f4617b5d8906e2706d4c6-0.
INFO 03-02 00:21:58 [loggers.py:116] Engine 000: Avg prompt throughput: 27.9 tokens/s, Avg generation throughput: 4.5 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.6%
INFO 03-02 00:21:58 [logger.py:42] Received request cmpl-07104a5d4d4e40098b11e601a8895a18-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:21:58 [async_llm.py:261] Added request cmpl-07104a5d4d4e40098b11e601a8895a18-0.
INFO 03-02 00:21:59 [logger.py:42] Received request cmpl-3741425233d9443d925c127cbdb40dd0-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:21:59 [async_llm.py:261] Added request cmpl-3741425233d9443d925c127cbdb40dd0-0.
INFO 03-02 00:22:01 [logger.py:42] Received request cmpl-21ac502bf3474b0080a08204e2e3d909-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:22:01 [async_llm.py:261] Added request cmpl-21ac502bf3474b0080a08204e2e3d909-0.
INFO 03-02 00:22:02 [logger.py:42] Received request cmpl-67b2db93c1e14e05b75a9f3005ce2e5a-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:22:02 [async_llm.py:261] Added request cmpl-67b2db93c1e14e05b75a9f3005ce2e5a-0.
INFO 03-02 00:22:03 [logger.py:42] Received request cmpl-c5c0c8947ba542e39db4188d447975ef-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:22:03 [async_llm.py:261] Added request cmpl-c5c0c8947ba542e39db4188d447975ef-0.
INFO 03-02 00:22:04 [logger.py:42] Received request cmpl-ebafa0fcc6fe4cb7854c68e8f74ffeb3-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:22:04 [async_llm.py:261] Added request cmpl-ebafa0fcc6fe4cb7854c68e8f74ffeb3-0.
INFO 03-02 00:22:05 [logger.py:42] Received request cmpl-e9d787150c6e441cb80e8ff7ac8099aa-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:22:05 [async_llm.py:261] Added request cmpl-e9d787150c6e441cb80e8ff7ac8099aa-0.
INFO 03-02 00:22:06 [logger.py:42] Received request cmpl-e92f772162a84f51aca1b6a88b59e32c-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:22:06 [async_llm.py:261] Added request cmpl-e92f772162a84f51aca1b6a88b59e32c-0.
INFO 03-02 00:22:07 [logger.py:42] Received request cmpl-85828314671943dd8bb5337c05a9354f-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:22:07 [async_llm.py:261] Added request cmpl-85828314671943dd8bb5337c05a9354f-0.
INFO 03-02 00:22:08 [loggers.py:116] Engine 000: Avg prompt throughput: 27.9 tokens/s, Avg generation throughput: 4.3 tokens/s, Running: 1 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.4%, Prefix cache hit rate: 51.6%
INFO 03-02 00:22:09 [logger.py:42] Received request cmpl-ac7e2f2d0258470bab13a1d746552cc6-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:22:09 [async_llm.py:261] Added request cmpl-ac7e2f2d0258470bab13a1d746552cc6-0.
INFO 03-02 00:22:10 [logger.py:42] Received request cmpl-f1b5f37f5a334fbb8ae10fabcba58893-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:22:10 [async_llm.py:261] Added request cmpl-f1b5f37f5a334fbb8ae10fabcba58893-0.
INFO 03-02 00:22:11 [logger.py:42] Received request cmpl-00d5eaa695c54f4dbf40cfd03ee5dd1f-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:22:11 [async_llm.py:261] Added request cmpl-00d5eaa695c54f4dbf40cfd03ee5dd1f-0.
INFO 03-02 00:22:12 [logger.py:42] Received request cmpl-53d6ff8fb82145d48e4558400ebc04e0-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:22:12 [async_llm.py:261] Added request cmpl-53d6ff8fb82145d48e4558400ebc04e0-0.
INFO 03-02 00:22:13 [logger.py:42] Received request cmpl-3ca60d3b72654a93bdefcc3623c8d425-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:22:13 [async_llm.py:261] Added request cmpl-3ca60d3b72654a93bdefcc3623c8d425-0.
INFO 03-02 00:22:14 [logger.py:42] Received request cmpl-768aa4e313dc486caec103ebb9f86eff-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:22:14 [async_llm.py:261] Added request cmpl-768aa4e313dc486caec103ebb9f86eff-0.
INFO 03-02 00:22:16 [logger.py:42] Received request cmpl-2153455ee57c4c4094946eebdfa10208-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:22:16 [async_llm.py:261] Added request cmpl-2153455ee57c4c4094946eebdfa10208-0.
INFO 03-02 00:22:17 [logger.py:42] Received request cmpl-8972d06c662848d7b60e8e79dae201aa-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:22:17 [async_llm.py:261] Added request cmpl-8972d06c662848d7b60e8e79dae201aa-0.
INFO 03-02 00:22:18 [loggers.py:116] Engine 000: Avg prompt throughput: 24.8 tokens/s, Avg generation throughput: 4.2 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.6%
INFO 03-02 00:22:18 [logger.py:42] Received request cmpl-c30b418b61c54ca68117bc1701b3212d-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:22:18 [async_llm.py:261] Added request cmpl-c30b418b61c54ca68117bc1701b3212d-0.
INFO 03-02 00:22:19 [logger.py:42] Received request cmpl-a3487086e5684675a41d5e8572528154-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:22:19 [async_llm.py:261] Added request cmpl-a3487086e5684675a41d5e8572528154-0.
INFO 03-02 00:22:20 [logger.py:42] Received request cmpl-6d607a247ba141c99a025d23c5293a4b-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:22:20 [async_llm.py:261] Added request cmpl-6d607a247ba141c99a025d23c5293a4b-0.
INFO 03-02 00:22:21 [logger.py:42] Received request cmpl-23ceaa757f404bafb22fab3b5a0a7e9e-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:22:21 [async_llm.py:261] Added request cmpl-23ceaa757f404bafb22fab3b5a0a7e9e-0.
INFO 03-02 00:22:22 [logger.py:42] Received request cmpl-b165fc30a9d744c7ac987cb0fe6f68c0-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:22:22 [async_llm.py:261] Added request cmpl-b165fc30a9d744c7ac987cb0fe6f68c0-0.
INFO 03-02 00:22:24 [logger.py:42] Received request cmpl-ffea512d70014da9a2f74142bd7b31ac-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:22:24 [async_llm.py:261] Added request cmpl-ffea512d70014da9a2f74142bd7b31ac-0.
INFO 03-02 00:22:25 [logger.py:42] Received request cmpl-f0fa56ec1a024017956146794bd26e88-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:22:25 [async_llm.py:261] Added request cmpl-f0fa56ec1a024017956146794bd26e88-0.
INFO 03-02 00:22:26 [logger.py:42] Received request cmpl-7a96680dc37143eba43994f91b8f86d4-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:22:26 [async_llm.py:261] Added request cmpl-7a96680dc37143eba43994f91b8f86d4-0.
INFO 03-02 00:22:27 [logger.py:42] Received request cmpl-26036256866a4c5abdf4157fbe6c6a80-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:22:27 [async_llm.py:261] Added request cmpl-26036256866a4c5abdf4157fbe6c6a80-0.
INFO 03-02 00:22:28 [loggers.py:116] Engine 000: Avg prompt throughput: 27.9 tokens/s, Avg generation throughput: 4.5 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.6%
INFO 03-02 00:22:28 [logger.py:42] Received request cmpl-843c16c2e9ca4cc69fea85dc290941f2-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:22:28 [async_llm.py:261] Added request cmpl-843c16c2e9ca4cc69fea85dc290941f2-0.
INFO 03-02 00:22:29 [logger.py:42] Received request cmpl-736c56d28a2b4eb0a351d8c2a898fe93-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:22:29 [async_llm.py:261] Added request cmpl-736c56d28a2b4eb0a351d8c2a898fe93-0.
INFO 03-02 00:22:31 [logger.py:42] Received request cmpl-e543d4ae06f24f488b802c2cd9ed13b7-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:22:31 [async_llm.py:261] Added request cmpl-e543d4ae06f24f488b802c2cd9ed13b7-0.
INFO 03-02 00:22:32 [logger.py:42] Received request cmpl-2450c28bdb164f8a8c8c9784872a40dc-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:22:32 [async_llm.py:261] Added request cmpl-2450c28bdb164f8a8c8c9784872a40dc-0.
INFO 03-02 00:22:33 [logger.py:42] Received request cmpl-c43b5a5163a14ada89c78ccaecdc777c-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:22:33 [async_llm.py:261] Added request cmpl-c43b5a5163a14ada89c78ccaecdc777c-0.
INFO 03-02 00:22:34 [logger.py:42] Received request cmpl-ac039917da0c4a5b80d97d1304e69e97-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:22:34 [async_llm.py:261] Added request cmpl-ac039917da0c4a5b80d97d1304e69e97-0.
INFO 03-02 00:22:35 [logger.py:42] Received request cmpl-178566cc8e6a4c66b7505bb9000e13da-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:22:35 [async_llm.py:261] Added request cmpl-178566cc8e6a4c66b7505bb9000e13da-0.
INFO 03-02 00:22:36 [logger.py:42] Received request cmpl-079693f2c07b4b55a16357c9b6439394-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:22:36 [async_llm.py:261] Added request cmpl-079693f2c07b4b55a16357c9b6439394-0.
INFO 03-02 00:22:37 [logger.py:42] Received request cmpl-58a908e5a8f84cad9ba6e8746034cffe-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:22:37 [async_llm.py:261] Added request cmpl-58a908e5a8f84cad9ba6e8746034cffe-0.
INFO 03-02 00:22:38 [loggers.py:116] Engine 000: Avg prompt throughput: 27.9 tokens/s, Avg generation throughput: 4.3 tokens/s, Running: 1 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.4%, Prefix cache hit rate: 51.6%
INFO 03-02 00:22:39 [logger.py:42] Received request cmpl-ffff546f59034430a321c54a81676a47-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:22:39 [async_llm.py:261] Added request cmpl-ffff546f59034430a321c54a81676a47-0.
INFO 03-02 00:22:40 [logger.py:42] Received request cmpl-7c96b13092a94a728530f2d720661445-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:22:40 [async_llm.py:261] Added request cmpl-7c96b13092a94a728530f2d720661445-0.
INFO 03-02 00:22:41 [logger.py:42] Received request cmpl-36488865bfd34bfaacccda28e2ec1de3-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:22:41 [async_llm.py:261] Added request cmpl-36488865bfd34bfaacccda28e2ec1de3-0.
INFO 03-02 00:22:42 [logger.py:42] Received request cmpl-69c681d72d104841a77ba67274bbdb6a-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:22:42 [async_llm.py:261] Added request cmpl-69c681d72d104841a77ba67274bbdb6a-0.
INFO 03-02 00:22:43 [logger.py:42] Received request cmpl-35e66cfc12a6453cac135e0cb37825d5-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:22:43 [async_llm.py:261] Added request cmpl-35e66cfc12a6453cac135e0cb37825d5-0.
INFO 03-02 00:22:44 [logger.py:42] Received request cmpl-18dd1fef54b64f88a37c3d2e5461716a-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:22:44 [async_llm.py:261] Added request cmpl-18dd1fef54b64f88a37c3d2e5461716a-0.
INFO 03-02 00:22:46 [logger.py:42] Received request cmpl-caf3812dac684e129653db5d68b2a08a-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:22:46 [async_llm.py:261] Added request cmpl-caf3812dac684e129653db5d68b2a08a-0.
INFO 03-02 00:22:47 [logger.py:42] Received request cmpl-6e653831898e417abb77800e8e7c4d15-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:22:47 [async_llm.py:261] Added request cmpl-6e653831898e417abb77800e8e7c4d15-0.
INFO 03-02 00:22:48 [loggers.py:116] Engine 000: Avg prompt throughput: 24.8 tokens/s, Avg generation throughput: 4.2 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.6%
INFO 03-02 00:22:48 [logger.py:42] Received request cmpl-f7e69c3ece3543f2afb196a8f0b39b4e-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:22:48 [async_llm.py:261] Added request cmpl-f7e69c3ece3543f2afb196a8f0b39b4e-0.
INFO 03-02 00:22:49 [logger.py:42] Received request cmpl-8d967ed6e31e4bb9b0daf2302de8adc6-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:22:49 [async_llm.py:261] Added request cmpl-8d967ed6e31e4bb9b0daf2302de8adc6-0.
INFO 03-02 00:22:50 [logger.py:42] Received request cmpl-106e808e1bbc4afaaa4e598f076c63c8-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:22:50 [async_llm.py:261] Added request cmpl-106e808e1bbc4afaaa4e598f076c63c8-0.
INFO 03-02 00:22:51 [logger.py:42] Received request cmpl-fe920300b6b74a7ba8947592938f3a22-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:22:51 [async_llm.py:261] Added request cmpl-fe920300b6b74a7ba8947592938f3a22-0.
INFO 03-02 00:22:52 [logger.py:42] Received request cmpl-a6d99210a7dd4c8f936dc9fe5acffeb4-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:22:52 [async_llm.py:261] Added request cmpl-a6d99210a7dd4c8f936dc9fe5acffeb4-0.
INFO 03-02 00:22:54 [logger.py:42] Received request cmpl-c19260321271456195a8ea072eab7846-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:22:54 [async_llm.py:261] Added request cmpl-c19260321271456195a8ea072eab7846-0.
INFO 03-02 00:22:55 [logger.py:42] Received request cmpl-828390815f4a4d2c8327f9445180414f-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:22:55 [async_llm.py:261] Added request cmpl-828390815f4a4d2c8327f9445180414f-0.
INFO 03-02 00:22:56 [logger.py:42] Received request cmpl-1bd0d6f1410146c7be3c3e0ac8a4c198-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:22:56 [async_llm.py:261] Added request cmpl-1bd0d6f1410146c7be3c3e0ac8a4c198-0.
INFO 03-02 00:22:57 [logger.py:42] Received request cmpl-cff673e4b6654bbb86548dc6e0f0db13-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:22:57 [async_llm.py:261] Added request cmpl-cff673e4b6654bbb86548dc6e0f0db13-0.
INFO 03-02 00:22:58 [loggers.py:116] Engine 000: Avg prompt throughput: 27.9 tokens/s, Avg generation throughput: 4.5 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.6%
INFO 03-02 00:22:58 [logger.py:42] Received request cmpl-1dbf19de6bcd4c67a663bfc8e1eb3a5c-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:22:58 [async_llm.py:261] Added request cmpl-1dbf19de6bcd4c67a663bfc8e1eb3a5c-0.
INFO 03-02 00:22:59 [logger.py:42] Received request cmpl-985c21bbc5494c0ca9c30f96a79cc92d-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:22:59 [async_llm.py:261] Added request cmpl-985c21bbc5494c0ca9c30f96a79cc92d-0.
INFO 03-02 00:23:01 [logger.py:42] Received request cmpl-7477a208d4e144b0b1e2b868b8ba28ea-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:23:01 [async_llm.py:261] Added request cmpl-7477a208d4e144b0b1e2b868b8ba28ea-0.
INFO 03-02 00:23:02 [logger.py:42] Received request cmpl-4030cd90e7a74247a85091111688ed63-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:23:02 [async_llm.py:261] Added request cmpl-4030cd90e7a74247a85091111688ed63-0.
INFO 03-02 00:23:03 [logger.py:42] Received request cmpl-d0e193f3f2544cc79e40944e9c23ea18-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:23:03 [async_llm.py:261] Added request cmpl-d0e193f3f2544cc79e40944e9c23ea18-0.
INFO 03-02 00:23:04 [logger.py:42] Received request cmpl-bab51264b7554734bd0adeb10efd2799-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:23:04 [async_llm.py:261] Added request cmpl-bab51264b7554734bd0adeb10efd2799-0.
INFO 03-02 00:23:05 [logger.py:42] Received request cmpl-ef131fe687194f689d440c1c969a5ae8-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:23:05 [async_llm.py:261] Added request cmpl-ef131fe687194f689d440c1c969a5ae8-0.
INFO 03-02 00:23:06 [logger.py:42] Received request cmpl-a655cac3e03e42a09aa7f5900d1bff92-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:23:06 [async_llm.py:261] Added request cmpl-a655cac3e03e42a09aa7f5900d1bff92-0.
INFO 03-02 00:23:07 [logger.py:42] Received request cmpl-1188fd67d0c242e4a61e7e5552bf44dc-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:23:07 [async_llm.py:261] Added request cmpl-1188fd67d0c242e4a61e7e5552bf44dc-0.
INFO 03-02 00:23:08 [loggers.py:116] Engine 000: Avg prompt throughput: 27.9 tokens/s, Avg generation throughput: 4.4 tokens/s, Running: 1 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.4%, Prefix cache hit rate: 51.6%
INFO 03-02 00:23:09 [logger.py:42] Received request cmpl-38c9923b543c4019927737b01a8bbf10-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:23:09 [async_llm.py:261] Added request cmpl-38c9923b543c4019927737b01a8bbf10-0.
INFO 03-02 00:23:10 [logger.py:42] Received request cmpl-16d601fbefec4ddaac5853fbef8b643d-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:23:10 [async_llm.py:261] Added request cmpl-16d601fbefec4ddaac5853fbef8b643d-0.
INFO 03-02 00:23:11 [logger.py:42] Received request cmpl-bb74c940e4fd4601b1033c791c64d76a-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:23:11 [async_llm.py:261] Added request cmpl-bb74c940e4fd4601b1033c791c64d76a-0.
INFO 03-02 00:23:12 [logger.py:42] Received request cmpl-e02ec662f7f647fd9061a60400fc06ea-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:23:12 [async_llm.py:261] Added request cmpl-e02ec662f7f647fd9061a60400fc06ea-0.
INFO 03-02 00:23:13 [logger.py:42] Received request cmpl-a2b9d096ea6f4988b5712eedcbdbd365-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:23:13 [async_llm.py:261] Added request cmpl-a2b9d096ea6f4988b5712eedcbdbd365-0.
INFO 03-02 00:23:14 [logger.py:42] Received request cmpl-c7c0437b232a4295996832d4df0ed37d-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:23:14 [async_llm.py:261] Added request cmpl-c7c0437b232a4295996832d4df0ed37d-0.
INFO 03-02 00:23:16 [logger.py:42] Received request cmpl-5f910173157f433d8ae1c0e4f6e7043b-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:23:16 [async_llm.py:261] Added request cmpl-5f910173157f433d8ae1c0e4f6e7043b-0.
INFO 03-02 00:23:17 [logger.py:42] Received request cmpl-65b57377d37e40d890c86badf5b00695-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:23:17 [async_llm.py:261] Added request cmpl-65b57377d37e40d890c86badf5b00695-0.
INFO 03-02 00:23:18 [loggers.py:116] Engine 000: Avg prompt throughput: 24.8 tokens/s, Avg generation throughput: 4.1 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.6%
INFO 03-02 00:23:18 [logger.py:42] Received request cmpl-711ed0c309e4468f8d69b881f72880f7-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:23:18 [async_llm.py:261] Added request cmpl-711ed0c309e4468f8d69b881f72880f7-0.
INFO 03-02 00:23:19 [logger.py:42] Received request cmpl-3483aa1883b74f6795509d0563f690d0-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:23:19 [async_llm.py:261] Added request cmpl-3483aa1883b74f6795509d0563f690d0-0.
INFO 03-02 00:23:20 [logger.py:42] Received request cmpl-cd1bf0e2330d4f129a4a0d1e50974b89-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:23:20 [async_llm.py:261] Added request cmpl-cd1bf0e2330d4f129a4a0d1e50974b89-0.
INFO 03-02 00:23:21 [logger.py:42] Received request cmpl-785f6ebc083a489fbf6c6d2055233c08-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:23:21 [async_llm.py:261] Added request cmpl-785f6ebc083a489fbf6c6d2055233c08-0.
INFO 03-02 00:23:22 [logger.py:42] Received request cmpl-63676c5a7f194bd6aec4e2230b80fe17-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:23:22 [async_llm.py:261] Added request cmpl-63676c5a7f194bd6aec4e2230b80fe17-0.
INFO 03-02 00:23:24 [logger.py:42] Received request cmpl-b17c407991d145f3a1b42fc2d3b0b5df-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:23:24 [async_llm.py:261] Added request cmpl-b17c407991d145f3a1b42fc2d3b0b5df-0.
INFO 03-02 00:23:25 [logger.py:42] Received request cmpl-4e17e2c72410479490ba1325f90ffbc2-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:23:25 [async_llm.py:261] Added request cmpl-4e17e2c72410479490ba1325f90ffbc2-0.
INFO 03-02 00:23:26 [logger.py:42] Received request cmpl-1f3d9d03e72e4986972d57c87ae50009-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:23:26 [async_llm.py:261] Added request cmpl-1f3d9d03e72e4986972d57c87ae50009-0.
INFO 03-02 00:23:27 [logger.py:42] Received request cmpl-d087d2be03104d8583aeef5a82bef11f-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:23:27 [async_llm.py:261] Added request cmpl-d087d2be03104d8583aeef5a82bef11f-0.
INFO 03-02 00:23:28 [loggers.py:116] Engine 000: Avg prompt throughput: 27.9 tokens/s, Avg generation throughput: 4.5 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.6%
INFO 03-02 00:23:28 [logger.py:42] Received request cmpl-7d989d31acef4568affb14660ffc5717-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:23:28 [async_llm.py:261] Added request cmpl-7d989d31acef4568affb14660ffc5717-0.
INFO 03-02 00:23:29 [logger.py:42] Received request cmpl-8dad0f8c1e704b10bf0b84a0fb5db65d-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:23:29 [async_llm.py:261] Added request cmpl-8dad0f8c1e704b10bf0b84a0fb5db65d-0.
INFO 03-02 00:23:31 [logger.py:42] Received request cmpl-64e81552f27f41da8c792e60f72d197c-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:23:31 [async_llm.py:261] Added request cmpl-64e81552f27f41da8c792e60f72d197c-0.
INFO 03-02 00:23:32 [logger.py:42] Received request cmpl-bd8b2576997e4448893d0c6cd4262146-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:23:32 [async_llm.py:261] Added request cmpl-bd8b2576997e4448893d0c6cd4262146-0.
INFO 03-02 00:23:33 [logger.py:42] Received request cmpl-6617b7dfc359444b93894cbb5ab51f70-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:23:33 [async_llm.py:261] Added request cmpl-6617b7dfc359444b93894cbb5ab51f70-0.
INFO 03-02 00:23:34 [logger.py:42] Received request cmpl-79859147b77044b28e33f9733187c492-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:23:34 [async_llm.py:261] Added request cmpl-79859147b77044b28e33f9733187c492-0.
INFO 03-02 00:23:35 [logger.py:42] Received request cmpl-e47bb14b2f444b0ebedde145aea16f70-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:23:35 [async_llm.py:261] Added request cmpl-e47bb14b2f444b0ebedde145aea16f70-0.
INFO 03-02 00:23:36 [logger.py:42] Received request cmpl-dffbcd32097743c3a833eb83b05c3658-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:23:36 [async_llm.py:261] Added request cmpl-dffbcd32097743c3a833eb83b05c3658-0.
INFO 03-02 00:23:37 [logger.py:42] Received request cmpl-9b124644ce7d4b78b2b356f3720f2eb5-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:23:37 [async_llm.py:261] Added request cmpl-9b124644ce7d4b78b2b356f3720f2eb5-0.
INFO 03-02 00:23:38 [loggers.py:116] Engine 000: Avg prompt throughput: 27.9 tokens/s, Avg generation throughput: 4.4 tokens/s, Running: 1 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.4%, Prefix cache hit rate: 51.6%
INFO 03-02 00:23:39 [logger.py:42] Received request cmpl-8bafc7d1bafb4f31b3effe76d7996d8d-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:23:39 [async_llm.py:261] Added request cmpl-8bafc7d1bafb4f31b3effe76d7996d8d-0.
INFO 03-02 00:23:40 [logger.py:42] Received request cmpl-882e34e726e84ae6a5ddeb991bd7f7b5-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:23:40 [async_llm.py:261] Added request cmpl-882e34e726e84ae6a5ddeb991bd7f7b5-0.
INFO 03-02 00:23:41 [logger.py:42] Received request cmpl-23e22dcde0ee40d3948377b4f22ad6d2-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:23:41 [async_llm.py:261] Added request cmpl-23e22dcde0ee40d3948377b4f22ad6d2-0.
INFO 03-02 00:23:42 [logger.py:42] Received request cmpl-d73d00f557d64e6eafee6b766ada8a65-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:23:42 [async_llm.py:261] Added request cmpl-d73d00f557d64e6eafee6b766ada8a65-0.
INFO 03-02 00:23:43 [logger.py:42] Received request cmpl-60aafdbfd95748398734e4f6b334f15e-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:23:43 [async_llm.py:261] Added request cmpl-60aafdbfd95748398734e4f6b334f15e-0.
INFO 03-02 00:23:44 [logger.py:42] Received request cmpl-b6805f27f4524c67a098b4525a2dc447-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:23:44 [async_llm.py:261] Added request cmpl-b6805f27f4524c67a098b4525a2dc447-0.
INFO 03-02 00:23:46 [logger.py:42] Received request cmpl-26884ac136f447559810d2fe66fb770f-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:23:46 [async_llm.py:261] Added request cmpl-26884ac136f447559810d2fe66fb770f-0.
INFO 03-02 00:23:47 [logger.py:42] Received request cmpl-5fe84d58204940bd9ad8a3195db8566b-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:23:47 [async_llm.py:261] Added request cmpl-5fe84d58204940bd9ad8a3195db8566b-0.
INFO 03-02 00:23:48 [loggers.py:116] Engine 000: Avg prompt throughput: 24.8 tokens/s, Avg generation throughput: 4.1 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.6%
INFO 03-02 00:23:48 [logger.py:42] Received request cmpl-634b04b0ce8e46da9dede277c106baf3-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:23:48 [async_llm.py:261] Added request cmpl-634b04b0ce8e46da9dede277c106baf3-0.
INFO 03-02 00:23:49 [logger.py:42] Received request cmpl-a1b32e179d2f4311a0e01d85dc77ba90-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:23:49 [async_llm.py:261] Added request cmpl-a1b32e179d2f4311a0e01d85dc77ba90-0.
INFO 03-02 00:23:50 [logger.py:42] Received request cmpl-09ec2157aa7d4085a0d679eefa78ede9-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:23:50 [async_llm.py:261] Added request cmpl-09ec2157aa7d4085a0d679eefa78ede9-0.
INFO 03-02 00:23:51 [logger.py:42] Received request cmpl-7cc9eea75ce14d12be973d26258567da-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:23:51 [async_llm.py:261] Added request cmpl-7cc9eea75ce14d12be973d26258567da-0.
INFO 03-02 00:23:52 [logger.py:42] Received request cmpl-f0808ec15ac347a0a0bf3e6f04bdd82a-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:23:52 [async_llm.py:261] Added request cmpl-f0808ec15ac347a0a0bf3e6f04bdd82a-0.
INFO 03-02 00:23:54 [logger.py:42] Received request cmpl-1405f6348b694c529a57f6cb6d2a3c30-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:23:54 [async_llm.py:261] Added request cmpl-1405f6348b694c529a57f6cb6d2a3c30-0.
INFO 03-02 00:23:55 [logger.py:42] Received request cmpl-8c4b0f72c89945979f7a9c0fc1845d91-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:23:55 [async_llm.py:261] Added request cmpl-8c4b0f72c89945979f7a9c0fc1845d91-0.
INFO 03-02 00:23:56 [logger.py:42] Received request cmpl-7a621581ad2b4eeb9c15919a3d4b859c-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:23:56 [async_llm.py:261] Added request cmpl-7a621581ad2b4eeb9c15919a3d4b859c-0.
INFO 03-02 00:23:57 [logger.py:42] Received request cmpl-40d8f74a157e459b9c45d0784b3c1e2b-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:23:57 [async_llm.py:261] Added request cmpl-40d8f74a157e459b9c45d0784b3c1e2b-0.
INFO 03-02 00:23:58 [loggers.py:116] Engine 000: Avg prompt throughput: 27.9 tokens/s, Avg generation throughput: 4.5 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.6%
INFO 03-02 00:23:58 [logger.py:42] Received request cmpl-a1f97b6e5c284b14a71dc0223bb881b0-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:23:58 [async_llm.py:261] Added request cmpl-a1f97b6e5c284b14a71dc0223bb881b0-0.
INFO 03-02 00:23:59 [logger.py:42] Received request cmpl-59da53e7538b43248f68c23b8a3de1b4-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:23:59 [async_llm.py:261] Added request cmpl-59da53e7538b43248f68c23b8a3de1b4-0.
INFO 03-02 00:24:00 [logger.py:42] Received request cmpl-bef9340cf2d14aa4b450cc5b8e7e9dbe-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:24:00 [async_llm.py:261] Added request cmpl-bef9340cf2d14aa4b450cc5b8e7e9dbe-0.
INFO 03-02 00:24:02 [logger.py:42] Received request cmpl-9911b7909e5f4d43ad63e85b383f4ccb-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:24:02 [async_llm.py:261] Added request cmpl-9911b7909e5f4d43ad63e85b383f4ccb-0.
INFO 03-02 00:24:03 [logger.py:42] Received request cmpl-b77fa7ec1bdf4172aee2b3c3248e27bb-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:24:03 [async_llm.py:261] Added request cmpl-b77fa7ec1bdf4172aee2b3c3248e27bb-0.
INFO 03-02 00:24:04 [logger.py:42] Received request cmpl-5a20fd9f9f66488ab6960dec2c15934a-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:24:04 [async_llm.py:261] Added request cmpl-5a20fd9f9f66488ab6960dec2c15934a-0.
INFO 03-02 00:24:05 [logger.py:42] Received request cmpl-51b2b9843030418caa220f9340d1db46-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:24:05 [async_llm.py:261] Added request cmpl-51b2b9843030418caa220f9340d1db46-0.
INFO 03-02 00:24:06 [logger.py:42] Received request cmpl-29021db3b4114a068068a2a20c9d91cd-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:24:06 [async_llm.py:261] Added request cmpl-29021db3b4114a068068a2a20c9d91cd-0.
INFO 03-02 00:24:07 [logger.py:42] Received request cmpl-fe15ebf59c2e447a92ce8eba10e1f9e4-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:24:07 [async_llm.py:261] Added request cmpl-fe15ebf59c2e447a92ce8eba10e1f9e4-0.
INFO 03-02 00:24:08 [loggers.py:116] Engine 000: Avg prompt throughput: 27.9 tokens/s, Avg generation throughput: 4.5 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.6%
INFO 03-02 00:24:09 [logger.py:42] Received request cmpl-a09bcd4f227d460baca77824ab884d0b-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:24:09 [async_llm.py:261] Added request cmpl-a09bcd4f227d460baca77824ab884d0b-0.
INFO 03-02 00:24:10 [logger.py:42] Received request cmpl-4fedea21a8234cd3b416da4e7b845628-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:24:10 [async_llm.py:261] Added request cmpl-4fedea21a8234cd3b416da4e7b845628-0.
INFO 03-02 00:24:11 [logger.py:42] Received request cmpl-3e24e426805043fe8ec81317f5066293-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:24:11 [async_llm.py:261] Added request cmpl-3e24e426805043fe8ec81317f5066293-0.
INFO 03-02 00:24:12 [logger.py:42] Received request cmpl-180e3b46b5794293bd14fe7325885ef2-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:24:12 [async_llm.py:261] Added request cmpl-180e3b46b5794293bd14fe7325885ef2-0.
INFO 03-02 00:24:13 [logger.py:42] Received request cmpl-44ad7138e42047a4813d3daf4507768a-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:24:13 [async_llm.py:261] Added request cmpl-44ad7138e42047a4813d3daf4507768a-0.
INFO 03-02 00:24:14 [logger.py:42] Received request cmpl-550fe56cb8824cd392efbdce568b0336-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:24:14 [async_llm.py:261] Added request cmpl-550fe56cb8824cd392efbdce568b0336-0.
INFO 03-02 00:24:15 [logger.py:42] Received request cmpl-b00d57077659401db0b548af1e4b2fd6-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:24:15 [async_llm.py:261] Added request cmpl-b00d57077659401db0b548af1e4b2fd6-0.
INFO 03-02 00:24:17 [logger.py:42] Received request cmpl-5c8d4f54b7bd4d879a6595d35823160a-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:24:17 [async_llm.py:261] Added request cmpl-5c8d4f54b7bd4d879a6595d35823160a-0.
INFO 03-02 00:24:18 [loggers.py:116] Engine 000: Avg prompt throughput: 24.8 tokens/s, Avg generation throughput: 4.0 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.6%
INFO 03-02 00:24:18 [logger.py:42] Received request cmpl-21e32a62558a4be1ab929769ccbb654c-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:24:18 [async_llm.py:261] Added request cmpl-21e32a62558a4be1ab929769ccbb654c-0.
INFO 03-02 00:24:19 [logger.py:42] Received request cmpl-621cfc2b8b8040c995483e715f809a3e-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:24:19 [async_llm.py:261] Added request cmpl-621cfc2b8b8040c995483e715f809a3e-0.
INFO 03-02 00:24:20 [logger.py:42] Received request cmpl-38f80aa4ed164cd68ad8b45cc7b8550c-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:24:20 [async_llm.py:261] Added request cmpl-38f80aa4ed164cd68ad8b45cc7b8550c-0.
INFO 03-02 00:24:21 [logger.py:42] Received request cmpl-d157377c384f410785986bc816ff952d-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:24:21 [async_llm.py:261] Added request cmpl-d157377c384f410785986bc816ff952d-0.
INFO 03-02 00:24:22 [logger.py:42] Received request cmpl-7dea5b62135749beb0dada8834617411-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:24:22 [async_llm.py:261] Added request cmpl-7dea5b62135749beb0dada8834617411-0.
INFO 03-02 00:24:24 [logger.py:42] Received request cmpl-477fa555fe7041b59a47dd789c296f40-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:24:24 [async_llm.py:261] Added request cmpl-477fa555fe7041b59a47dd789c296f40-0.
INFO 03-02 00:24:25 [logger.py:42] Received request cmpl-06e44275f4f64a89af290e184814e08b-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:24:25 [async_llm.py:261] Added request cmpl-06e44275f4f64a89af290e184814e08b-0.
INFO 03-02 00:24:26 [logger.py:42] Received request cmpl-a584fd78f37d46688c615d078b008ef7-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:24:26 [async_llm.py:261] Added request cmpl-a584fd78f37d46688c615d078b008ef7-0.
INFO 03-02 00:24:27 [logger.py:42] Received request cmpl-9eec8ee29c124b91b6365b442cf3666f-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:24:27 [async_llm.py:261] Added request cmpl-9eec8ee29c124b91b6365b442cf3666f-0.
INFO 03-02 00:24:28 [loggers.py:116] Engine 000: Avg prompt throughput: 27.9 tokens/s, Avg generation throughput: 4.5 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.6%
INFO 03-02 00:24:28 [logger.py:42] Received request cmpl-4deea5ecdb494fb288fc86aca40596b0-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:24:28 [async_llm.py:261] Added request cmpl-4deea5ecdb494fb288fc86aca40596b0-0.
INFO 03-02 00:24:29 [logger.py:42] Received request cmpl-f5da36ada09d44bd8c5ee2b9ab1f401f-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:24:29 [async_llm.py:261] Added request cmpl-f5da36ada09d44bd8c5ee2b9ab1f401f-0.
INFO 03-02 00:24:30 [logger.py:42] Received request cmpl-5959bf6e67634a94afc56d9513459547-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:24:30 [async_llm.py:261] Added request cmpl-5959bf6e67634a94afc56d9513459547-0.
INFO 03-02 00:24:32 [logger.py:42] Received request cmpl-6ccc55a6961b423e91b884c1e87dfe80-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:24:32 [async_llm.py:261] Added request cmpl-6ccc55a6961b423e91b884c1e87dfe80-0.
INFO 03-02 00:24:33 [logger.py:42] Received request cmpl-7248e9745fd44cb084a90a563fc3c02a-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:24:33 [async_llm.py:261] Added request cmpl-7248e9745fd44cb084a90a563fc3c02a-0.
INFO 03-02 00:24:34 [logger.py:42] Received request cmpl-a4559748426d44b69a051a7f1daf5c3a-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:24:34 [async_llm.py:261] Added request cmpl-a4559748426d44b69a051a7f1daf5c3a-0.
INFO 03-02 00:24:35 [logger.py:42] Received request cmpl-a4c0441e8a164963877d13a116c71356-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:24:35 [async_llm.py:261] Added request cmpl-a4c0441e8a164963877d13a116c71356-0.
INFO 03-02 00:24:36 [logger.py:42] Received request cmpl-700e86818d6b4e50be9e0c312293fa72-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:24:36 [async_llm.py:261] Added request cmpl-700e86818d6b4e50be9e0c312293fa72-0.
INFO 03-02 00:24:37 [logger.py:42] Received request cmpl-311251c0aa4745d2b880fda9b75ab72f-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:24:37 [async_llm.py:261] Added request cmpl-311251c0aa4745d2b880fda9b75ab72f-0.
INFO 03-02 00:24:38 [loggers.py:116] Engine 000: Avg prompt throughput: 27.9 tokens/s, Avg generation throughput: 4.5 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.6%
INFO 03-02 00:24:39 [logger.py:42] Received request cmpl-ec94ddc3068642c3950cfc14e0ecaa41-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:24:39 [async_llm.py:261] Added request cmpl-ec94ddc3068642c3950cfc14e0ecaa41-0.
INFO 03-02 00:24:40 [logger.py:42] Received request cmpl-ab4bad3a6a694e5e9fa22ffc3b50914a-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:24:40 [async_llm.py:261] Added request cmpl-ab4bad3a6a694e5e9fa22ffc3b50914a-0.
INFO 03-02 00:24:41 [logger.py:42] Received request cmpl-467ede1697c040c0a2177d312e27e6e6-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:24:41 [async_llm.py:261] Added request cmpl-467ede1697c040c0a2177d312e27e6e6-0.
INFO 03-02 00:24:42 [logger.py:42] Received request cmpl-b2445d963f9d4160a632012ad0c2f5d6-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:24:42 [async_llm.py:261] Added request cmpl-b2445d963f9d4160a632012ad0c2f5d6-0.
INFO 03-02 00:24:43 [logger.py:42] Received request cmpl-6910e1088d394694bf52476f41df67d1-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:24:43 [async_llm.py:261] Added request cmpl-6910e1088d394694bf52476f41df67d1-0.
INFO 03-02 00:24:44 [logger.py:42] Received request cmpl-0676799a2ccd434e932934ae350770ad-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:24:44 [async_llm.py:261] Added request cmpl-0676799a2ccd434e932934ae350770ad-0.
INFO 03-02 00:24:45 [logger.py:42] Received request cmpl-5940a96f4b0e405489ecfc130aca49e4-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:24:45 [async_llm.py:261] Added request cmpl-5940a96f4b0e405489ecfc130aca49e4-0.
INFO 03-02 00:24:47 [logger.py:42] Received request cmpl-61074c4076ac4bc8aae27eac5ec9416a-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:24:47 [async_llm.py:261] Added request cmpl-61074c4076ac4bc8aae27eac5ec9416a-0.
INFO 03-02 00:24:48 [loggers.py:116] Engine 000: Avg prompt throughput: 24.8 tokens/s, Avg generation throughput: 4.0 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.6%
INFO 03-02 00:24:48 [logger.py:42] Received request cmpl-643f7346ce0c495985ec090a99a26e7a-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:24:48 [async_llm.py:261] Added request cmpl-643f7346ce0c495985ec090a99a26e7a-0.
INFO 03-02 00:24:49 [logger.py:42] Received request cmpl-92f53cd8f9ba43f0aeb4f2554bfe9077-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:24:49 [async_llm.py:261] Added request cmpl-92f53cd8f9ba43f0aeb4f2554bfe9077-0.
INFO 03-02 00:24:50 [logger.py:42] Received request cmpl-ed28790721134bf68d26e401eaee6ca2-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:24:50 [async_llm.py:261] Added request cmpl-ed28790721134bf68d26e401eaee6ca2-0.
INFO 03-02 00:24:51 [logger.py:42] Received request cmpl-cb42112faabd45b2a99d5670e38a4b84-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:24:51 [async_llm.py:261] Added request cmpl-cb42112faabd45b2a99d5670e38a4b84-0.
INFO 03-02 00:24:52 [logger.py:42] Received request cmpl-0e60b66fa838484f81539a365a361381-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:24:52 [async_llm.py:261] Added request cmpl-0e60b66fa838484f81539a365a361381-0.
INFO 03-02 00:24:54 [logger.py:42] Received request cmpl-d7bb08a143894037a38e64affa10fa50-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:24:54 [async_llm.py:261] Added request cmpl-d7bb08a143894037a38e64affa10fa50-0.
INFO 03-02 00:24:55 [logger.py:42] Received request cmpl-4d89d8a1f8f64d45ba1c3f3d209cb9a5-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:24:55 [async_llm.py:261] Added request cmpl-4d89d8a1f8f64d45ba1c3f3d209cb9a5-0.
INFO 03-02 00:24:56 [logger.py:42] Received request cmpl-5d1b17dc7330484a944811374007acbd-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:24:56 [async_llm.py:261] Added request cmpl-5d1b17dc7330484a944811374007acbd-0.
INFO 03-02 00:24:57 [logger.py:42] Received request cmpl-2b618e4489354befbda068686bf4bea0-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:24:57 [async_llm.py:261] Added request cmpl-2b618e4489354befbda068686bf4bea0-0.
INFO 03-02 00:24:58 [loggers.py:116] Engine 000: Avg prompt throughput: 27.9 tokens/s, Avg generation throughput: 4.5 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.6%
INFO 03-02 00:24:58 [logger.py:42] Received request cmpl-5b61ed66d9b34a278698ee63c1f79e38-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:24:58 [async_llm.py:261] Added request cmpl-5b61ed66d9b34a278698ee63c1f79e38-0.
INFO 03-02 00:24:59 [logger.py:42] Received request cmpl-b64431db4f9d42a18a876526606c6972-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:24:59 [async_llm.py:261] Added request cmpl-b64431db4f9d42a18a876526606c6972-0.
INFO 03-02 00:25:00 [logger.py:42] Received request cmpl-b211f70b9a174817ba1c05df096eed95-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:25:00 [async_llm.py:261] Added request cmpl-b211f70b9a174817ba1c05df096eed95-0.
INFO 03-02 00:25:02 [logger.py:42] Received request cmpl-7d048d76243a413c869c49d6ca5b4a5e-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:25:02 [async_llm.py:261] Added request cmpl-7d048d76243a413c869c49d6ca5b4a5e-0.
INFO 03-02 00:25:03 [logger.py:42] Received request cmpl-415e7e5079f54a28898dbb9d60a8a72a-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:25:03 [async_llm.py:261] Added request cmpl-415e7e5079f54a28898dbb9d60a8a72a-0.
INFO 03-02 00:25:04 [logger.py:42] Received request cmpl-7fd88b9801bd48a0b6255b3eecc267c6-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:25:04 [async_llm.py:261] Added request cmpl-7fd88b9801bd48a0b6255b3eecc267c6-0.
INFO 03-02 00:25:05 [logger.py:42] Received request cmpl-d6804178cf3d44f7aa735a037ce371ae-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:25:05 [async_llm.py:261] Added request cmpl-d6804178cf3d44f7aa735a037ce371ae-0.
INFO 03-02 00:25:06 [logger.py:42] Received request cmpl-c0e7e226030e4d62a3f22056045f42c6-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:25:06 [async_llm.py:261] Added request cmpl-c0e7e226030e4d62a3f22056045f42c6-0.
INFO 03-02 00:25:07 [logger.py:42] Received request cmpl-91bd46a4c9ec4b2e9766b4eb8250db6c-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:25:07 [async_llm.py:261] Added request cmpl-91bd46a4c9ec4b2e9766b4eb8250db6c-0.
INFO 03-02 00:25:08 [loggers.py:116] Engine 000: Avg prompt throughput: 27.9 tokens/s, Avg generation throughput: 4.5 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.6%
INFO 03-02 00:25:09 [logger.py:42] Received request cmpl-5a17d81ec5494d588e56b89242f3c97e-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:25:09 [async_llm.py:261] Added request cmpl-5a17d81ec5494d588e56b89242f3c97e-0.
INFO 03-02 00:25:10 [logger.py:42] Received request cmpl-69e4487bc88c46798048a030b0a9d90c-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:25:10 [async_llm.py:261] Added request cmpl-69e4487bc88c46798048a030b0a9d90c-0.
INFO 03-02 00:25:11 [logger.py:42] Received request cmpl-7ac50199ad184098a78c08e6f791e9b8-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:25:11 [async_llm.py:261] Added request cmpl-7ac50199ad184098a78c08e6f791e9b8-0.
INFO 03-02 00:25:12 [logger.py:42] Received request cmpl-04213998e22c4c019919f0e2d4bbe1b8-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:25:12 [async_llm.py:261] Added request cmpl-04213998e22c4c019919f0e2d4bbe1b8-0.
INFO 03-02 00:25:13 [logger.py:42] Received request cmpl-418fe7ed5cbe4b248c03a0f48d6bc8b9-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:25:13 [async_llm.py:261] Added request cmpl-418fe7ed5cbe4b248c03a0f48d6bc8b9-0.
INFO 03-02 00:25:14 [logger.py:42] Received request cmpl-e688361d287640b58ecc18cf16d5a615-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:25:14 [async_llm.py:261] Added request cmpl-e688361d287640b58ecc18cf16d5a615-0.
INFO 03-02 00:25:15 [logger.py:42] Received request cmpl-75e01e7484ff4f378fc9c82d29650a7f-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:25:15 [async_llm.py:261] Added request cmpl-75e01e7484ff4f378fc9c82d29650a7f-0.
INFO 03-02 00:25:17 [logger.py:42] Received request cmpl-c83ab6456a2c48daa3f91d58b814c00b-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:25:17 [async_llm.py:261] Added request cmpl-c83ab6456a2c48daa3f91d58b814c00b-0.
INFO 03-02 00:25:18 [loggers.py:116] Engine 000: Avg prompt throughput: 24.8 tokens/s, Avg generation throughput: 4.0 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.6%
INFO 03-02 00:25:18 [logger.py:42] Received request cmpl-ff560b89bf784b32acab5f03c0570d35-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:25:18 [async_llm.py:261] Added request cmpl-ff560b89bf784b32acab5f03c0570d35-0.
INFO 03-02 00:25:19 [logger.py:42] Received request cmpl-f3d7e0acd2b04e2a82cd216b1c453830-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:25:19 [async_llm.py:261] Added request cmpl-f3d7e0acd2b04e2a82cd216b1c453830-0.
INFO 03-02 00:25:20 [logger.py:42] Received request cmpl-4e8cdcbc440f4c33a80f40e7fbdd2617-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:25:20 [async_llm.py:261] Added request cmpl-4e8cdcbc440f4c33a80f40e7fbdd2617-0.
INFO 03-02 00:25:21 [logger.py:42] Received request cmpl-7e36c50e64b14211a5ecdd9f6f67dbf5-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:25:21 [async_llm.py:261] Added request cmpl-7e36c50e64b14211a5ecdd9f6f67dbf5-0.
INFO 03-02 00:25:22 [logger.py:42] Received request cmpl-beb350cedf4d450bb3fc6e39ac379d32-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:25:22 [async_llm.py:261] Added request cmpl-beb350cedf4d450bb3fc6e39ac379d32-0.
INFO 03-02 00:25:24 [logger.py:42] Received request cmpl-e53b808dd98c40d0aa3be5d02ac86ca2-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:25:24 [async_llm.py:261] Added request cmpl-e53b808dd98c40d0aa3be5d02ac86ca2-0.
INFO 03-02 00:25:25 [logger.py:42] Received request cmpl-e96ba50b432a49048dad98ba79b05663-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:25:25 [async_llm.py:261] Added request cmpl-e96ba50b432a49048dad98ba79b05663-0.
INFO 03-02 00:25:26 [logger.py:42] Received request cmpl-47c8efcccc844c6cb1fc4e7128a828e5-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:25:26 [async_llm.py:261] Added request cmpl-47c8efcccc844c6cb1fc4e7128a828e5-0.
INFO 03-02 00:25:27 [logger.py:42] Received request cmpl-a69555aa268e437bb5d48feb094a900d-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:25:27 [async_llm.py:261] Added request cmpl-a69555aa268e437bb5d48feb094a900d-0.
INFO 03-02 00:25:28 [loggers.py:116] Engine 000: Avg prompt throughput: 27.9 tokens/s, Avg generation throughput: 4.5 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.6%
INFO 03-02 00:25:28 [logger.py:42] Received request cmpl-e86ff1360ed943518f9642722e0890d6-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:25:28 [async_llm.py:261] Added request cmpl-e86ff1360ed943518f9642722e0890d6-0.
INFO 03-02 00:25:29 [logger.py:42] Received request cmpl-e3834428073e4fccade4fc4a9024a450-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:25:29 [async_llm.py:261] Added request cmpl-e3834428073e4fccade4fc4a9024a450-0.
INFO 03-02 00:25:31 [logger.py:42] Received request cmpl-245b3d9282f24ac3a9dba0aa9d48cb8e-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:25:31 [async_llm.py:261] Added request cmpl-245b3d9282f24ac3a9dba0aa9d48cb8e-0.
INFO 03-02 00:25:32 [logger.py:42] Received request cmpl-d7007ce3c852444fad583485da5a3032-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:25:32 [async_llm.py:261] Added request cmpl-d7007ce3c852444fad583485da5a3032-0.
INFO 03-02 00:25:33 [logger.py:42] Received request cmpl-9da33f6a546b44f5a393b98dc88dbfd1-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:25:33 [async_llm.py:261] Added request cmpl-9da33f6a546b44f5a393b98dc88dbfd1-0.
INFO 03-02 00:25:34 [logger.py:42] Received request cmpl-3bdbe3661a26432b80ce9beb0ab6d0c7-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:25:34 [async_llm.py:261] Added request cmpl-3bdbe3661a26432b80ce9beb0ab6d0c7-0.
INFO 03-02 00:25:35 [logger.py:42] Received request cmpl-cd3940ddfb7f429cbc804ef1a1c5c6ba-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:25:35 [async_llm.py:261] Added request cmpl-cd3940ddfb7f429cbc804ef1a1c5c6ba-0.
INFO 03-02 00:25:36 [logger.py:42] Received request cmpl-2a91a85f7832467e9563c04272087755-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:25:36 [async_llm.py:261] Added request cmpl-2a91a85f7832467e9563c04272087755-0.
INFO 03-02 00:25:37 [logger.py:42] Received request cmpl-5a09e045c1ba4021b1071c2163dac8b1-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:25:37 [async_llm.py:261] Added request cmpl-5a09e045c1ba4021b1071c2163dac8b1-0.
INFO 03-02 00:25:38 [loggers.py:116] Engine 000: Avg prompt throughput: 27.9 tokens/s, Avg generation throughput: 4.5 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.6%
INFO 03-02 00:25:39 [logger.py:42] Received request cmpl-e3b910f42f68462da221de9a94249d0a-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:25:39 [async_llm.py:261] Added request cmpl-e3b910f42f68462da221de9a94249d0a-0.
INFO 03-02 00:25:40 [logger.py:42] Received request cmpl-6cb6908cda74422394b4954f36bd7d51-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:25:40 [async_llm.py:261] Added request cmpl-6cb6908cda74422394b4954f36bd7d51-0.
INFO 03-02 00:25:41 [logger.py:42] Received request cmpl-dded3ac7e8d04509acb2a6fa71f33146-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:25:41 [async_llm.py:261] Added request cmpl-dded3ac7e8d04509acb2a6fa71f33146-0.
INFO 03-02 00:25:42 [logger.py:42] Received request cmpl-3079f84320cd459eb84cb5d17298397e-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:25:42 [async_llm.py:261] Added request cmpl-3079f84320cd459eb84cb5d17298397e-0.
INFO 03-02 00:25:43 [logger.py:42] Received request cmpl-3dc88bd4543e43289fd836ad5856f470-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:25:43 [async_llm.py:261] Added request cmpl-3dc88bd4543e43289fd836ad5856f470-0.
INFO 03-02 00:25:44 [logger.py:42] Received request cmpl-1806c94312e74f7d853e1e5ee1805558-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:25:44 [async_llm.py:261] Added request cmpl-1806c94312e74f7d853e1e5ee1805558-0.
INFO 03-02 00:25:46 [logger.py:42] Received request cmpl-0b3eb3779e5b4eea8f1fc385179f7828-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:25:46 [async_llm.py:261] Added request cmpl-0b3eb3779e5b4eea8f1fc385179f7828-0.
INFO 03-02 00:25:47 [logger.py:42] Received request cmpl-4b2d8230cbdb4bb4af2bf543a3515f28-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:25:47 [async_llm.py:261] Added request cmpl-4b2d8230cbdb4bb4af2bf543a3515f28-0.
INFO 03-02 00:25:48 [loggers.py:116] Engine 000: Avg prompt throughput: 24.8 tokens/s, Avg generation throughput: 4.0 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.6%
INFO 03-02 00:25:48 [logger.py:42] Received request cmpl-988417771c5b45baa8bc9a7368db663e-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:25:48 [async_llm.py:261] Added request cmpl-988417771c5b45baa8bc9a7368db663e-0.
INFO 03-02 00:25:49 [logger.py:42] Received request cmpl-dd9dfdccd958422d87eddc4e58dc3690-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:25:49 [async_llm.py:261] Added request cmpl-dd9dfdccd958422d87eddc4e58dc3690-0.
INFO 03-02 00:25:50 [logger.py:42] Received request cmpl-33f2ff761e9d44839c2b86f12d16bba7-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:25:50 [async_llm.py:261] Added request cmpl-33f2ff761e9d44839c2b86f12d16bba7-0.
INFO 03-02 00:25:51 [logger.py:42] Received request cmpl-6688f34138e34ff58f397878a22ff08c-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:25:51 [async_llm.py:261] Added request cmpl-6688f34138e34ff58f397878a22ff08c-0.
INFO 03-02 00:25:52 [logger.py:42] Received request cmpl-9773d60348304a82b781d62825b2453d-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:25:52 [async_llm.py:261] Added request cmpl-9773d60348304a82b781d62825b2453d-0.
INFO 03-02 00:25:54 [logger.py:42] Received request cmpl-8ebcf311649c4092a4b1a09d61b32623-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:25:54 [async_llm.py:261] Added request cmpl-8ebcf311649c4092a4b1a09d61b32623-0.
INFO 03-02 00:25:55 [logger.py:42] Received request cmpl-4a2964c6a56b4aedac7db96f16b08b3d-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:25:55 [async_llm.py:261] Added request cmpl-4a2964c6a56b4aedac7db96f16b08b3d-0.
INFO 03-02 00:25:56 [logger.py:42] Received request cmpl-de06e874f8b44e20ac7834de93994022-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:25:56 [async_llm.py:261] Added request cmpl-de06e874f8b44e20ac7834de93994022-0.
INFO 03-02 00:25:57 [logger.py:42] Received request cmpl-234d51086a0f47ea8158779214526da6-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:25:57 [async_llm.py:261] Added request cmpl-234d51086a0f47ea8158779214526da6-0.
INFO 03-02 00:25:58 [loggers.py:116] Engine 000: Avg prompt throughput: 27.9 tokens/s, Avg generation throughput: 4.5 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.6%
INFO 03-02 00:25:58 [logger.py:42] Received request cmpl-e87a4d0c81fc4100a2acd86c551933ab-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:25:58 [async_llm.py:261] Added request cmpl-e87a4d0c81fc4100a2acd86c551933ab-0.
INFO 03-02 00:25:59 [logger.py:42] Received request cmpl-e58beda6a7094c87a5bac4b479f78fb7-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:25:59 [async_llm.py:261] Added request cmpl-e58beda6a7094c87a5bac4b479f78fb7-0.
INFO 03-02 00:26:00 [logger.py:42] Received request cmpl-0d6c749193ce4f49ae580133a23e9846-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:26:00 [async_llm.py:261] Added request cmpl-0d6c749193ce4f49ae580133a23e9846-0.
INFO 03-02 00:26:02 [logger.py:42] Received request cmpl-18a021103b77442ebbba799455d1deb3-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:26:02 [async_llm.py:261] Added request cmpl-18a021103b77442ebbba799455d1deb3-0.
INFO 03-02 00:26:03 [logger.py:42] Received request cmpl-5bf2a0c2233c42e78ed79bd5843be8f1-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:26:03 [async_llm.py:261] Added request cmpl-5bf2a0c2233c42e78ed79bd5843be8f1-0.
INFO 03-02 00:26:04 [logger.py:42] Received request cmpl-470475af91754617bab1a3f9eac4fed4-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:26:04 [async_llm.py:261] Added request cmpl-470475af91754617bab1a3f9eac4fed4-0.
INFO 03-02 00:26:05 [logger.py:42] Received request cmpl-60f743bc4d034dc3aba97681198df2c7-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:26:05 [async_llm.py:261] Added request cmpl-60f743bc4d034dc3aba97681198df2c7-0.
INFO 03-02 00:26:06 [logger.py:42] Received request cmpl-036570e65cc046deb48b54d52e41eb20-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:26:06 [async_llm.py:261] Added request cmpl-036570e65cc046deb48b54d52e41eb20-0.
INFO 03-02 00:26:07 [logger.py:42] Received request cmpl-4f8bd143509a451d8d57f60d45dfa11b-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:26:07 [async_llm.py:261] Added request cmpl-4f8bd143509a451d8d57f60d45dfa11b-0.
INFO 03-02 00:26:08 [loggers.py:116] Engine 000: Avg prompt throughput: 27.9 tokens/s, Avg generation throughput: 4.5 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.6%
INFO 03-02 00:26:09 [logger.py:42] Received request cmpl-f31b24ee40634f8ba3451387adc137ee-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:26:09 [async_llm.py:261] Added request cmpl-f31b24ee40634f8ba3451387adc137ee-0.
INFO 03-02 00:26:10 [logger.py:42] Received request cmpl-ee1664e8dbc44fdd8a104298fc799fc0-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:26:10 [async_llm.py:261] Added request cmpl-ee1664e8dbc44fdd8a104298fc799fc0-0.
INFO 03-02 00:26:11 [logger.py:42] Received request cmpl-087b4d391626410bba3afa76338c9ebb-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:26:11 [async_llm.py:261] Added request cmpl-087b4d391626410bba3afa76338c9ebb-0.
INFO 03-02 00:26:12 [logger.py:42] Received request cmpl-c45e6368e87b40a8b13641896ea48715-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:26:12 [async_llm.py:261] Added request cmpl-c45e6368e87b40a8b13641896ea48715-0.
INFO 03-02 00:26:13 [logger.py:42] Received request cmpl-2a28b9446a3a43588d1b7eb1987b64e8-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:26:13 [async_llm.py:261] Added request cmpl-2a28b9446a3a43588d1b7eb1987b64e8-0.
INFO 03-02 00:26:14 [logger.py:42] Received request cmpl-9160e546a3f540c9bc9408ba3746a5b1-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:26:14 [async_llm.py:261] Added request cmpl-9160e546a3f540c9bc9408ba3746a5b1-0.
INFO 03-02 00:26:15 [logger.py:42] Received request cmpl-fe09d3a10e714c01a4f777c71a95e387-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:26:15 [async_llm.py:261] Added request cmpl-fe09d3a10e714c01a4f777c71a95e387-0.
INFO 03-02 00:26:17 [logger.py:42] Received request cmpl-80069c980935401ab4e5cad9bba206cb-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:26:17 [async_llm.py:261] Added request cmpl-80069c980935401ab4e5cad9bba206cb-0.
INFO 03-02 00:26:18 [loggers.py:116] Engine 000: Avg prompt throughput: 24.8 tokens/s, Avg generation throughput: 4.0 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.6%
INFO 03-02 00:26:18 [logger.py:42] Received request cmpl-26172bfb406a4c50aef5ded3d5ae1127-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:26:18 [async_llm.py:261] Added request cmpl-26172bfb406a4c50aef5ded3d5ae1127-0.
INFO 03-02 00:26:19 [logger.py:42] Received request cmpl-a163fc1799e0443e8d2617845ecf59da-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:26:19 [async_llm.py:261] Added request cmpl-a163fc1799e0443e8d2617845ecf59da-0.
INFO 03-02 00:26:20 [logger.py:42] Received request cmpl-78a8b13ea9a442eab30f44b76e33e53c-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:26:20 [async_llm.py:261] Added request cmpl-78a8b13ea9a442eab30f44b76e33e53c-0.
INFO 03-02 00:26:21 [logger.py:42] Received request cmpl-3249899b4c364976bd69be56e28d4399-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:26:21 [async_llm.py:261] Added request cmpl-3249899b4c364976bd69be56e28d4399-0.
INFO 03-02 00:26:22 [logger.py:42] Received request cmpl-2cc68b310a87448da0f1953cbbf6f8fb-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:26:22 [async_llm.py:261] Added request cmpl-2cc68b310a87448da0f1953cbbf6f8fb-0.
INFO 03-02 00:26:24 [logger.py:42] Received request cmpl-56fd7f7784e747e89cdd84202517a825-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:26:24 [async_llm.py:261] Added request cmpl-56fd7f7784e747e89cdd84202517a825-0.
INFO 03-02 00:26:25 [logger.py:42] Received request cmpl-4d1a6d0f6e9c4c4498ada8c6a67c29f5-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:26:25 [async_llm.py:261] Added request cmpl-4d1a6d0f6e9c4c4498ada8c6a67c29f5-0.
INFO 03-02 00:26:26 [logger.py:42] Received request cmpl-6aa84c949d714c01aeb24713edade42c-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:26:26 [async_llm.py:261] Added request cmpl-6aa84c949d714c01aeb24713edade42c-0.
INFO 03-02 00:26:27 [logger.py:42] Received request cmpl-1d1d582fc3784926b424f659b763bd2b-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:26:27 [async_llm.py:261] Added request cmpl-1d1d582fc3784926b424f659b763bd2b-0.
INFO 03-02 00:26:28 [loggers.py:116] Engine 000: Avg prompt throughput: 27.9 tokens/s, Avg generation throughput: 4.5 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.6%
INFO 03-02 00:26:28 [logger.py:42] Received request cmpl-85fc4b79cabe47278a14297378704f18-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:26:28 [async_llm.py:261] Added request cmpl-85fc4b79cabe47278a14297378704f18-0.
INFO 03-02 00:26:29 [logger.py:42] Received request cmpl-bfa0945c7f7e4f149c4ce262d30948d0-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:26:29 [async_llm.py:261] Added request cmpl-bfa0945c7f7e4f149c4ce262d30948d0-0.
INFO 03-02 00:26:31 [logger.py:42] Received request cmpl-b7e4fc19dbd146b78cd42df039f64ba5-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:26:31 [async_llm.py:261] Added request cmpl-b7e4fc19dbd146b78cd42df039f64ba5-0.
INFO 03-02 00:26:32 [logger.py:42] Received request cmpl-4e127577159e48598916a3863e94c9a4-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:26:32 [async_llm.py:261] Added request cmpl-4e127577159e48598916a3863e94c9a4-0.
INFO 03-02 00:26:33 [logger.py:42] Received request cmpl-09e2cfea8d5e408996ea94576711c5b0-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:26:33 [async_llm.py:261] Added request cmpl-09e2cfea8d5e408996ea94576711c5b0-0.
INFO 03-02 00:26:34 [logger.py:42] Received request cmpl-0b57f3f512ce40f8a6e92f6e24c25cc5-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:26:34 [async_llm.py:261] Added request cmpl-0b57f3f512ce40f8a6e92f6e24c25cc5-0.
INFO 03-02 00:26:35 [logger.py:42] Received request cmpl-3904983444ff4a91a0fc44e4fd701d96-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:26:35 [async_llm.py:261] Added request cmpl-3904983444ff4a91a0fc44e4fd701d96-0.
INFO 03-02 00:26:36 [logger.py:42] Received request cmpl-04f7c7a9904c4baebe08027b23ed420e-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:26:36 [async_llm.py:261] Added request cmpl-04f7c7a9904c4baebe08027b23ed420e-0.
INFO 03-02 00:26:37 [logger.py:42] Received request cmpl-25809af75804485aae7aa5c6b13537be-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:26:37 [async_llm.py:261] Added request cmpl-25809af75804485aae7aa5c6b13537be-0.
INFO 03-02 00:26:38 [loggers.py:116] Engine 000: Avg prompt throughput: 27.9 tokens/s, Avg generation throughput: 4.5 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.6%
INFO 03-02 00:26:39 [logger.py:42] Received request cmpl-c5b1895417b24e8e9899c8db3ee0b837-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:26:39 [async_llm.py:261] Added request cmpl-c5b1895417b24e8e9899c8db3ee0b837-0.
INFO 03-02 00:26:40 [logger.py:42] Received request cmpl-6d32b863e37f4c9baa6b1bb1488203a9-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:26:40 [async_llm.py:261] Added request cmpl-6d32b863e37f4c9baa6b1bb1488203a9-0.
INFO 03-02 00:26:41 [logger.py:42] Received request cmpl-aedab525fc20437bb649c9c1bbbffd0a-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:26:41 [async_llm.py:261] Added request cmpl-aedab525fc20437bb649c9c1bbbffd0a-0.
INFO 03-02 00:26:42 [logger.py:42] Received request cmpl-5e0abb150a774327b1982c44e5ade9f2-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:26:42 [async_llm.py:261] Added request cmpl-5e0abb150a774327b1982c44e5ade9f2-0.
INFO 03-02 00:26:43 [logger.py:42] Received request cmpl-e8a0112d537940d68041ba940d8e54f3-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:26:43 [async_llm.py:261] Added request cmpl-e8a0112d537940d68041ba940d8e54f3-0.
INFO 03-02 00:26:44 [logger.py:42] Received request cmpl-d07d922d2d9449d294f0cb723f36f1bd-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:26:44 [async_llm.py:261] Added request cmpl-d07d922d2d9449d294f0cb723f36f1bd-0.
INFO 03-02 00:26:46 [logger.py:42] Received request cmpl-9bd2b39b49c7466c97127d8560197dc7-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:26:46 [async_llm.py:261] Added request cmpl-9bd2b39b49c7466c97127d8560197dc7-0.
INFO 03-02 00:26:47 [logger.py:42] Received request cmpl-6d93687d50da47cc98f88340d5adbd56-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:26:47 [async_llm.py:261] Added request cmpl-6d93687d50da47cc98f88340d5adbd56-0.
INFO 03-02 00:26:48 [loggers.py:116] Engine 000: Avg prompt throughput: 24.8 tokens/s, Avg generation throughput: 4.0 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.6%
INFO 03-02 00:26:48 [logger.py:42] Received request cmpl-b9e23b23252f4ec09d25ea7da4c1f172-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:26:48 [async_llm.py:261] Added request cmpl-b9e23b23252f4ec09d25ea7da4c1f172-0.
INFO 03-02 00:26:49 [logger.py:42] Received request cmpl-3278f30f3db44199952284444cf754e0-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:26:49 [async_llm.py:261] Added request cmpl-3278f30f3db44199952284444cf754e0-0.
INFO 03-02 00:26:50 [logger.py:42] Received request cmpl-07f8a767ff6047378afeee6f169cff69-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:26:50 [async_llm.py:261] Added request cmpl-07f8a767ff6047378afeee6f169cff69-0.
INFO 03-02 00:26:51 [logger.py:42] Received request cmpl-54f8820552644ad59e007039c3443cf6-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:26:51 [async_llm.py:261] Added request cmpl-54f8820552644ad59e007039c3443cf6-0.
INFO 03-02 00:26:52 [logger.py:42] Received request cmpl-da51c849078445a1986c938aaaccbc19-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:26:52 [async_llm.py:261] Added request cmpl-da51c849078445a1986c938aaaccbc19-0.
INFO 03-02 00:26:54 [logger.py:42] Received request cmpl-52c743fc0036487298daf2dfff218a8d-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:26:54 [async_llm.py:261] Added request cmpl-52c743fc0036487298daf2dfff218a8d-0.
INFO 03-02 00:26:55 [logger.py:42] Received request cmpl-97b4d96f29364d909ca010274326796f-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:26:55 [async_llm.py:261] Added request cmpl-97b4d96f29364d909ca010274326796f-0.
INFO 03-02 00:26:56 [logger.py:42] Received request cmpl-ebb768f3fa8b4908839fa788bbb6934e-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:26:56 [async_llm.py:261] Added request cmpl-ebb768f3fa8b4908839fa788bbb6934e-0.
INFO 03-02 00:26:57 [logger.py:42] Received request cmpl-9519bd4d993f4efc9df9bdb536bfe91d-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:26:57 [async_llm.py:261] Added request cmpl-9519bd4d993f4efc9df9bdb536bfe91d-0.
INFO 03-02 00:26:58 [loggers.py:116] Engine 000: Avg prompt throughput: 27.9 tokens/s, Avg generation throughput: 4.5 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.6%
INFO 03-02 00:26:58 [logger.py:42] Received request cmpl-e4bb055420a24f41a6b124868962b77f-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:26:58 [async_llm.py:261] Added request cmpl-e4bb055420a24f41a6b124868962b77f-0.
INFO 03-02 00:26:59 [logger.py:42] Received request cmpl-d107563e9e084a40858b80273822ef5a-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:26:59 [async_llm.py:261] Added request cmpl-d107563e9e084a40858b80273822ef5a-0.
INFO 03-02 00:27:00 [logger.py:42] Received request cmpl-b10cc516467b458581b0bee29936d429-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:27:00 [async_llm.py:261] Added request cmpl-b10cc516467b458581b0bee29936d429-0.
INFO 03-02 00:27:02 [logger.py:42] Received request cmpl-5fb278dadb8d4ceba19a33ba5f74da54-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:27:02 [async_llm.py:261] Added request cmpl-5fb278dadb8d4ceba19a33ba5f74da54-0.
INFO 03-02 00:27:03 [logger.py:42] Received request cmpl-7bf736adbfa04a5b8a95cee01425d3d6-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:27:03 [async_llm.py:261] Added request cmpl-7bf736adbfa04a5b8a95cee01425d3d6-0.
INFO 03-02 00:27:04 [logger.py:42] Received request cmpl-2f41c6b09f594839b8d710fa45c2a078-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:27:04 [async_llm.py:261] Added request cmpl-2f41c6b09f594839b8d710fa45c2a078-0.
INFO 03-02 00:27:05 [logger.py:42] Received request cmpl-dd118225d38b43fba606fbc690998805-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:27:05 [async_llm.py:261] Added request cmpl-dd118225d38b43fba606fbc690998805-0.
INFO 03-02 00:27:06 [logger.py:42] Received request cmpl-e484330b1f674671bdfe59314ccc1d99-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:27:06 [async_llm.py:261] Added request cmpl-e484330b1f674671bdfe59314ccc1d99-0.
INFO 03-02 00:27:07 [logger.py:42] Received request cmpl-46c2a84a529b4ed096c4060d6f4a1113-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:27:07 [async_llm.py:261] Added request cmpl-46c2a84a529b4ed096c4060d6f4a1113-0.
INFO 03-02 00:27:08 [loggers.py:116] Engine 000: Avg prompt throughput: 27.9 tokens/s, Avg generation throughput: 4.5 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.6%
INFO 03-02 00:27:09 [logger.py:42] Received request cmpl-8c2b259cbd914205a6364113263c73b7-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:27:09 [async_llm.py:261] Added request cmpl-8c2b259cbd914205a6364113263c73b7-0.
INFO 03-02 00:27:10 [logger.py:42] Received request cmpl-dbb485cc5d3a4828a356e98d20a453d2-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:27:10 [async_llm.py:261] Added request cmpl-dbb485cc5d3a4828a356e98d20a453d2-0.
INFO 03-02 00:27:11 [logger.py:42] Received request cmpl-3113ee55e0af44af8ab2a42bc907e888-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:27:11 [async_llm.py:261] Added request cmpl-3113ee55e0af44af8ab2a42bc907e888-0.
INFO 03-02 00:27:12 [logger.py:42] Received request cmpl-3890de16ab9144fe897cc325435280c1-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:27:12 [async_llm.py:261] Added request cmpl-3890de16ab9144fe897cc325435280c1-0.
INFO 03-02 00:27:13 [logger.py:42] Received request cmpl-38e819c9b98f4c129d80780c551e37ca-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:27:13 [async_llm.py:261] Added request cmpl-38e819c9b98f4c129d80780c551e37ca-0.
INFO 03-02 00:27:14 [logger.py:42] Received request cmpl-5ad96d9af39b4a0996c7cc1db2d6401e-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:27:14 [async_llm.py:261] Added request cmpl-5ad96d9af39b4a0996c7cc1db2d6401e-0.
INFO 03-02 00:27:15 [logger.py:42] Received request cmpl-2ad2dff8911c43e1966236f84f6fbc1c-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:27:15 [async_llm.py:261] Added request cmpl-2ad2dff8911c43e1966236f84f6fbc1c-0.
INFO 03-02 00:27:17 [logger.py:42] Received request cmpl-2d7ef41df41a4fd08d963a6a841f17e4-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:27:17 [async_llm.py:261] Added request cmpl-2d7ef41df41a4fd08d963a6a841f17e4-0.
INFO 03-02 00:27:18 [loggers.py:116] Engine 000: Avg prompt throughput: 24.8 tokens/s, Avg generation throughput: 4.0 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.6%
INFO 03-02 00:27:18 [logger.py:42] Received request cmpl-e264fad4718a40cd8d2274fb087bb264-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:27:18 [async_llm.py:261] Added request cmpl-e264fad4718a40cd8d2274fb087bb264-0.
INFO 03-02 00:27:19 [logger.py:42] Received request cmpl-991c4781528e4436aca59008b33052fa-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:27:19 [async_llm.py:261] Added request cmpl-991c4781528e4436aca59008b33052fa-0.
INFO 03-02 00:27:20 [logger.py:42] Received request cmpl-1bea0f6a374e4a689e49caecec172064-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:27:20 [async_llm.py:261] Added request cmpl-1bea0f6a374e4a689e49caecec172064-0.
INFO 03-02 00:27:21 [logger.py:42] Received request cmpl-6d3b8868af7d480b92c3122c1ae3609c-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:27:21 [async_llm.py:261] Added request cmpl-6d3b8868af7d480b92c3122c1ae3609c-0.
INFO 03-02 00:27:22 [logger.py:42] Received request cmpl-1cc9841aebda4cd4afe6292b56740b5d-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:27:22 [async_llm.py:261] Added request cmpl-1cc9841aebda4cd4afe6292b56740b5d-0.
INFO 03-02 00:27:24 [logger.py:42] Received request cmpl-73e20bf5a1b34d6b9e92cf4fb12e9a03-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:27:24 [async_llm.py:261] Added request cmpl-73e20bf5a1b34d6b9e92cf4fb12e9a03-0.
INFO 03-02 00:27:25 [logger.py:42] Received request cmpl-db809c8a90944f7c96d2a7e55b674e39-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:27:25 [async_llm.py:261] Added request cmpl-db809c8a90944f7c96d2a7e55b674e39-0.
INFO 03-02 00:27:26 [logger.py:42] Received request cmpl-30ff9f1e276245a1b2ac42c5c9b98861-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:27:26 [async_llm.py:261] Added request cmpl-30ff9f1e276245a1b2ac42c5c9b98861-0.
INFO 03-02 00:27:27 [logger.py:42] Received request cmpl-fbd879ad964540eaa3ddef63ca910c39-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:27:27 [async_llm.py:261] Added request cmpl-fbd879ad964540eaa3ddef63ca910c39-0.
INFO 03-02 00:27:28 [loggers.py:116] Engine 000: Avg prompt throughput: 27.9 tokens/s, Avg generation throughput: 4.5 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.6%
INFO 03-02 00:27:28 [logger.py:42] Received request cmpl-16d7d2aa7c1d411f8418a5d1e5518980-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:27:28 [async_llm.py:261] Added request cmpl-16d7d2aa7c1d411f8418a5d1e5518980-0.
INFO 03-02 00:27:29 [logger.py:42] Received request cmpl-af2fa0f71c80436b955293ad51a8c965-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:27:29 [async_llm.py:261] Added request cmpl-af2fa0f71c80436b955293ad51a8c965-0.
INFO 03-02 00:27:30 [logger.py:42] Received request cmpl-2365479d9a344c27b380e54a5f967380-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:27:30 [async_llm.py:261] Added request cmpl-2365479d9a344c27b380e54a5f967380-0.
INFO 03-02 00:27:32 [logger.py:42] Received request cmpl-3827f34c446d4fad8ef212208c303286-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:27:32 [async_llm.py:261] Added request cmpl-3827f34c446d4fad8ef212208c303286-0.
INFO 03-02 00:27:33 [logger.py:42] Received request cmpl-435f5060f19c488dbc5cd936536446a9-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:27:33 [async_llm.py:261] Added request cmpl-435f5060f19c488dbc5cd936536446a9-0.
INFO 03-02 00:27:34 [logger.py:42] Received request cmpl-0862bc18617e49cf98bf73308882f2b4-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:27:34 [async_llm.py:261] Added request cmpl-0862bc18617e49cf98bf73308882f2b4-0.
INFO 03-02 00:27:35 [logger.py:42] Received request cmpl-7bba0cb26c3746628e101dcdbc7588cc-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:27:35 [async_llm.py:261] Added request cmpl-7bba0cb26c3746628e101dcdbc7588cc-0.
INFO 03-02 00:27:36 [logger.py:42] Received request cmpl-ea2a7d8eb53d4b2abbeef0e4162625ed-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:27:36 [async_llm.py:261] Added request cmpl-ea2a7d8eb53d4b2abbeef0e4162625ed-0.
INFO 03-02 00:27:37 [logger.py:42] Received request cmpl-393b9febc522418a9c38ab44021cc0bd-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:27:37 [async_llm.py:261] Added request cmpl-393b9febc522418a9c38ab44021cc0bd-0.
INFO 03-02 00:27:38 [loggers.py:116] Engine 000: Avg prompt throughput: 27.9 tokens/s, Avg generation throughput: 4.5 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.6%
INFO 03-02 00:27:39 [logger.py:42] Received request cmpl-35f047ab1fce4bc59585016ebcee080d-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:27:39 [async_llm.py:261] Added request cmpl-35f047ab1fce4bc59585016ebcee080d-0.
INFO 03-02 00:27:40 [logger.py:42] Received request cmpl-9645c4020f4e4225bd9d7eaf3005420b-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:27:40 [async_llm.py:261] Added request cmpl-9645c4020f4e4225bd9d7eaf3005420b-0.
INFO 03-02 00:27:41 [logger.py:42] Received request cmpl-488574a1b52e4e97a881884227c51db6-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:27:41 [async_llm.py:261] Added request cmpl-488574a1b52e4e97a881884227c51db6-0.
INFO 03-02 00:27:42 [logger.py:42] Received request cmpl-afe0790020c44de59d9b1f5eb31a2435-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:27:42 [async_llm.py:261] Added request cmpl-afe0790020c44de59d9b1f5eb31a2435-0.
INFO 03-02 00:27:43 [logger.py:42] Received request cmpl-068aeafd59464e52a2c63fc0180fbdc0-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:27:43 [async_llm.py:261] Added request cmpl-068aeafd59464e52a2c63fc0180fbdc0-0.
INFO 03-02 00:27:44 [logger.py:42] Received request cmpl-444a27b3cac94570aa803307d4c41738-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:27:44 [async_llm.py:261] Added request cmpl-444a27b3cac94570aa803307d4c41738-0.
INFO 03-02 00:27:45 [logger.py:42] Received request cmpl-80a434c4788440f39fed83616b19c980-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:27:45 [async_llm.py:261] Added request cmpl-80a434c4788440f39fed83616b19c980-0.
INFO 03-02 00:27:47 [logger.py:42] Received request cmpl-999cef7965a34a14ae72f9b8db118788-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:27:47 [async_llm.py:261] Added request cmpl-999cef7965a34a14ae72f9b8db118788-0.
INFO 03-02 00:27:48 [loggers.py:116] Engine 000: Avg prompt throughput: 24.8 tokens/s, Avg generation throughput: 4.0 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.6%
INFO 03-02 00:27:48 [logger.py:42] Received request cmpl-59f796c66d7c489abfa4620972a4926d-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:27:48 [async_llm.py:261] Added request cmpl-59f796c66d7c489abfa4620972a4926d-0.
INFO 03-02 00:27:49 [logger.py:42] Received request cmpl-50b62a5cc9ee4291aa39960a8cb79a15-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:27:49 [async_llm.py:261] Added request cmpl-50b62a5cc9ee4291aa39960a8cb79a15-0.
INFO 03-02 00:27:50 [logger.py:42] Received request cmpl-583abf27a94147ae827111a16899fbb0-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:27:50 [async_llm.py:261] Added request cmpl-583abf27a94147ae827111a16899fbb0-0.
INFO 03-02 00:27:51 [logger.py:42] Received request cmpl-c8b87f0335aa4d9dbe702dd7bd5243f4-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:27:51 [async_llm.py:261] Added request cmpl-c8b87f0335aa4d9dbe702dd7bd5243f4-0.
INFO 03-02 00:27:52 [logger.py:42] Received request cmpl-43500cf5cbca49eca653327e9c332dc0-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:27:52 [async_llm.py:261] Added request cmpl-43500cf5cbca49eca653327e9c332dc0-0.
INFO 03-02 00:27:54 [logger.py:42] Received request cmpl-76ddff7c4c2742e7b8a6f32d328f6650-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:27:54 [async_llm.py:261] Added request cmpl-76ddff7c4c2742e7b8a6f32d328f6650-0.
INFO 03-02 00:27:55 [logger.py:42] Received request cmpl-154c9409c8ea4266b271192f641e3df7-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:27:55 [async_llm.py:261] Added request cmpl-154c9409c8ea4266b271192f641e3df7-0.
INFO 03-02 00:27:56 [logger.py:42] Received request cmpl-82bb562817cf41c99a442a640756f5cc-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:27:56 [async_llm.py:261] Added request cmpl-82bb562817cf41c99a442a640756f5cc-0.
INFO 03-02 00:27:57 [logger.py:42] Received request cmpl-a12625e22e034a92a0c92fafcfc8729b-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:27:57 [async_llm.py:261] Added request cmpl-a12625e22e034a92a0c92fafcfc8729b-0.
INFO 03-02 00:27:58 [loggers.py:116] Engine 000: Avg prompt throughput: 27.9 tokens/s, Avg generation throughput: 4.5 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.6%
INFO 03-02 00:27:58 [logger.py:42] Received request cmpl-7a94d8d8bb9540f3bc30125ae2553864-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:27:58 [async_llm.py:261] Added request cmpl-7a94d8d8bb9540f3bc30125ae2553864-0.
INFO 03-02 00:27:59 [logger.py:42] Received request cmpl-fbe679ebc57b42e1bff3e00d28bc1486-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:27:59 [async_llm.py:261] Added request cmpl-fbe679ebc57b42e1bff3e00d28bc1486-0.
INFO 03-02 00:28:00 [logger.py:42] Received request cmpl-feeb0cda774d4db0b18a366b5a1d6f84-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:28:00 [async_llm.py:261] Added request cmpl-feeb0cda774d4db0b18a366b5a1d6f84-0.
INFO 03-02 00:28:02 [logger.py:42] Received request cmpl-83a8d94d7f6746ffa44eee1aa8f17204-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:28:02 [async_llm.py:261] Added request cmpl-83a8d94d7f6746ffa44eee1aa8f17204-0.
INFO 03-02 00:28:03 [logger.py:42] Received request cmpl-00f44849ae204d9081f7081cdb6caea9-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:28:03 [async_llm.py:261] Added request cmpl-00f44849ae204d9081f7081cdb6caea9-0.
INFO 03-02 00:28:04 [logger.py:42] Received request cmpl-643dded4215046a2824d6f62df0d7be1-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:28:04 [async_llm.py:261] Added request cmpl-643dded4215046a2824d6f62df0d7be1-0.
INFO 03-02 00:28:05 [logger.py:42] Received request cmpl-635a108b9df84ef187219ee6cd4a345f-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:28:05 [async_llm.py:261] Added request cmpl-635a108b9df84ef187219ee6cd4a345f-0.
INFO 03-02 00:28:06 [logger.py:42] Received request cmpl-137634d7deb34d2ea702ec9f7b2c4a8a-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:28:06 [async_llm.py:261] Added request cmpl-137634d7deb34d2ea702ec9f7b2c4a8a-0.
INFO 03-02 00:28:07 [logger.py:42] Received request cmpl-38ad73d006414f3789f0adeb92a39529-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:28:07 [async_llm.py:261] Added request cmpl-38ad73d006414f3789f0adeb92a39529-0.
INFO 03-02 00:28:08 [loggers.py:116] Engine 000: Avg prompt throughput: 27.9 tokens/s, Avg generation throughput: 4.5 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.6%
INFO 03-02 00:28:09 [logger.py:42] Received request cmpl-8d8138fd9357407da928ef52aaffb99e-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:28:09 [async_llm.py:261] Added request cmpl-8d8138fd9357407da928ef52aaffb99e-0.
INFO 03-02 00:28:10 [logger.py:42] Received request cmpl-061b36c279464f4f86187e7a44a90e15-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:28:10 [async_llm.py:261] Added request cmpl-061b36c279464f4f86187e7a44a90e15-0.
INFO 03-02 00:28:11 [logger.py:42] Received request cmpl-eee5a64675734ddc896126c5a92ced2c-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:28:11 [async_llm.py:261] Added request cmpl-eee5a64675734ddc896126c5a92ced2c-0.
INFO 03-02 00:28:12 [logger.py:42] Received request cmpl-fef7c1a6de0f43e49ed04554239bb663-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:28:12 [async_llm.py:261] Added request cmpl-fef7c1a6de0f43e49ed04554239bb663-0.
INFO 03-02 00:28:13 [logger.py:42] Received request cmpl-878ffb724bd34f67838237c143b1b4e9-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:28:13 [async_llm.py:261] Added request cmpl-878ffb724bd34f67838237c143b1b4e9-0.
INFO 03-02 00:28:14 [logger.py:42] Received request cmpl-7484da2c0bba4d01827855bfd91a9272-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:28:14 [async_llm.py:261] Added request cmpl-7484da2c0bba4d01827855bfd91a9272-0.
INFO 03-02 00:28:15 [logger.py:42] Received request cmpl-a83deab54400450195fdba620ded561b-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:28:15 [async_llm.py:261] Added request cmpl-a83deab54400450195fdba620ded561b-0.
INFO 03-02 00:28:17 [logger.py:42] Received request cmpl-02978b9037624b07a0b643e7859fac7b-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:28:17 [async_llm.py:261] Added request cmpl-02978b9037624b07a0b643e7859fac7b-0.
INFO 03-02 00:28:18 [loggers.py:116] Engine 000: Avg prompt throughput: 24.8 tokens/s, Avg generation throughput: 4.0 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.6%
INFO 03-02 00:28:18 [logger.py:42] Received request cmpl-bd5c10f3c22040e7ace74fa6a7f5fae8-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:28:18 [async_llm.py:261] Added request cmpl-bd5c10f3c22040e7ace74fa6a7f5fae8-0.
INFO 03-02 00:28:19 [logger.py:42] Received request cmpl-f1380dccad734040a02f6f78748abe89-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:28:19 [async_llm.py:261] Added request cmpl-f1380dccad734040a02f6f78748abe89-0.
INFO 03-02 00:28:20 [logger.py:42] Received request cmpl-4120c187bbdf47f8aa9d1be4ad5948ec-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:28:20 [async_llm.py:261] Added request cmpl-4120c187bbdf47f8aa9d1be4ad5948ec-0.
INFO 03-02 00:28:21 [logger.py:42] Received request cmpl-9b6ac236460247488fd16faaa0ec3748-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:28:21 [async_llm.py:261] Added request cmpl-9b6ac236460247488fd16faaa0ec3748-0.
INFO 03-02 00:28:22 [logger.py:42] Received request cmpl-1d365fa4b4964254a4bfccc956156def-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:28:22 [async_llm.py:261] Added request cmpl-1d365fa4b4964254a4bfccc956156def-0.
INFO 03-02 00:28:23 [logger.py:42] Received request cmpl-f0d2aafe765e47ef82024792568e2a51-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:28:23 [async_llm.py:261] Added request cmpl-f0d2aafe765e47ef82024792568e2a51-0.
INFO 03-02 00:28:25 [logger.py:42] Received request cmpl-0fbb48b74acd4690bebcc141aed5874c-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:28:25 [async_llm.py:261] Added request cmpl-0fbb48b74acd4690bebcc141aed5874c-0.
INFO 03-02 00:28:26 [logger.py:42] Received request cmpl-77173ad2658346bfab6889cdfd7f691b-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:28:26 [async_llm.py:261] Added request cmpl-77173ad2658346bfab6889cdfd7f691b-0.
INFO 03-02 00:28:27 [logger.py:42] Received request cmpl-f82f67c05048444dbaaaefd00a6ba21a-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:28:27 [async_llm.py:261] Added request cmpl-f82f67c05048444dbaaaefd00a6ba21a-0.
INFO 03-02 00:28:28 [loggers.py:116] Engine 000: Avg prompt throughput: 27.9 tokens/s, Avg generation throughput: 4.5 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.6%
INFO 03-02 00:28:28 [logger.py:42] Received request cmpl-b498dca9bf554eae9d9217a68c7ef1af-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:28:28 [async_llm.py:261] Added request cmpl-b498dca9bf554eae9d9217a68c7ef1af-0.
INFO 03-02 00:28:29 [logger.py:42] Received request cmpl-7e174513aae045829cc3ca30f2561df2-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:28:29 [async_llm.py:261] Added request cmpl-7e174513aae045829cc3ca30f2561df2-0.
INFO 03-02 00:28:30 [logger.py:42] Received request cmpl-0077c406073a4dc8bc811c67b79faf51-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:28:30 [async_llm.py:261] Added request cmpl-0077c406073a4dc8bc811c67b79faf51-0.
INFO 03-02 00:28:32 [logger.py:42] Received request cmpl-750f201b1e6a473fa9a946029643fda2-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:28:32 [async_llm.py:261] Added request cmpl-750f201b1e6a473fa9a946029643fda2-0.
INFO 03-02 00:28:33 [logger.py:42] Received request cmpl-014d2b869b3245e6a43928aeb547e35c-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:28:33 [async_llm.py:261] Added request cmpl-014d2b869b3245e6a43928aeb547e35c-0.
INFO 03-02 00:28:34 [logger.py:42] Received request cmpl-0ea15e33f82e4ceb8f9c7fcd4a9c8ab9-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:28:34 [async_llm.py:261] Added request cmpl-0ea15e33f82e4ceb8f9c7fcd4a9c8ab9-0.
INFO 03-02 00:28:35 [logger.py:42] Received request cmpl-63b6d1cd0fe2432c90fc2d27676c9884-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:28:35 [async_llm.py:261] Added request cmpl-63b6d1cd0fe2432c90fc2d27676c9884-0.
INFO 03-02 00:28:36 [logger.py:42] Received request cmpl-89986d49019e40f09ad8525f300be18f-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:28:36 [async_llm.py:261] Added request cmpl-89986d49019e40f09ad8525f300be18f-0.
INFO 03-02 00:28:37 [logger.py:42] Received request cmpl-5288c902593c4fc898e6e94fa9683ff0-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:28:37 [async_llm.py:261] Added request cmpl-5288c902593c4fc898e6e94fa9683ff0-0.
INFO 03-02 00:28:38 [loggers.py:116] Engine 000: Avg prompt throughput: 27.9 tokens/s, Avg generation throughput: 4.5 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.6%
INFO 03-02 00:28:38 [logger.py:42] Received request cmpl-3c7613107d084503be232a2edbc35f79-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:28:38 [async_llm.py:261] Added request cmpl-3c7613107d084503be232a2edbc35f79-0.
INFO 03-02 00:28:40 [logger.py:42] Received request cmpl-02a9696b0c544e7db234241492cfb8d9-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:28:40 [async_llm.py:261] Added request cmpl-02a9696b0c544e7db234241492cfb8d9-0.
INFO 03-02 00:28:41 [logger.py:42] Received request cmpl-bbe477d8491144c79631de7d79d537ae-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:28:41 [async_llm.py:261] Added request cmpl-bbe477d8491144c79631de7d79d537ae-0.
INFO 03-02 00:28:42 [logger.py:42] Received request cmpl-c4c9123c82b24cae92ac2af16af00ed8-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:28:42 [async_llm.py:261] Added request cmpl-c4c9123c82b24cae92ac2af16af00ed8-0.
INFO 03-02 00:28:43 [logger.py:42] Received request cmpl-c62e5d68e90a4838bed57869556242ea-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:28:43 [async_llm.py:261] Added request cmpl-c62e5d68e90a4838bed57869556242ea-0.
INFO 03-02 00:28:44 [logger.py:42] Received request cmpl-8c6db81eb41940709d6a6accaced7c9e-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:28:44 [async_llm.py:261] Added request cmpl-8c6db81eb41940709d6a6accaced7c9e-0.
INFO 03-02 00:28:45 [logger.py:42] Received request cmpl-012f62e9a1f24fd0ac7379f654fda340-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:28:45 [async_llm.py:261] Added request cmpl-012f62e9a1f24fd0ac7379f654fda340-0.
INFO 03-02 00:28:47 [logger.py:42] Received request cmpl-4fbfe701344b4700aeb33a3f0a892f7f-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:28:47 [async_llm.py:261] Added request cmpl-4fbfe701344b4700aeb33a3f0a892f7f-0.
INFO 03-02 00:28:48 [loggers.py:116] Engine 000: Avg prompt throughput: 24.8 tokens/s, Avg generation throughput: 4.0 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.6%
INFO 03-02 00:28:48 [logger.py:42] Received request cmpl-100c45c02ecb4f358858e9ad60c2cf3d-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:28:48 [async_llm.py:261] Added request cmpl-100c45c02ecb4f358858e9ad60c2cf3d-0.
INFO 03-02 00:28:49 [logger.py:42] Received request cmpl-718104fafcd94bcfb2d351b4b172dda4-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:28:49 [async_llm.py:261] Added request cmpl-718104fafcd94bcfb2d351b4b172dda4-0.
INFO 03-02 00:28:50 [logger.py:42] Received request cmpl-05d6a2bcacc442f384f8c9e029a02fc3-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:28:50 [async_llm.py:261] Added request cmpl-05d6a2bcacc442f384f8c9e029a02fc3-0.
INFO 03-02 00:28:51 [logger.py:42] Received request cmpl-18a0f63abfa1469f88e9373a2fb3d7b5-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:28:51 [async_llm.py:261] Added request cmpl-18a0f63abfa1469f88e9373a2fb3d7b5-0.
INFO 03-02 00:28:52 [logger.py:42] Received request cmpl-7fca168d98fa47658626452d7526f9d1-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:28:52 [async_llm.py:261] Added request cmpl-7fca168d98fa47658626452d7526f9d1-0.
INFO 03-02 00:28:53 [logger.py:42] Received request cmpl-e182bd97883146debdbbf9b72dc14923-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:28:53 [async_llm.py:261] Added request cmpl-e182bd97883146debdbbf9b72dc14923-0.
INFO 03-02 00:28:55 [logger.py:42] Received request cmpl-0ec3dc78b7cf4e2f9b5f72bf5c23c99f-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:28:55 [async_llm.py:261] Added request cmpl-0ec3dc78b7cf4e2f9b5f72bf5c23c99f-0.
INFO 03-02 00:28:56 [logger.py:42] Received request cmpl-7fd25eaa1146481bbbbd76b201882be1-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:28:56 [async_llm.py:261] Added request cmpl-7fd25eaa1146481bbbbd76b201882be1-0.
INFO 03-02 00:28:57 [logger.py:42] Received request cmpl-86e6d89b72944d0cbfbd4cf95fb06e32-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:28:57 [async_llm.py:261] Added request cmpl-86e6d89b72944d0cbfbd4cf95fb06e32-0.
INFO 03-02 00:28:58 [loggers.py:116] Engine 000: Avg prompt throughput: 27.9 tokens/s, Avg generation throughput: 4.5 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.6%
INFO 03-02 00:28:58 [logger.py:42] Received request cmpl-3f1c0f94960749b29acab49b98aa41de-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:28:58 [async_llm.py:261] Added request cmpl-3f1c0f94960749b29acab49b98aa41de-0.
INFO 03-02 00:28:59 [logger.py:42] Received request cmpl-5f2dc74ee3804b63824da506f05ea897-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:28:59 [async_llm.py:261] Added request cmpl-5f2dc74ee3804b63824da506f05ea897-0.
INFO 03-02 00:29:00 [logger.py:42] Received request cmpl-a5b3919af6ba486fb46386aaed18253d-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:29:00 [async_llm.py:261] Added request cmpl-a5b3919af6ba486fb46386aaed18253d-0.
INFO 03-02 00:29:02 [logger.py:42] Received request cmpl-2892f0681c504ff18fa37a45c4fa4001-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:29:02 [async_llm.py:261] Added request cmpl-2892f0681c504ff18fa37a45c4fa4001-0.
INFO 03-02 00:29:03 [logger.py:42] Received request cmpl-5abb90160ed34fd6bc32662694083c3e-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:29:03 [async_llm.py:261] Added request cmpl-5abb90160ed34fd6bc32662694083c3e-0.
INFO 03-02 00:29:04 [logger.py:42] Received request cmpl-f0de6d86b7a14c1ebd01a65711fb96b8-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:29:04 [async_llm.py:261] Added request cmpl-f0de6d86b7a14c1ebd01a65711fb96b8-0.
INFO 03-02 00:29:05 [logger.py:42] Received request cmpl-5d7e47e5301a4f539dc8327d18339681-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:29:05 [async_llm.py:261] Added request cmpl-5d7e47e5301a4f539dc8327d18339681-0.
INFO 03-02 00:29:06 [logger.py:42] Received request cmpl-b84ca767f24c427d906f79ecd6379eec-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:29:06 [async_llm.py:261] Added request cmpl-b84ca767f24c427d906f79ecd6379eec-0.
INFO 03-02 00:29:07 [logger.py:42] Received request cmpl-0b81f714a3354d31bd43e6d1fcf7f2c3-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:29:07 [async_llm.py:261] Added request cmpl-0b81f714a3354d31bd43e6d1fcf7f2c3-0.
INFO 03-02 00:29:08 [loggers.py:116] Engine 000: Avg prompt throughput: 27.9 tokens/s, Avg generation throughput: 4.5 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.6%
INFO 03-02 00:29:08 [logger.py:42] Received request cmpl-bf767b45bda743c8b45cb2878e3cd648-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:29:08 [async_llm.py:261] Added request cmpl-bf767b45bda743c8b45cb2878e3cd648-0.
INFO 03-02 00:29:10 [logger.py:42] Received request cmpl-c5177951207d4dd3a53da8bc7f2113c2-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:29:10 [async_llm.py:261] Added request cmpl-c5177951207d4dd3a53da8bc7f2113c2-0.
INFO 03-02 00:29:11 [logger.py:42] Received request cmpl-9f93eb93b0374dfc907eceecf1248a01-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:29:11 [async_llm.py:261] Added request cmpl-9f93eb93b0374dfc907eceecf1248a01-0.
INFO 03-02 00:29:12 [logger.py:42] Received request cmpl-ff53a4b4d97b416da826543b371ff6de-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:29:12 [async_llm.py:261] Added request cmpl-ff53a4b4d97b416da826543b371ff6de-0.
INFO 03-02 00:29:13 [logger.py:42] Received request cmpl-a120888e7ca342aa9069d0de627313af-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:29:13 [async_llm.py:261] Added request cmpl-a120888e7ca342aa9069d0de627313af-0.
INFO 03-02 00:29:14 [logger.py:42] Received request cmpl-c75d58c560ab48b08a419d19ce7b92c0-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:29:14 [async_llm.py:261] Added request cmpl-c75d58c560ab48b08a419d19ce7b92c0-0.
INFO 03-02 00:29:15 [logger.py:42] Received request cmpl-335725e5afc5477f8783abdeaf9b90ff-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:29:15 [async_llm.py:261] Added request cmpl-335725e5afc5477f8783abdeaf9b90ff-0.
INFO 03-02 00:29:17 [logger.py:42] Received request cmpl-d96bde16acec46feb61c912a0634d824-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:29:17 [async_llm.py:261] Added request cmpl-d96bde16acec46feb61c912a0634d824-0.
INFO 03-02 00:29:18 [loggers.py:116] Engine 000: Avg prompt throughput: 24.8 tokens/s, Avg generation throughput: 4.0 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.6%
INFO 03-02 00:29:18 [logger.py:42] Received request cmpl-276d603fe1c846e7ad61326869f6001b-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:29:18 [async_llm.py:261] Added request cmpl-276d603fe1c846e7ad61326869f6001b-0.
INFO 03-02 00:29:19 [logger.py:42] Received request cmpl-f1ce00076e2b43559058d269e7c7503a-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:29:19 [async_llm.py:261] Added request cmpl-f1ce00076e2b43559058d269e7c7503a-0.
INFO 03-02 00:29:20 [logger.py:42] Received request cmpl-7f1a1cd76fa843e0b1a5d6b546fa47b3-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:29:20 [async_llm.py:261] Added request cmpl-7f1a1cd76fa843e0b1a5d6b546fa47b3-0.
INFO 03-02 00:29:21 [logger.py:42] Received request cmpl-05e315d9a4874c949bde731db0a8b052-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:29:21 [async_llm.py:261] Added request cmpl-05e315d9a4874c949bde731db0a8b052-0.
INFO 03-02 00:29:22 [logger.py:42] Received request cmpl-e46d805c962844e7af5a69204b1de511-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:29:22 [async_llm.py:261] Added request cmpl-e46d805c962844e7af5a69204b1de511-0.
INFO 03-02 00:29:23 [logger.py:42] Received request cmpl-36fb50edcf47424e83a8e5cd3fce350c-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:29:23 [async_llm.py:261] Added request cmpl-36fb50edcf47424e83a8e5cd3fce350c-0.
INFO 03-02 00:29:25 [logger.py:42] Received request cmpl-063b8a6cf4d949f09b83d17da61752c4-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:29:25 [async_llm.py:261] Added request cmpl-063b8a6cf4d949f09b83d17da61752c4-0.
INFO 03-02 00:29:26 [logger.py:42] Received request cmpl-1a8357d01af347fd83c4ca5a2c6514ee-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:29:26 [async_llm.py:261] Added request cmpl-1a8357d01af347fd83c4ca5a2c6514ee-0.
INFO 03-02 00:29:27 [logger.py:42] Received request cmpl-436c2f816d624775bd1ff2b4ad8c7f6f-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:29:27 [async_llm.py:261] Added request cmpl-436c2f816d624775bd1ff2b4ad8c7f6f-0.
INFO 03-02 00:29:28 [loggers.py:116] Engine 000: Avg prompt throughput: 27.9 tokens/s, Avg generation throughput: 4.5 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.6%
INFO 03-02 00:29:28 [logger.py:42] Received request cmpl-899dc57a99814da4ba2df0bd9f819ed2-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:29:28 [async_llm.py:261] Added request cmpl-899dc57a99814da4ba2df0bd9f819ed2-0.
INFO 03-02 00:29:29 [logger.py:42] Received request cmpl-387a49bfc4804b6a80bbfe52a117a757-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:29:29 [async_llm.py:261] Added request cmpl-387a49bfc4804b6a80bbfe52a117a757-0.
INFO 03-02 00:29:30 [logger.py:42] Received request cmpl-11f431e78b7e42548a8dc6b14317ca78-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:29:30 [async_llm.py:261] Added request cmpl-11f431e78b7e42548a8dc6b14317ca78-0.
INFO 03-02 00:29:32 [logger.py:42] Received request cmpl-cf94c1201f1a45b0ab73ecf694e16d5a-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:29:32 [async_llm.py:261] Added request cmpl-cf94c1201f1a45b0ab73ecf694e16d5a-0.
INFO 03-02 00:29:33 [logger.py:42] Received request cmpl-0bddd2c6925342b4a3ac58755ec883ff-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:29:33 [async_llm.py:261] Added request cmpl-0bddd2c6925342b4a3ac58755ec883ff-0.
INFO 03-02 00:29:34 [logger.py:42] Received request cmpl-66c0daf8ffaa415dbbe3568c5a7ab3f9-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:29:34 [async_llm.py:261] Added request cmpl-66c0daf8ffaa415dbbe3568c5a7ab3f9-0.
INFO 03-02 00:29:35 [logger.py:42] Received request cmpl-8b13d769dbd9442baa351a5e1563cf52-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:29:35 [async_llm.py:261] Added request cmpl-8b13d769dbd9442baa351a5e1563cf52-0.
INFO 03-02 00:29:36 [logger.py:42] Received request cmpl-f7c31cc7b02f4755b2476e9199563e29-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:29:36 [async_llm.py:261] Added request cmpl-f7c31cc7b02f4755b2476e9199563e29-0.
INFO 03-02 00:29:37 [logger.py:42] Received request cmpl-584679d5958e4303b9731b0e13a6cab2-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:29:37 [async_llm.py:261] Added request cmpl-584679d5958e4303b9731b0e13a6cab2-0.
INFO 03-02 00:29:38 [loggers.py:116] Engine 000: Avg prompt throughput: 27.9 tokens/s, Avg generation throughput: 4.5 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.6%
INFO 03-02 00:29:38 [logger.py:42] Received request cmpl-836a4a3621074dc08404b61869d770eb-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:29:38 [async_llm.py:261] Added request cmpl-836a4a3621074dc08404b61869d770eb-0.
INFO 03-02 00:29:40 [logger.py:42] Received request cmpl-89633cb970b14ec18cf12964f59f1f3b-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:29:40 [async_llm.py:261] Added request cmpl-89633cb970b14ec18cf12964f59f1f3b-0.
INFO 03-02 00:29:41 [logger.py:42] Received request cmpl-a00c8c6ff3754fafbf7c61ab713dee06-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:29:41 [async_llm.py:261] Added request cmpl-a00c8c6ff3754fafbf7c61ab713dee06-0.
INFO 03-02 00:29:42 [logger.py:42] Received request cmpl-71695a416d23411c9bca171ae39a1b91-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:29:42 [async_llm.py:261] Added request cmpl-71695a416d23411c9bca171ae39a1b91-0.
INFO 03-02 00:29:43 [logger.py:42] Received request cmpl-ee7af03877cb44e1add63ce64311ac1c-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:29:43 [async_llm.py:261] Added request cmpl-ee7af03877cb44e1add63ce64311ac1c-0.
INFO 03-02 00:29:44 [logger.py:42] Received request cmpl-c54eeb4b1179470ea6edff22204161c4-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:29:44 [async_llm.py:261] Added request cmpl-c54eeb4b1179470ea6edff22204161c4-0.
INFO 03-02 00:29:45 [logger.py:42] Received request cmpl-85c0a2c9039b4a8996cf2f60a86c2e4b-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:29:45 [async_llm.py:261] Added request cmpl-85c0a2c9039b4a8996cf2f60a86c2e4b-0.
INFO 03-02 00:29:47 [logger.py:42] Received request cmpl-0d920fb39b504ec1abd16017ba502f40-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:29:47 [async_llm.py:261] Added request cmpl-0d920fb39b504ec1abd16017ba502f40-0.
INFO 03-02 00:29:48 [loggers.py:116] Engine 000: Avg prompt throughput: 24.8 tokens/s, Avg generation throughput: 4.0 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.6%
INFO 03-02 00:29:48 [logger.py:42] Received request cmpl-17d03fce1114411083ca64ca713a45fb-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:29:48 [async_llm.py:261] Added request cmpl-17d03fce1114411083ca64ca713a45fb-0.
INFO 03-02 00:29:49 [logger.py:42] Received request cmpl-46b10d6c0ba647eaa4af70a783a7fa0b-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:29:49 [async_llm.py:261] Added request cmpl-46b10d6c0ba647eaa4af70a783a7fa0b-0.
INFO 03-02 00:29:50 [logger.py:42] Received request cmpl-a42936730bcd4b318521093f4f7bc12d-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:29:50 [async_llm.py:261] Added request cmpl-a42936730bcd4b318521093f4f7bc12d-0.
INFO 03-02 00:29:51 [logger.py:42] Received request cmpl-7ce5e336ad3148f488d65015214a8301-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:29:51 [async_llm.py:261] Added request cmpl-7ce5e336ad3148f488d65015214a8301-0.
INFO 03-02 00:29:52 [logger.py:42] Received request cmpl-b901b2ba513449d78dbc10d1c7aff447-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:29:52 [async_llm.py:261] Added request cmpl-b901b2ba513449d78dbc10d1c7aff447-0.
INFO 03-02 00:29:53 [logger.py:42] Received request cmpl-04efe57902b3487ab8e630fcec806e5e-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:29:53 [async_llm.py:261] Added request cmpl-04efe57902b3487ab8e630fcec806e5e-0.
INFO 03-02 00:29:55 [logger.py:42] Received request cmpl-b14a232c3ebc4185a8025b6846827e6a-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:29:55 [async_llm.py:261] Added request cmpl-b14a232c3ebc4185a8025b6846827e6a-0.
INFO 03-02 00:29:56 [logger.py:42] Received request cmpl-fd52c62838f847cc866d2ba70cba7c8b-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:29:56 [async_llm.py:261] Added request cmpl-fd52c62838f847cc866d2ba70cba7c8b-0.
INFO 03-02 00:29:57 [logger.py:42] Received request cmpl-c145daad89ea4d68b408e6931fcc10a8-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:29:57 [async_llm.py:261] Added request cmpl-c145daad89ea4d68b408e6931fcc10a8-0.
INFO 03-02 00:29:58 [loggers.py:116] Engine 000: Avg prompt throughput: 27.9 tokens/s, Avg generation throughput: 4.5 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.6%
INFO 03-02 00:29:58 [logger.py:42] Received request cmpl-c598b18671914b0da3a93ea0baacf892-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:29:58 [async_llm.py:261] Added request cmpl-c598b18671914b0da3a93ea0baacf892-0.
INFO 03-02 00:29:59 [logger.py:42] Received request cmpl-354196026e2e4163bc54e3f755b9cdb0-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:29:59 [async_llm.py:261] Added request cmpl-354196026e2e4163bc54e3f755b9cdb0-0.
INFO 03-02 00:30:00 [logger.py:42] Received request cmpl-341cd36d9b6d491ea6a312d56e594e99-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:30:00 [async_llm.py:261] Added request cmpl-341cd36d9b6d491ea6a312d56e594e99-0.
INFO 03-02 00:30:02 [logger.py:42] Received request cmpl-063a8dea0c784cc4aac9ea86927a2d0f-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:30:02 [async_llm.py:261] Added request cmpl-063a8dea0c784cc4aac9ea86927a2d0f-0.
INFO 03-02 00:30:03 [logger.py:42] Received request cmpl-5a744a7d4cdc4340a0e720e233e915db-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:30:03 [async_llm.py:261] Added request cmpl-5a744a7d4cdc4340a0e720e233e915db-0.
INFO 03-02 00:30:04 [logger.py:42] Received request cmpl-772677e13a204b42910cc08a7f9f076f-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:30:04 [async_llm.py:261] Added request cmpl-772677e13a204b42910cc08a7f9f076f-0.
INFO 03-02 00:30:05 [logger.py:42] Received request cmpl-c2348b12d3c644378d7862d1351c277a-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:30:05 [async_llm.py:261] Added request cmpl-c2348b12d3c644378d7862d1351c277a-0.
INFO 03-02 00:30:06 [logger.py:42] Received request cmpl-ed796a9d72e047199946e7baf277ff46-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:30:06 [async_llm.py:261] Added request cmpl-ed796a9d72e047199946e7baf277ff46-0.
INFO 03-02 00:30:07 [logger.py:42] Received request cmpl-38067e891e8048c0a8ac4dce9821497c-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:30:07 [async_llm.py:261] Added request cmpl-38067e891e8048c0a8ac4dce9821497c-0.
INFO 03-02 00:30:08 [loggers.py:116] Engine 000: Avg prompt throughput: 27.9 tokens/s, Avg generation throughput: 4.5 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.6%
INFO 03-02 00:30:08 [logger.py:42] Received request cmpl-961548a974584af4b5b3e56cfe27b937-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:30:08 [async_llm.py:261] Added request cmpl-961548a974584af4b5b3e56cfe27b937-0.
INFO 03-02 00:30:10 [logger.py:42] Received request cmpl-53cfbe4d2e5448aa9b66175d93ba743c-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:30:10 [async_llm.py:261] Added request cmpl-53cfbe4d2e5448aa9b66175d93ba743c-0.
INFO 03-02 00:30:11 [logger.py:42] Received request cmpl-cd4ab2f5954749a7909fcbd7bf2bb532-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:30:11 [async_llm.py:261] Added request cmpl-cd4ab2f5954749a7909fcbd7bf2bb532-0.
INFO 03-02 00:30:12 [logger.py:42] Received request cmpl-5dc276f1460d4a45ba13a2aa5b8d49c2-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:30:12 [async_llm.py:261] Added request cmpl-5dc276f1460d4a45ba13a2aa5b8d49c2-0.
INFO 03-02 00:30:13 [logger.py:42] Received request cmpl-295a77ad29274a1aadcab3be97aafdd0-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:30:13 [async_llm.py:261] Added request cmpl-295a77ad29274a1aadcab3be97aafdd0-0.
INFO 03-02 00:30:14 [logger.py:42] Received request cmpl-c65a135d41b24dba8e722b00497f4a6c-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:30:14 [async_llm.py:261] Added request cmpl-c65a135d41b24dba8e722b00497f4a6c-0.
INFO 03-02 00:30:15 [logger.py:42] Received request cmpl-3379afc8e31f4a12936f90f721a84fd6-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:30:15 [async_llm.py:261] Added request cmpl-3379afc8e31f4a12936f90f721a84fd6-0.
INFO 03-02 00:30:16 [logger.py:42] Received request cmpl-2f70252295df45b1a895d730210702c7-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:30:16 [async_llm.py:261] Added request cmpl-2f70252295df45b1a895d730210702c7-0.
INFO 03-02 00:30:18 [loggers.py:116] Engine 000: Avg prompt throughput: 24.8 tokens/s, Avg generation throughput: 4.0 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.6%
INFO 03-02 00:30:18 [logger.py:42] Received request cmpl-f4360ebfded94438ae42c2b033eb985d-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:30:18 [async_llm.py:261] Added request cmpl-f4360ebfded94438ae42c2b033eb985d-0.
INFO 03-02 00:30:19 [logger.py:42] Received request cmpl-63a1c7be589b4a74990c3b15e69b373f-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:30:19 [async_llm.py:261] Added request cmpl-63a1c7be589b4a74990c3b15e69b373f-0.
INFO 03-02 00:30:20 [logger.py:42] Received request cmpl-30a43e4e66794aeabc728d43cee198d2-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:30:20 [async_llm.py:261] Added request cmpl-30a43e4e66794aeabc728d43cee198d2-0.
INFO 03-02 00:30:21 [logger.py:42] Received request cmpl-e8a301f373654d778075de932a7a36be-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:30:21 [async_llm.py:261] Added request cmpl-e8a301f373654d778075de932a7a36be-0.
INFO 03-02 00:30:22 [logger.py:42] Received request cmpl-e379207d292346a0aef40046a294916c-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:30:22 [async_llm.py:261] Added request cmpl-e379207d292346a0aef40046a294916c-0.
INFO 03-02 00:30:23 [logger.py:42] Received request cmpl-3f7e0ae91af947e087dc1205dba74677-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:30:23 [async_llm.py:261] Added request cmpl-3f7e0ae91af947e087dc1205dba74677-0.
INFO 03-02 00:30:25 [logger.py:42] Received request cmpl-98d1aaf5412e4031a5613eb0713aad18-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:30:25 [async_llm.py:261] Added request cmpl-98d1aaf5412e4031a5613eb0713aad18-0.
INFO 03-02 00:30:26 [logger.py:42] Received request cmpl-681a5b6d01634a559facbdf1a4879cc8-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:30:26 [async_llm.py:261] Added request cmpl-681a5b6d01634a559facbdf1a4879cc8-0.
INFO 03-02 00:30:27 [logger.py:42] Received request cmpl-1f8592580ab84e7792f5caaa31bad4c2-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:30:27 [async_llm.py:261] Added request cmpl-1f8592580ab84e7792f5caaa31bad4c2-0.
INFO 03-02 00:30:28 [loggers.py:116] Engine 000: Avg prompt throughput: 27.9 tokens/s, Avg generation throughput: 4.5 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.6%
INFO 03-02 00:30:28 [logger.py:42] Received request cmpl-6b921866ffb14803b087685268bd344b-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:30:28 [async_llm.py:261] Added request cmpl-6b921866ffb14803b087685268bd344b-0.
INFO 03-02 00:30:29 [logger.py:42] Received request cmpl-dbf09838c99a4385a65943edc7d83b48-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:30:29 [async_llm.py:261] Added request cmpl-dbf09838c99a4385a65943edc7d83b48-0.
INFO 03-02 00:30:30 [logger.py:42] Received request cmpl-149c5b0e68d44cbe8e08e2d980ace4bd-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:30:30 [async_llm.py:261] Added request cmpl-149c5b0e68d44cbe8e08e2d980ace4bd-0.
INFO 03-02 00:30:31 [logger.py:42] Received request cmpl-d777f5c6a2164bb3bc7d012d10697ef2-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:30:31 [async_llm.py:261] Added request cmpl-d777f5c6a2164bb3bc7d012d10697ef2-0.
INFO 03-02 00:30:33 [logger.py:42] Received request cmpl-afc75e8390524dada4727d5ee363dae6-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:30:33 [async_llm.py:261] Added request cmpl-afc75e8390524dada4727d5ee363dae6-0.
INFO 03-02 00:30:34 [logger.py:42] Received request cmpl-f389b32507804acabc11dbbf552b4549-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:30:34 [async_llm.py:261] Added request cmpl-f389b32507804acabc11dbbf552b4549-0.
INFO 03-02 00:30:35 [logger.py:42] Received request cmpl-55c588c9eda048d3a62e4da1e441f315-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:30:35 [async_llm.py:261] Added request cmpl-55c588c9eda048d3a62e4da1e441f315-0.
INFO 03-02 00:30:36 [logger.py:42] Received request cmpl-b186c7155f5e4c4db5e1bb327758ccbe-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:30:36 [async_llm.py:261] Added request cmpl-b186c7155f5e4c4db5e1bb327758ccbe-0.
INFO 03-02 00:30:37 [logger.py:42] Received request cmpl-1f3a326d84694eb29f85ee422e6913b8-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:30:37 [async_llm.py:261] Added request cmpl-1f3a326d84694eb29f85ee422e6913b8-0.
INFO 03-02 00:30:38 [loggers.py:116] Engine 000: Avg prompt throughput: 27.9 tokens/s, Avg generation throughput: 4.5 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.6%
INFO 03-02 00:30:38 [logger.py:42] Received request cmpl-e6aedb1977c24678b33eb833d373fc84-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:30:38 [async_llm.py:261] Added request cmpl-e6aedb1977c24678b33eb833d373fc84-0.
INFO 03-02 00:30:40 [logger.py:42] Received request cmpl-00619b1092574d21981a4e4ea17d507a-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:30:40 [async_llm.py:261] Added request cmpl-00619b1092574d21981a4e4ea17d507a-0.
INFO 03-02 00:30:41 [logger.py:42] Received request cmpl-68b6d7e243294e72bb5d168b04605751-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:30:41 [async_llm.py:261] Added request cmpl-68b6d7e243294e72bb5d168b04605751-0.
INFO 03-02 00:30:42 [logger.py:42] Received request cmpl-3066275966614b3e9f3937e7bc6c5d99-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:30:42 [async_llm.py:261] Added request cmpl-3066275966614b3e9f3937e7bc6c5d99-0.
INFO 03-02 00:30:43 [logger.py:42] Received request cmpl-46576bae451c46bca8e338781e15cd15-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:30:43 [async_llm.py:261] Added request cmpl-46576bae451c46bca8e338781e15cd15-0.
INFO 03-02 00:30:44 [logger.py:42] Received request cmpl-ab28d6a2946948ac976052e8754ec676-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:30:44 [async_llm.py:261] Added request cmpl-ab28d6a2946948ac976052e8754ec676-0.
INFO 03-02 00:30:45 [logger.py:42] Received request cmpl-7f61fc301eca42b393247807183dd6bf-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:30:45 [async_llm.py:261] Added request cmpl-7f61fc301eca42b393247807183dd6bf-0.
INFO 03-02 00:30:46 [logger.py:42] Received request cmpl-47a708759563404e91596483241c9896-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:30:46 [async_llm.py:261] Added request cmpl-47a708759563404e91596483241c9896-0.
INFO 03-02 00:30:48 [loggers.py:116] Engine 000: Avg prompt throughput: 24.8 tokens/s, Avg generation throughput: 4.0 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.6%
INFO 03-02 00:30:48 [logger.py:42] Received request cmpl-14a0143b03b147ebbed8d842a72b4a15-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:30:48 [async_llm.py:261] Added request cmpl-14a0143b03b147ebbed8d842a72b4a15-0.
INFO 03-02 00:30:49 [logger.py:42] Received request cmpl-6b1a67c43e2b447f9d402bb77d3301b0-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:30:49 [async_llm.py:261] Added request cmpl-6b1a67c43e2b447f9d402bb77d3301b0-0.
INFO 03-02 00:30:50 [logger.py:42] Received request cmpl-646676d2aeff459a902ebc3bfd389ac3-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:30:50 [async_llm.py:261] Added request cmpl-646676d2aeff459a902ebc3bfd389ac3-0.
INFO 03-02 00:30:51 [logger.py:42] Received request cmpl-97b327b4749f485096064813f6aba93c-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:30:51 [async_llm.py:261] Added request cmpl-97b327b4749f485096064813f6aba93c-0.
INFO 03-02 00:30:52 [logger.py:42] Received request cmpl-cc6ce65c13064977b40f3cc476655f85-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:30:52 [async_llm.py:261] Added request cmpl-cc6ce65c13064977b40f3cc476655f85-0.
INFO 03-02 00:30:53 [logger.py:42] Received request cmpl-a9049ee64f99417195994ce33523c150-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:30:53 [async_llm.py:261] Added request cmpl-a9049ee64f99417195994ce33523c150-0.
INFO 03-02 00:30:55 [logger.py:42] Received request cmpl-5fec0c46bfbd4bd3a866627033279fa7-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:30:55 [async_llm.py:261] Added request cmpl-5fec0c46bfbd4bd3a866627033279fa7-0.
INFO 03-02 00:30:56 [logger.py:42] Received request cmpl-f32a8a87db0b4086a9cdbe683bbac272-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:30:56 [async_llm.py:261] Added request cmpl-f32a8a87db0b4086a9cdbe683bbac272-0.
INFO 03-02 00:30:57 [logger.py:42] Received request cmpl-4dab140efe2e4f898edf7b15a8a9a178-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:30:57 [async_llm.py:261] Added request cmpl-4dab140efe2e4f898edf7b15a8a9a178-0.
INFO 03-02 00:30:58 [loggers.py:116] Engine 000: Avg prompt throughput: 27.9 tokens/s, Avg generation throughput: 4.5 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.6%
INFO 03-02 00:30:58 [logger.py:42] Received request cmpl-b73e458157bb47f39e92689e36991cfe-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:30:58 [async_llm.py:261] Added request cmpl-b73e458157bb47f39e92689e36991cfe-0.
INFO 03-02 00:30:59 [logger.py:42] Received request cmpl-ff55a7731ba8426086ad73408b7290c0-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:30:59 [async_llm.py:261] Added request cmpl-ff55a7731ba8426086ad73408b7290c0-0.
INFO 03-02 00:31:00 [logger.py:42] Received request cmpl-0069fa9380d84043be256c66f5caa048-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:31:00 [async_llm.py:261] Added request cmpl-0069fa9380d84043be256c66f5caa048-0.
INFO 03-02 00:31:01 [logger.py:42] Received request cmpl-2b02935d0f1142068038132cbd70f879-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:31:01 [async_llm.py:261] Added request cmpl-2b02935d0f1142068038132cbd70f879-0.
INFO 03-02 00:31:03 [logger.py:42] Received request cmpl-d902c45f89c24bd28087b8ccc87432ee-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:31:03 [async_llm.py:261] Added request cmpl-d902c45f89c24bd28087b8ccc87432ee-0.
INFO 03-02 00:31:04 [logger.py:42] Received request cmpl-ccb40843c9cf4418918d09922b94949d-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:31:04 [async_llm.py:261] Added request cmpl-ccb40843c9cf4418918d09922b94949d-0.
INFO 03-02 00:31:05 [logger.py:42] Received request cmpl-e919e657728048ff82629ff4ab3306f1-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:31:05 [async_llm.py:261] Added request cmpl-e919e657728048ff82629ff4ab3306f1-0.
INFO 03-02 00:31:06 [logger.py:42] Received request cmpl-81203b2a02584c6ba15b84c17dcc6668-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:31:06 [async_llm.py:261] Added request cmpl-81203b2a02584c6ba15b84c17dcc6668-0.
INFO 03-02 00:31:07 [logger.py:42] Received request cmpl-4bfc4262628b48a4bcd036835a0ca4af-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:31:07 [async_llm.py:261] Added request cmpl-4bfc4262628b48a4bcd036835a0ca4af-0.
INFO 03-02 00:31:08 [loggers.py:116] Engine 000: Avg prompt throughput: 27.9 tokens/s, Avg generation throughput: 4.5 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.6%
INFO 03-02 00:31:08 [logger.py:42] Received request cmpl-374da11855054615a48214c0ec684ba1-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:31:08 [async_llm.py:261] Added request cmpl-374da11855054615a48214c0ec684ba1-0.
INFO 03-02 00:31:10 [logger.py:42] Received request cmpl-1cec2c0ed4864e5187a86fa52d0b38c1-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:31:10 [async_llm.py:261] Added request cmpl-1cec2c0ed4864e5187a86fa52d0b38c1-0.
INFO 03-02 00:31:11 [logger.py:42] Received request cmpl-76cd2443cda4464fa6254edfa4db5e74-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:31:11 [async_llm.py:261] Added request cmpl-76cd2443cda4464fa6254edfa4db5e74-0.
INFO 03-02 00:31:12 [logger.py:42] Received request cmpl-d7b79466d821479fb211c1afa9663285-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:31:12 [async_llm.py:261] Added request cmpl-d7b79466d821479fb211c1afa9663285-0.
INFO 03-02 00:31:13 [logger.py:42] Received request cmpl-4d5bd3bb6fe34d63916e837f914a9779-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:31:13 [async_llm.py:261] Added request cmpl-4d5bd3bb6fe34d63916e837f914a9779-0.
INFO 03-02 00:31:14 [logger.py:42] Received request cmpl-4f6c79619c1c4774b20078029b68b340-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:31:14 [async_llm.py:261] Added request cmpl-4f6c79619c1c4774b20078029b68b340-0.
INFO 03-02 00:31:15 [logger.py:42] Received request cmpl-d6e7c231fc1f418794ef59627e2954a3-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:31:15 [async_llm.py:261] Added request cmpl-d6e7c231fc1f418794ef59627e2954a3-0.
INFO 03-02 00:31:16 [logger.py:42] Received request cmpl-a70437e91e1b41fc9d106dced68707d4-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:31:16 [async_llm.py:261] Added request cmpl-a70437e91e1b41fc9d106dced68707d4-0.
INFO 03-02 00:31:18 [logger.py:42] Received request cmpl-74510c1620814ece8ec8e04c1566cdec-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:31:18 [async_llm.py:261] Added request cmpl-74510c1620814ece8ec8e04c1566cdec-0.
INFO 03-02 00:31:18 [loggers.py:116] Engine 000: Avg prompt throughput: 24.8 tokens/s, Avg generation throughput: 4.0 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.6%
INFO 03-02 00:31:19 [logger.py:42] Received request cmpl-1f781aed470349449ed491a6950b365f-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:31:19 [async_llm.py:261] Added request cmpl-1f781aed470349449ed491a6950b365f-0.
INFO 03-02 00:31:20 [logger.py:42] Received request cmpl-1adfafa797ce4ab3bfa59a5261da4c2e-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:31:20 [async_llm.py:261] Added request cmpl-1adfafa797ce4ab3bfa59a5261da4c2e-0.
INFO 03-02 00:31:21 [logger.py:42] Received request cmpl-e46227da50034c15be819ba05549124f-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:31:21 [async_llm.py:261] Added request cmpl-e46227da50034c15be819ba05549124f-0.
INFO 03-02 00:31:22 [logger.py:42] Received request cmpl-d372056cf6224990bb76d50e6b479b42-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:31:22 [async_llm.py:261] Added request cmpl-d372056cf6224990bb76d50e6b479b42-0.
INFO 03-02 00:31:23 [logger.py:42] Received request cmpl-fdbf4ac1b7cd43d0a02cb92efac11634-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:31:23 [async_llm.py:261] Added request cmpl-fdbf4ac1b7cd43d0a02cb92efac11634-0.
INFO 03-02 00:31:24 [logger.py:42] Received request cmpl-2e4b908be6fc4bba8952420045ccc818-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:31:24 [async_llm.py:261] Added request cmpl-2e4b908be6fc4bba8952420045ccc818-0.
INFO 03-02 00:31:26 [logger.py:42] Received request cmpl-585ce09e98384c71b7eb36736483d6cc-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:31:26 [async_llm.py:261] Added request cmpl-585ce09e98384c71b7eb36736483d6cc-0.
INFO 03-02 00:31:27 [logger.py:42] Received request cmpl-5c6fd51ead564c58a5495625c0ffbf70-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:31:27 [async_llm.py:261] Added request cmpl-5c6fd51ead564c58a5495625c0ffbf70-0.
INFO 03-02 00:31:28 [loggers.py:116] Engine 000: Avg prompt throughput: 27.9 tokens/s, Avg generation throughput: 4.5 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.6%
INFO 03-02 00:31:28 [logger.py:42] Received request cmpl-d85c0bb36530465397de169ce25c33fc-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:31:28 [async_llm.py:261] Added request cmpl-d85c0bb36530465397de169ce25c33fc-0.
INFO 03-02 00:31:29 [logger.py:42] Received request cmpl-22eee70a320d4611bd6f3c9f217a7f00-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:31:29 [async_llm.py:261] Added request cmpl-22eee70a320d4611bd6f3c9f217a7f00-0.
INFO 03-02 00:31:30 [logger.py:42] Received request cmpl-c678e0960b834540a0ae904deeea0aad-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:31:30 [async_llm.py:261] Added request cmpl-c678e0960b834540a0ae904deeea0aad-0.
INFO 03-02 00:31:31 [logger.py:42] Received request cmpl-3a2d247990c241e2b2c27320054e3924-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:31:31 [async_llm.py:261] Added request cmpl-3a2d247990c241e2b2c27320054e3924-0.
INFO 03-02 00:31:33 [logger.py:42] Received request cmpl-e8d2a01d67af4053b9b800457d330308-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:31:33 [async_llm.py:261] Added request cmpl-e8d2a01d67af4053b9b800457d330308-0.
INFO 03-02 00:31:34 [logger.py:42] Received request cmpl-58ef3481047e402eaebc759ce1e749ac-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:31:34 [async_llm.py:261] Added request cmpl-58ef3481047e402eaebc759ce1e749ac-0.
INFO 03-02 00:31:35 [logger.py:42] Received request cmpl-a7db51be2491423f9d373c83719f7ab0-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:31:35 [async_llm.py:261] Added request cmpl-a7db51be2491423f9d373c83719f7ab0-0.
INFO 03-02 00:31:36 [logger.py:42] Received request cmpl-8b07b1573c4648c0863a5a34780f2f45-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:31:36 [async_llm.py:261] Added request cmpl-8b07b1573c4648c0863a5a34780f2f45-0.
INFO 03-02 00:31:37 [logger.py:42] Received request cmpl-b439788e1a3f4c8183e704ba96cdc11c-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:31:37 [async_llm.py:261] Added request cmpl-b439788e1a3f4c8183e704ba96cdc11c-0.
INFO 03-02 00:31:38 [loggers.py:116] Engine 000: Avg prompt throughput: 27.9 tokens/s, Avg generation throughput: 4.5 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.6%
INFO 03-02 00:31:38 [logger.py:42] Received request cmpl-d6df9e5c854442d6a898e264f67dd2f8-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:31:38 [async_llm.py:261] Added request cmpl-d6df9e5c854442d6a898e264f67dd2f8-0.
INFO 03-02 00:31:39 [logger.py:42] Received request cmpl-268428adc55f4625ae1ad95fd666ee38-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:31:39 [async_llm.py:261] Added request cmpl-268428adc55f4625ae1ad95fd666ee38-0.
INFO 03-02 00:31:41 [logger.py:42] Received request cmpl-b276329249034f5a856a38c8cc1f4654-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:31:41 [async_llm.py:261] Added request cmpl-b276329249034f5a856a38c8cc1f4654-0.
INFO 03-02 00:31:42 [logger.py:42] Received request cmpl-6a2a0c804f62443ab370cb32139fae22-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:31:42 [async_llm.py:261] Added request cmpl-6a2a0c804f62443ab370cb32139fae22-0.
INFO 03-02 00:31:43 [logger.py:42] Received request cmpl-7df080b8c70445a68fea706b53c1fc57-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:31:43 [async_llm.py:261] Added request cmpl-7df080b8c70445a68fea706b53c1fc57-0.
INFO 03-02 00:31:44 [logger.py:42] Received request cmpl-24720df19c5e4c2d924a983aa4307eed-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:31:44 [async_llm.py:261] Added request cmpl-24720df19c5e4c2d924a983aa4307eed-0.
INFO 03-02 00:31:45 [logger.py:42] Received request cmpl-6ed97cee75a2484b8673de51f2d44cf0-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:31:45 [async_llm.py:261] Added request cmpl-6ed97cee75a2484b8673de51f2d44cf0-0.
INFO 03-02 00:31:46 [logger.py:42] Received request cmpl-ca3291a6c074476b878cfd53068ce569-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:31:46 [async_llm.py:261] Added request cmpl-ca3291a6c074476b878cfd53068ce569-0.
INFO 03-02 00:31:48 [logger.py:42] Received request cmpl-9f6329c101fd4f9a973c245fda144ccd-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:31:48 [async_llm.py:261] Added request cmpl-9f6329c101fd4f9a973c245fda144ccd-0.
INFO 03-02 00:31:48 [loggers.py:116] Engine 000: Avg prompt throughput: 27.9 tokens/s, Avg generation throughput: 4.1 tokens/s, Running: 1 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.3%, Prefix cache hit rate: 51.6%
INFO 03-02 00:31:49 [logger.py:42] Received request cmpl-f93fd0c3ab3543b79ca8127d69fde7c8-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:31:49 [async_llm.py:261] Added request cmpl-f93fd0c3ab3543b79ca8127d69fde7c8-0.
INFO 03-02 00:31:50 [logger.py:42] Received request cmpl-4b84159d7f64445fa1170c7a03de2016-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:31:50 [async_llm.py:261] Added request cmpl-4b84159d7f64445fa1170c7a03de2016-0.
INFO 03-02 00:31:51 [logger.py:42] Received request cmpl-a4576d746b914569a2a52850f999b802-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:31:51 [async_llm.py:261] Added request cmpl-a4576d746b914569a2a52850f999b802-0.
INFO 03-02 00:31:52 [logger.py:42] Received request cmpl-1631c06c994f4d1bb9a120b3a44f0164-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:31:52 [async_llm.py:261] Added request cmpl-1631c06c994f4d1bb9a120b3a44f0164-0.
INFO 03-02 00:31:53 [logger.py:42] Received request cmpl-0f572f21107b43a9879fde16f70914de-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:31:53 [async_llm.py:261] Added request cmpl-0f572f21107b43a9879fde16f70914de-0.
INFO 03-02 00:31:54 [logger.py:42] Received request cmpl-561e84da94eb467cb83da299cb15fd95-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:31:54 [async_llm.py:261] Added request cmpl-561e84da94eb467cb83da299cb15fd95-0.
INFO 03-02 00:31:56 [logger.py:42] Received request cmpl-74aa6f700b0548d0b4b37c46c44beee0-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:31:56 [async_llm.py:261] Added request cmpl-74aa6f700b0548d0b4b37c46c44beee0-0.
INFO 03-02 00:31:57 [logger.py:42] Received request cmpl-d80cd0ba20734a03a544563022916ddb-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:31:57 [async_llm.py:261] Added request cmpl-d80cd0ba20734a03a544563022916ddb-0.
INFO 03-02 00:31:58 [loggers.py:116] Engine 000: Avg prompt throughput: 24.8 tokens/s, Avg generation throughput: 4.4 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.6%
INFO 03-02 00:31:58 [logger.py:42] Received request cmpl-f3da8714cd16407e8317b52e2ffa76b1-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:31:58 [async_llm.py:261] Added request cmpl-f3da8714cd16407e8317b52e2ffa76b1-0.
INFO 03-02 00:31:59 [logger.py:42] Received request cmpl-32a5107134564c858dd68e35a7720251-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:31:59 [async_llm.py:261] Added request cmpl-32a5107134564c858dd68e35a7720251-0.
INFO 03-02 00:32:00 [logger.py:42] Received request cmpl-00302a7edf54485d8e3fc9e2e0126368-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:32:00 [async_llm.py:261] Added request cmpl-00302a7edf54485d8e3fc9e2e0126368-0.
INFO 03-02 00:32:01 [logger.py:42] Received request cmpl-645b4853b40d4053be9c01ada351a7f2-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:32:01 [async_llm.py:261] Added request cmpl-645b4853b40d4053be9c01ada351a7f2-0.
INFO 03-02 00:32:03 [logger.py:42] Received request cmpl-9d869d305b2941e897695c786147e878-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:32:03 [async_llm.py:261] Added request cmpl-9d869d305b2941e897695c786147e878-0.
INFO 03-02 00:32:04 [logger.py:42] Received request cmpl-5f7a7a8fa9984fa2ac21be55ffaf9e37-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:32:04 [async_llm.py:261] Added request cmpl-5f7a7a8fa9984fa2ac21be55ffaf9e37-0.
INFO 03-02 00:32:05 [logger.py:42] Received request cmpl-31746b9fa96445e59cca2326dea22a4c-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:32:05 [async_llm.py:261] Added request cmpl-31746b9fa96445e59cca2326dea22a4c-0.
INFO 03-02 00:32:06 [logger.py:42] Received request cmpl-89fb244c9dd04c2a8e679c6a745fcc62-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:32:06 [async_llm.py:261] Added request cmpl-89fb244c9dd04c2a8e679c6a745fcc62-0.
INFO 03-02 00:32:07 [logger.py:42] Received request cmpl-d90a9d62047c4d40a9c3ee81920e706f-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:32:07 [async_llm.py:261] Added request cmpl-d90a9d62047c4d40a9c3ee81920e706f-0.
INFO 03-02 00:32:08 [loggers.py:116] Engine 000: Avg prompt throughput: 27.9 tokens/s, Avg generation throughput: 4.5 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.6%
INFO 03-02 00:32:08 [logger.py:42] Received request cmpl-63f5156e9c384d18b733a37d4b4c9eda-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:32:08 [async_llm.py:261] Added request cmpl-63f5156e9c384d18b733a37d4b4c9eda-0.
INFO 03-02 00:32:09 [logger.py:42] Received request cmpl-680000f4accd4443b3f0f31bc488e182-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:32:09 [async_llm.py:261] Added request cmpl-680000f4accd4443b3f0f31bc488e182-0.
INFO 03-02 00:32:11 [logger.py:42] Received request cmpl-35d4929b712240c7976237d2a0c6209a-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:32:11 [async_llm.py:261] Added request cmpl-35d4929b712240c7976237d2a0c6209a-0.
INFO 03-02 00:32:12 [logger.py:42] Received request cmpl-712a7cf021d1496e849c94a70f916f18-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:32:12 [async_llm.py:261] Added request cmpl-712a7cf021d1496e849c94a70f916f18-0.
INFO 03-02 00:32:13 [logger.py:42] Received request cmpl-604edcc8e91e4edaa8f5d3615abb164f-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:32:13 [async_llm.py:261] Added request cmpl-604edcc8e91e4edaa8f5d3615abb164f-0.
INFO 03-02 00:32:14 [logger.py:42] Received request cmpl-a19b29f6c6aa474c85dcfc04dddaa1dd-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:32:14 [async_llm.py:261] Added request cmpl-a19b29f6c6aa474c85dcfc04dddaa1dd-0.
INFO 03-02 00:32:15 [logger.py:42] Received request cmpl-b0e38b0219304cb2b4461175c313748b-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:32:15 [async_llm.py:261] Added request cmpl-b0e38b0219304cb2b4461175c313748b-0.
INFO 03-02 00:32:16 [logger.py:42] Received request cmpl-2d969d1128f54c2c8772c81903df6463-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:32:16 [async_llm.py:261] Added request cmpl-2d969d1128f54c2c8772c81903df6463-0.
INFO 03-02 00:32:17 [logger.py:42] Received request cmpl-5c4f218a7cc74d6c9c59bb760fd83de3-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:32:17 [async_llm.py:261] Added request cmpl-5c4f218a7cc74d6c9c59bb760fd83de3-0.
INFO 03-02 00:32:18 [loggers.py:116] Engine 000: Avg prompt throughput: 27.9 tokens/s, Avg generation throughput: 4.3 tokens/s, Running: 1 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.4%, Prefix cache hit rate: 51.6%
INFO 03-02 00:32:19 [logger.py:42] Received request cmpl-0a9ba2771e124fd2b7955b8fb247adbe-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:32:19 [async_llm.py:261] Added request cmpl-0a9ba2771e124fd2b7955b8fb247adbe-0.
INFO 03-02 00:32:20 [logger.py:42] Received request cmpl-1e9b09dc37184a1dbd9ecd7512bb4bdc-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:32:20 [async_llm.py:261] Added request cmpl-1e9b09dc37184a1dbd9ecd7512bb4bdc-0.
INFO 03-02 00:32:21 [logger.py:42] Received request cmpl-a8f51d72236b4f41ae28d7439e21a700-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:32:21 [async_llm.py:261] Added request cmpl-a8f51d72236b4f41ae28d7439e21a700-0.
INFO 03-02 00:32:22 [logger.py:42] Received request cmpl-27e15ef0a2a34598acf89e8ade865869-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:32:22 [async_llm.py:261] Added request cmpl-27e15ef0a2a34598acf89e8ade865869-0.
INFO 03-02 00:32:23 [logger.py:42] Received request cmpl-a14d8f433b5149f5b7c781fec8f4140a-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:32:23 [async_llm.py:261] Added request cmpl-a14d8f433b5149f5b7c781fec8f4140a-0.
INFO 03-02 00:32:24 [logger.py:42] Received request cmpl-1da3bfd3de374818b0684179949d65df-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:32:24 [async_llm.py:261] Added request cmpl-1da3bfd3de374818b0684179949d65df-0.
INFO 03-02 00:32:26 [logger.py:42] Received request cmpl-d0ed81f2756a45b696488a378ebfdd05-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:32:26 [async_llm.py:261] Added request cmpl-d0ed81f2756a45b696488a378ebfdd05-0.
INFO 03-02 00:32:27 [logger.py:42] Received request cmpl-65b5d8474cb84477aeaa5e5e811abbf8-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:32:27 [async_llm.py:261] Added request cmpl-65b5d8474cb84477aeaa5e5e811abbf8-0.
INFO 03-02 00:32:28 [loggers.py:116] Engine 000: Avg prompt throughput: 24.8 tokens/s, Avg generation throughput: 4.2 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.6%
INFO 03-02 00:32:28 [logger.py:42] Received request cmpl-e63a485495bc40759ec8d885ca47885d-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:32:28 [async_llm.py:261] Added request cmpl-e63a485495bc40759ec8d885ca47885d-0.
INFO 03-02 00:32:29 [logger.py:42] Received request cmpl-d57e448abd6f48a49a0d3716ea8f359e-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:32:29 [async_llm.py:261] Added request cmpl-d57e448abd6f48a49a0d3716ea8f359e-0.
INFO 03-02 00:32:30 [logger.py:42] Received request cmpl-44e2c209edfc41f2a7535dc58fe35f65-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:32:30 [async_llm.py:261] Added request cmpl-44e2c209edfc41f2a7535dc58fe35f65-0.
INFO 03-02 00:32:31 [logger.py:42] Received request cmpl-b2009b6987c347869d3ec1fe1ed50b94-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:32:31 [async_llm.py:261] Added request cmpl-b2009b6987c347869d3ec1fe1ed50b94-0.
INFO 03-02 00:32:32 [logger.py:42] Received request cmpl-338811dbda2e4a2f894e876d48f72e6c-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:32:32 [async_llm.py:261] Added request cmpl-338811dbda2e4a2f894e876d48f72e6c-0.
INFO 03-02 00:32:34 [logger.py:42] Received request cmpl-e69f284a33b44f76896c9cfc6f98f71d-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:32:34 [async_llm.py:261] Added request cmpl-e69f284a33b44f76896c9cfc6f98f71d-0.
INFO 03-02 00:32:35 [logger.py:42] Received request cmpl-f6c438b345c941ce92e59ebc588bb01b-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:32:35 [async_llm.py:261] Added request cmpl-f6c438b345c941ce92e59ebc588bb01b-0.
INFO 03-02 00:32:36 [logger.py:42] Received request cmpl-c7c6b493020d4f2a9e7c69263158c0f5-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:32:36 [async_llm.py:261] Added request cmpl-c7c6b493020d4f2a9e7c69263158c0f5-0.
INFO 03-02 00:32:37 [logger.py:42] Received request cmpl-bc8432161caf460c85a3b6671ffabac6-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:32:37 [async_llm.py:261] Added request cmpl-bc8432161caf460c85a3b6671ffabac6-0.
INFO 03-02 00:32:38 [loggers.py:116] Engine 000: Avg prompt throughput: 27.9 tokens/s, Avg generation throughput: 4.5 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.6%
INFO 03-02 00:32:38 [logger.py:42] Received request cmpl-0dbd66fd442f428d81493d4bf2598036-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:32:38 [async_llm.py:261] Added request cmpl-0dbd66fd442f428d81493d4bf2598036-0.
INFO 03-02 00:32:39 [logger.py:42] Received request cmpl-2fbc7860ac084870ade9be1206a31154-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:32:39 [async_llm.py:261] Added request cmpl-2fbc7860ac084870ade9be1206a31154-0.
INFO 03-02 00:32:41 [logger.py:42] Received request cmpl-8eb888af9aa04ba0b676d91598962b67-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:32:41 [async_llm.py:261] Added request cmpl-8eb888af9aa04ba0b676d91598962b67-0.
INFO 03-02 00:32:42 [logger.py:42] Received request cmpl-92e440df1e214c1faca44048b930adb6-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:32:42 [async_llm.py:261] Added request cmpl-92e440df1e214c1faca44048b930adb6-0.
INFO 03-02 00:32:43 [logger.py:42] Received request cmpl-82d19b6ac864446d825e8513131285df-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:32:43 [async_llm.py:261] Added request cmpl-82d19b6ac864446d825e8513131285df-0.
INFO 03-02 00:32:44 [logger.py:42] Received request cmpl-8a8266a5a2904a80aec7380cbba443a7-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:32:44 [async_llm.py:261] Added request cmpl-8a8266a5a2904a80aec7380cbba443a7-0.
INFO 03-02 00:32:45 [logger.py:42] Received request cmpl-7efc851cd745425987a608d0fcbde388-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:32:45 [async_llm.py:261] Added request cmpl-7efc851cd745425987a608d0fcbde388-0.
INFO 03-02 00:32:46 [logger.py:42] Received request cmpl-a504291402c84db2ad3fc35f78402721-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:32:46 [async_llm.py:261] Added request cmpl-a504291402c84db2ad3fc35f78402721-0.
INFO 03-02 00:32:47 [logger.py:42] Received request cmpl-51a8a8bded4247ebbbed77a486844d5b-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:32:47 [async_llm.py:261] Added request cmpl-51a8a8bded4247ebbbed77a486844d5b-0.
INFO 03-02 00:32:48 [loggers.py:116] Engine 000: Avg prompt throughput: 27.9 tokens/s, Avg generation throughput: 4.4 tokens/s, Running: 1 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.4%, Prefix cache hit rate: 51.6%
INFO 03-02 00:32:49 [logger.py:42] Received request cmpl-6b08c285c65d49c6a16c129b612c3b00-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:32:49 [async_llm.py:261] Added request cmpl-6b08c285c65d49c6a16c129b612c3b00-0.
INFO 03-02 00:32:50 [logger.py:42] Received request cmpl-d423c7fcfb544c1b967ad051be772a15-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:32:50 [async_llm.py:261] Added request cmpl-d423c7fcfb544c1b967ad051be772a15-0.
INFO 03-02 00:32:51 [logger.py:42] Received request cmpl-95a841a9ba5b47b1a8df3c928272f7e5-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:32:51 [async_llm.py:261] Added request cmpl-95a841a9ba5b47b1a8df3c928272f7e5-0.
INFO 03-02 00:32:52 [logger.py:42] Received request cmpl-df184f7c129a49bcb989ff8ece1ba28b-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:32:52 [async_llm.py:261] Added request cmpl-df184f7c129a49bcb989ff8ece1ba28b-0.
INFO 03-02 00:32:53 [logger.py:42] Received request cmpl-e4e33c22a1dd4238b400e4391d3cae18-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:32:53 [async_llm.py:261] Added request cmpl-e4e33c22a1dd4238b400e4391d3cae18-0.
INFO 03-02 00:32:54 [logger.py:42] Received request cmpl-3d26ef01b3724dfeaee9e91a5782c49c-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:32:54 [async_llm.py:261] Added request cmpl-3d26ef01b3724dfeaee9e91a5782c49c-0.
INFO 03-02 00:32:56 [logger.py:42] Received request cmpl-9079f2d4956249cc988434bb7e67dd30-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:32:56 [async_llm.py:261] Added request cmpl-9079f2d4956249cc988434bb7e67dd30-0.
INFO 03-02 00:32:57 [logger.py:42] Received request cmpl-0a282e7f4904407faca5bf9e6452069b-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:32:57 [async_llm.py:261] Added request cmpl-0a282e7f4904407faca5bf9e6452069b-0.
INFO 03-02 00:32:58 [loggers.py:116] Engine 000: Avg prompt throughput: 24.8 tokens/s, Avg generation throughput: 4.1 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.6%
INFO 03-02 00:32:58 [logger.py:42] Received request cmpl-368e738fa66f4699be455023e55893fb-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:32:58 [async_llm.py:261] Added request cmpl-368e738fa66f4699be455023e55893fb-0.
INFO 03-02 00:32:59 [logger.py:42] Received request cmpl-193ba85edc1145d7bbf2ab6504dcfef5-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:32:59 [async_llm.py:261] Added request cmpl-193ba85edc1145d7bbf2ab6504dcfef5-0.
INFO 03-02 00:33:00 [logger.py:42] Received request cmpl-709c9e66416841a2acad0d49242777f3-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:33:00 [async_llm.py:261] Added request cmpl-709c9e66416841a2acad0d49242777f3-0.
INFO 03-02 00:33:01 [logger.py:42] Received request cmpl-fa2a4f5ea7a24b929450b668e3c916fd-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:33:01 [async_llm.py:261] Added request cmpl-fa2a4f5ea7a24b929450b668e3c916fd-0.
INFO 03-02 00:33:02 [logger.py:42] Received request cmpl-a020bdab204a4c76b1804f5a7b8a47a4-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:33:02 [async_llm.py:261] Added request cmpl-a020bdab204a4c76b1804f5a7b8a47a4-0.
INFO 03-02 00:33:04 [logger.py:42] Received request cmpl-5473d4a4ac944d3fab4c4f7789992943-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:33:04 [async_llm.py:261] Added request cmpl-5473d4a4ac944d3fab4c4f7789992943-0.
INFO 03-02 00:33:05 [logger.py:42] Received request cmpl-bee3cf39953e47eca277f6f70c0a9e9c-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:33:05 [async_llm.py:261] Added request cmpl-bee3cf39953e47eca277f6f70c0a9e9c-0.
INFO 03-02 00:33:06 [logger.py:42] Received request cmpl-50a98001a8744445b2f7782789d54f69-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:33:06 [async_llm.py:261] Added request cmpl-50a98001a8744445b2f7782789d54f69-0.
INFO 03-02 00:33:07 [logger.py:42] Received request cmpl-81fd559a900a48109767770fcd9b3ddd-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:33:07 [async_llm.py:261] Added request cmpl-81fd559a900a48109767770fcd9b3ddd-0.
INFO 03-02 00:33:08 [loggers.py:116] Engine 000: Avg prompt throughput: 27.9 tokens/s, Avg generation throughput: 4.5 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.6%
INFO 03-02 00:33:08 [logger.py:42] Received request cmpl-ae2a904ba5144431b07691cc4112af1e-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:33:08 [async_llm.py:261] Added request cmpl-ae2a904ba5144431b07691cc4112af1e-0.
INFO 03-02 00:33:09 [logger.py:42] Received request cmpl-780f11ecb287486088b8d38e61d54cb9-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:33:09 [async_llm.py:261] Added request cmpl-780f11ecb287486088b8d38e61d54cb9-0.
INFO 03-02 00:33:11 [logger.py:42] Received request cmpl-908d7b751d60403ab12c249c77326ef4-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:33:11 [async_llm.py:261] Added request cmpl-908d7b751d60403ab12c249c77326ef4-0.
INFO 03-02 00:33:12 [logger.py:42] Received request cmpl-2d973c0a775c4916973c99d6f20e735b-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:33:12 [async_llm.py:261] Added request cmpl-2d973c0a775c4916973c99d6f20e735b-0.
INFO 03-02 00:33:13 [logger.py:42] Received request cmpl-69f5305821b4485f8ac5e764651e393c-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:33:13 [async_llm.py:261] Added request cmpl-69f5305821b4485f8ac5e764651e393c-0.
INFO 03-02 00:33:14 [logger.py:42] Received request cmpl-b701b923e4ee4745b3430a60aaf1271d-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:33:14 [async_llm.py:261] Added request cmpl-b701b923e4ee4745b3430a60aaf1271d-0.
INFO 03-02 00:33:15 [logger.py:42] Received request cmpl-5ffbeddd20634acdb58991a626310fc7-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:33:15 [async_llm.py:261] Added request cmpl-5ffbeddd20634acdb58991a626310fc7-0.
INFO 03-02 00:33:16 [logger.py:42] Received request cmpl-8ea78a5df3a84550a5b4a8802443cbbd-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:33:16 [async_llm.py:261] Added request cmpl-8ea78a5df3a84550a5b4a8802443cbbd-0.
INFO 03-02 00:33:17 [logger.py:42] Received request cmpl-da2d9b255ede4f998678fe99e492e2bc-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:33:17 [async_llm.py:261] Added request cmpl-da2d9b255ede4f998678fe99e492e2bc-0.
INFO 03-02 00:33:18 [loggers.py:116] Engine 000: Avg prompt throughput: 27.9 tokens/s, Avg generation throughput: 4.4 tokens/s, Running: 1 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.4%, Prefix cache hit rate: 51.6%
INFO 03-02 00:33:19 [logger.py:42] Received request cmpl-b8a5642bbaf44688a5d498ffcfea8141-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:33:19 [async_llm.py:261] Added request cmpl-b8a5642bbaf44688a5d498ffcfea8141-0.
INFO 03-02 00:33:20 [logger.py:42] Received request cmpl-f14db9457f9f441fb480fd583c0b404b-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:33:20 [async_llm.py:261] Added request cmpl-f14db9457f9f441fb480fd583c0b404b-0.
INFO 03-02 00:33:21 [logger.py:42] Received request cmpl-4fbb2b9515294da1a69c25e3c8193701-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:33:21 [async_llm.py:261] Added request cmpl-4fbb2b9515294da1a69c25e3c8193701-0.
INFO 03-02 00:33:22 [logger.py:42] Received request cmpl-d9a54a96cd83467a8fab99ec8c7f9fa8-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:33:22 [async_llm.py:261] Added request cmpl-d9a54a96cd83467a8fab99ec8c7f9fa8-0.
INFO 03-02 00:33:23 [logger.py:42] Received request cmpl-2f8fa263aea1426291c4fbb464eca2af-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:33:23 [async_llm.py:261] Added request cmpl-2f8fa263aea1426291c4fbb464eca2af-0.
INFO 03-02 00:33:24 [logger.py:42] Received request cmpl-d304ef888a884a5cb69073571ffd4e02-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:33:24 [async_llm.py:261] Added request cmpl-d304ef888a884a5cb69073571ffd4e02-0.
INFO 03-02 00:33:26 [logger.py:42] Received request cmpl-96f6f11b4bae442298c79b46cfffa5f9-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:33:26 [async_llm.py:261] Added request cmpl-96f6f11b4bae442298c79b46cfffa5f9-0.
INFO 03-02 00:33:27 [logger.py:42] Received request cmpl-ca47d80c61f04c3f9b41ca7d91c9ee14-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:33:27 [async_llm.py:261] Added request cmpl-ca47d80c61f04c3f9b41ca7d91c9ee14-0.
INFO 03-02 00:33:28 [loggers.py:116] Engine 000: Avg prompt throughput: 24.8 tokens/s, Avg generation throughput: 4.1 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.6%
INFO 03-02 00:33:28 [logger.py:42] Received request cmpl-4da9c31c21504719904e019c1ba200a2-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:33:28 [async_llm.py:261] Added request cmpl-4da9c31c21504719904e019c1ba200a2-0.
INFO 03-02 00:33:29 [logger.py:42] Received request cmpl-6cbd6f70183f437faaed810c95b6cfd8-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:33:29 [async_llm.py:261] Added request cmpl-6cbd6f70183f437faaed810c95b6cfd8-0.
INFO 03-02 00:33:30 [logger.py:42] Received request cmpl-494e642810644467a14f9593e69f0af2-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:33:30 [async_llm.py:261] Added request cmpl-494e642810644467a14f9593e69f0af2-0.
INFO 03-02 00:33:31 [logger.py:42] Received request cmpl-cc072a70329f4023be60f841ecc9356a-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:33:31 [async_llm.py:261] Added request cmpl-cc072a70329f4023be60f841ecc9356a-0.
INFO 03-02 00:33:32 [logger.py:42] Received request cmpl-96c56b8d5d0c4111b3ca9eda133efdd2-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:33:32 [async_llm.py:261] Added request cmpl-96c56b8d5d0c4111b3ca9eda133efdd2-0.
INFO 03-02 00:33:34 [logger.py:42] Received request cmpl-d46e57954e6a42e9ad036333c35f6830-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:33:34 [async_llm.py:261] Added request cmpl-d46e57954e6a42e9ad036333c35f6830-0.
INFO 03-02 00:33:35 [logger.py:42] Received request cmpl-8657ddef5848485e84a7120b59f4fae4-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:33:35 [async_llm.py:261] Added request cmpl-8657ddef5848485e84a7120b59f4fae4-0.
INFO 03-02 00:33:36 [logger.py:42] Received request cmpl-3affe88d0b4543d0bd736fcc93bb9d2d-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:33:36 [async_llm.py:261] Added request cmpl-3affe88d0b4543d0bd736fcc93bb9d2d-0.
INFO 03-02 00:33:37 [logger.py:42] Received request cmpl-998c508413b9470388b38def214e1096-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:33:37 [async_llm.py:261] Added request cmpl-998c508413b9470388b38def214e1096-0.
INFO 03-02 00:33:38 [loggers.py:116] Engine 000: Avg prompt throughput: 27.9 tokens/s, Avg generation throughput: 4.5 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.6%
INFO 03-02 00:33:38 [logger.py:42] Received request cmpl-2ba0bfb2dba24a69aae7baa24fbb6bb8-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:33:38 [async_llm.py:261] Added request cmpl-2ba0bfb2dba24a69aae7baa24fbb6bb8-0.
INFO 03-02 00:33:39 [logger.py:42] Received request cmpl-62a636ff2f1342a4967ef407df84b91e-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:33:39 [async_llm.py:261] Added request cmpl-62a636ff2f1342a4967ef407df84b91e-0.
INFO 03-02 00:33:41 [logger.py:42] Received request cmpl-12ef10518f534d8187368de0c2dacb8e-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:33:41 [async_llm.py:261] Added request cmpl-12ef10518f534d8187368de0c2dacb8e-0.
INFO 03-02 00:33:42 [logger.py:42] Received request cmpl-3c50a1b480f4488293920245ab2ce1ca-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:33:42 [async_llm.py:261] Added request cmpl-3c50a1b480f4488293920245ab2ce1ca-0.
INFO 03-02 00:33:43 [logger.py:42] Received request cmpl-09d3d2d282e647ff9f2613d855f4b8ba-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:33:43 [async_llm.py:261] Added request cmpl-09d3d2d282e647ff9f2613d855f4b8ba-0.
INFO 03-02 00:33:44 [logger.py:42] Received request cmpl-f35e30b375a741a9aefa589ad7302684-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:33:44 [async_llm.py:261] Added request cmpl-f35e30b375a741a9aefa589ad7302684-0.
INFO 03-02 00:33:45 [logger.py:42] Received request cmpl-d20b1103b3d14eb7922c197ab2d99fd8-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:33:45 [async_llm.py:261] Added request cmpl-d20b1103b3d14eb7922c197ab2d99fd8-0.
INFO 03-02 00:33:46 [logger.py:42] Received request cmpl-079afb73fdb14c6fb9c6cbc40668c5e2-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:33:46 [async_llm.py:261] Added request cmpl-079afb73fdb14c6fb9c6cbc40668c5e2-0.
INFO 03-02 00:33:47 [logger.py:42] Received request cmpl-c478c8914d0f4f5ea0dba9aa22787859-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:33:47 [async_llm.py:261] Added request cmpl-c478c8914d0f4f5ea0dba9aa22787859-0.
INFO 03-02 00:33:48 [loggers.py:116] Engine 000: Avg prompt throughput: 27.9 tokens/s, Avg generation throughput: 4.5 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.6%
INFO 03-02 00:33:49 [logger.py:42] Received request cmpl-6df689d4e70f440e924d12eafbbdc895-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:33:49 [async_llm.py:261] Added request cmpl-6df689d4e70f440e924d12eafbbdc895-0.
INFO 03-02 00:33:50 [logger.py:42] Received request cmpl-20c551ae14754a179ecd62f77f9416df-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:33:50 [async_llm.py:261] Added request cmpl-20c551ae14754a179ecd62f77f9416df-0.
INFO 03-02 00:33:51 [logger.py:42] Received request cmpl-0c558738fc4a4fb59567d02ad50a5d99-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:33:51 [async_llm.py:261] Added request cmpl-0c558738fc4a4fb59567d02ad50a5d99-0.
INFO 03-02 00:33:52 [logger.py:42] Received request cmpl-fe89574dbee844499c0c6849ff34f4ef-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:33:52 [async_llm.py:261] Added request cmpl-fe89574dbee844499c0c6849ff34f4ef-0.
INFO 03-02 00:33:53 [logger.py:42] Received request cmpl-f61b87942ea548afb89ad708433454f8-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:33:53 [async_llm.py:261] Added request cmpl-f61b87942ea548afb89ad708433454f8-0.
INFO 03-02 00:33:54 [logger.py:42] Received request cmpl-bace9a0ca3b04bac9f1e62b22f4a29b0-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:33:54 [async_llm.py:261] Added request cmpl-bace9a0ca3b04bac9f1e62b22f4a29b0-0.
INFO 03-02 00:33:55 [logger.py:42] Received request cmpl-7b43045a2ba546cbaf15d4a0c1da008f-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:33:55 [async_llm.py:261] Added request cmpl-7b43045a2ba546cbaf15d4a0c1da008f-0.
INFO 03-02 00:33:57 [logger.py:42] Received request cmpl-3b959615979248049656c30a6c8b1ac9-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:33:57 [async_llm.py:261] Added request cmpl-3b959615979248049656c30a6c8b1ac9-0.
INFO 03-02 00:33:58 [loggers.py:116] Engine 000: Avg prompt throughput: 24.8 tokens/s, Avg generation throughput: 4.0 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.6%
INFO 03-02 00:33:58 [logger.py:42] Received request cmpl-64becea81a7e41ec862f4a9a54bdcb37-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:33:58 [async_llm.py:261] Added request cmpl-64becea81a7e41ec862f4a9a54bdcb37-0.
INFO 03-02 00:33:59 [logger.py:42] Received request cmpl-b7340c8d6c4c48cebec01441079581cc-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:33:59 [async_llm.py:261] Added request cmpl-b7340c8d6c4c48cebec01441079581cc-0.
INFO 03-02 00:34:00 [logger.py:42] Received request cmpl-4e7c4e607c904b079f66655a0ab0afdc-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:34:00 [async_llm.py:261] Added request cmpl-4e7c4e607c904b079f66655a0ab0afdc-0.
INFO 03-02 00:34:01 [logger.py:42] Received request cmpl-557ccc2db19241b8a54ccf05a2825b7a-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:34:01 [async_llm.py:261] Added request cmpl-557ccc2db19241b8a54ccf05a2825b7a-0.
INFO 03-02 00:34:02 [logger.py:42] Received request cmpl-3a11e5c91f2b47e6b02dcc2e35405f99-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:34:02 [async_llm.py:261] Added request cmpl-3a11e5c91f2b47e6b02dcc2e35405f99-0.
INFO 03-02 00:34:04 [logger.py:42] Received request cmpl-05dbef23eb5649ed9c149ae253d4ea2f-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:34:04 [async_llm.py:261] Added request cmpl-05dbef23eb5649ed9c149ae253d4ea2f-0.
INFO 03-02 00:34:05 [logger.py:42] Received request cmpl-5c2ead28d8264ad68fc08e65a8bbbc79-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:34:05 [async_llm.py:261] Added request cmpl-5c2ead28d8264ad68fc08e65a8bbbc79-0.
INFO 03-02 00:34:06 [logger.py:42] Received request cmpl-d5ef1a0e14d84cb3bdf70077b39316cd-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:34:06 [async_llm.py:261] Added request cmpl-d5ef1a0e14d84cb3bdf70077b39316cd-0.
INFO 03-02 00:34:07 [logger.py:42] Received request cmpl-32edf43901044802827a29b75dab6161-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:34:07 [async_llm.py:261] Added request cmpl-32edf43901044802827a29b75dab6161-0.
INFO 03-02 00:34:08 [loggers.py:116] Engine 000: Avg prompt throughput: 27.9 tokens/s, Avg generation throughput: 4.5 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.6%
INFO 03-02 00:34:08 [logger.py:42] Received request cmpl-2ba3e3692ff045ca94dc83ca3fe724ed-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:34:08 [async_llm.py:261] Added request cmpl-2ba3e3692ff045ca94dc83ca3fe724ed-0.
INFO 03-02 00:34:09 [logger.py:42] Received request cmpl-abe94ba147e04181b00205e176eba982-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:34:09 [async_llm.py:261] Added request cmpl-abe94ba147e04181b00205e176eba982-0.
INFO 03-02 00:34:10 [logger.py:42] Received request cmpl-857088404fe54a2da23d772b1c573482-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:34:10 [async_llm.py:261] Added request cmpl-857088404fe54a2da23d772b1c573482-0.
INFO 03-02 00:34:12 [logger.py:42] Received request cmpl-59e93eac5d6e44289de7971656f970a1-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:34:12 [async_llm.py:261] Added request cmpl-59e93eac5d6e44289de7971656f970a1-0.
INFO 03-02 00:34:13 [logger.py:42] Received request cmpl-266b3c8bc0d7441ab8bd80aa9a5e64db-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:34:13 [async_llm.py:261] Added request cmpl-266b3c8bc0d7441ab8bd80aa9a5e64db-0.
INFO 03-02 00:34:14 [logger.py:42] Received request cmpl-f0ed47de79cf4f82bf82fbaa9a383fc4-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:34:14 [async_llm.py:261] Added request cmpl-f0ed47de79cf4f82bf82fbaa9a383fc4-0.
INFO 03-02 00:34:15 [logger.py:42] Received request cmpl-577c2636d90a453daa654db3cdbdcfee-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:34:15 [async_llm.py:261] Added request cmpl-577c2636d90a453daa654db3cdbdcfee-0.
INFO 03-02 00:34:16 [logger.py:42] Received request cmpl-42ba7f65d14d4c31af476d6a5c732f6e-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:34:16 [async_llm.py:261] Added request cmpl-42ba7f65d14d4c31af476d6a5c732f6e-0.
INFO 03-02 00:34:17 [logger.py:42] Received request cmpl-f9553273aeb3431eb8668b590228cb9a-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:34:17 [async_llm.py:261] Added request cmpl-f9553273aeb3431eb8668b590228cb9a-0.
INFO 03-02 00:34:18 [loggers.py:116] Engine 000: Avg prompt throughput: 27.9 tokens/s, Avg generation throughput: 4.5 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.6%
INFO 03-02 00:34:19 [logger.py:42] Received request cmpl-d1b8ceed3ba345c6bb328351d7b22f6a-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:34:19 [async_llm.py:261] Added request cmpl-d1b8ceed3ba345c6bb328351d7b22f6a-0.
INFO 03-02 00:34:20 [logger.py:42] Received request cmpl-83ee20145ff44b42b8a962d521374b84-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:34:20 [async_llm.py:261] Added request cmpl-83ee20145ff44b42b8a962d521374b84-0.
INFO 03-02 00:34:21 [logger.py:42] Received request cmpl-b74d2c8e751e49b891bf445f0de5345d-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:34:21 [async_llm.py:261] Added request cmpl-b74d2c8e751e49b891bf445f0de5345d-0.
INFO 03-02 00:34:22 [logger.py:42] Received request cmpl-5988863b3e214aa0b4db205292888728-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:34:22 [async_llm.py:261] Added request cmpl-5988863b3e214aa0b4db205292888728-0.
INFO 03-02 00:34:23 [logger.py:42] Received request cmpl-fd908737882e42f989fadc4f4a04362b-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:34:23 [async_llm.py:261] Added request cmpl-fd908737882e42f989fadc4f4a04362b-0.
INFO 03-02 00:34:24 [logger.py:42] Received request cmpl-3c61dc97fd4a4a2bb6da3d2e08e184eb-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:34:24 [async_llm.py:261] Added request cmpl-3c61dc97fd4a4a2bb6da3d2e08e184eb-0.
INFO 03-02 00:34:25 [logger.py:42] Received request cmpl-3af690c5120747758d22e177673a6f86-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:34:25 [async_llm.py:261] Added request cmpl-3af690c5120747758d22e177673a6f86-0.
INFO 03-02 00:34:27 [logger.py:42] Received request cmpl-5367de3f589245f3b5cf50956dde291a-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:34:27 [async_llm.py:261] Added request cmpl-5367de3f589245f3b5cf50956dde291a-0.
INFO 03-02 00:34:28 [loggers.py:116] Engine 000: Avg prompt throughput: 24.8 tokens/s, Avg generation throughput: 4.0 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.6%
INFO 03-02 00:34:28 [logger.py:42] Received request cmpl-657ffd98185c4bf19a71709f95de6a9b-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:34:28 [async_llm.py:261] Added request cmpl-657ffd98185c4bf19a71709f95de6a9b-0.
INFO 03-02 00:34:29 [logger.py:42] Received request cmpl-fb19e250f1f4442b9eb9c1b698000fc1-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:34:29 [async_llm.py:261] Added request cmpl-fb19e250f1f4442b9eb9c1b698000fc1-0.
INFO 03-02 00:34:30 [logger.py:42] Received request cmpl-802a13ce32494f79bfb2ff9dde01375d-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:34:30 [async_llm.py:261] Added request cmpl-802a13ce32494f79bfb2ff9dde01375d-0.
INFO 03-02 00:34:31 [logger.py:42] Received request cmpl-bce6ade1fcf345c6b8e176d90cbb4b21-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:34:31 [async_llm.py:261] Added request cmpl-bce6ade1fcf345c6b8e176d90cbb4b21-0.
INFO 03-02 00:34:32 [logger.py:42] Received request cmpl-1ef9e51cfa8d43be82bf21a117571c46-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:34:32 [async_llm.py:261] Added request cmpl-1ef9e51cfa8d43be82bf21a117571c46-0.
INFO 03-02 00:34:34 [logger.py:42] Received request cmpl-7c75ac4d546040c68fc21e7d33b79f39-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:34:34 [async_llm.py:261] Added request cmpl-7c75ac4d546040c68fc21e7d33b79f39-0.
INFO 03-02 00:34:35 [logger.py:42] Received request cmpl-9c4ade202e7840c2a00023499ce408bd-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:34:35 [async_llm.py:261] Added request cmpl-9c4ade202e7840c2a00023499ce408bd-0.
INFO 03-02 00:34:36 [logger.py:42] Received request cmpl-9fbbbe489eff417e9eebe8bc68ba046b-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:34:36 [async_llm.py:261] Added request cmpl-9fbbbe489eff417e9eebe8bc68ba046b-0.
INFO 03-02 00:34:37 [logger.py:42] Received request cmpl-75c22c3297834ed2afff02d1c95ca534-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:34:37 [async_llm.py:261] Added request cmpl-75c22c3297834ed2afff02d1c95ca534-0.
INFO 03-02 00:34:38 [loggers.py:116] Engine 000: Avg prompt throughput: 27.9 tokens/s, Avg generation throughput: 4.5 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.6%
INFO 03-02 00:34:38 [logger.py:42] Received request cmpl-15cb95e21a6c4bf8a7e1d07e7b5a8de6-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:34:38 [async_llm.py:261] Added request cmpl-15cb95e21a6c4bf8a7e1d07e7b5a8de6-0.
INFO 03-02 00:34:39 [logger.py:42] Received request cmpl-74b94062b7924192a030c9c700b77be3-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:34:39 [async_llm.py:261] Added request cmpl-74b94062b7924192a030c9c700b77be3-0.
INFO 03-02 00:34:40 [logger.py:42] Received request cmpl-2e9e147acab4451e8d48b0ff6ff910aa-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:34:40 [async_llm.py:261] Added request cmpl-2e9e147acab4451e8d48b0ff6ff910aa-0.
INFO 03-02 00:34:42 [logger.py:42] Received request cmpl-165603d076b8407cb327843638e1139c-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:34:42 [async_llm.py:261] Added request cmpl-165603d076b8407cb327843638e1139c-0.
INFO 03-02 00:34:43 [logger.py:42] Received request cmpl-a337cc1bf0544a84a161ea151bf4c38d-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:34:43 [async_llm.py:261] Added request cmpl-a337cc1bf0544a84a161ea151bf4c38d-0.
INFO 03-02 00:34:44 [logger.py:42] Received request cmpl-621d9e5677a540f090213206860914ac-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:34:44 [async_llm.py:261] Added request cmpl-621d9e5677a540f090213206860914ac-0.
INFO 03-02 00:34:45 [logger.py:42] Received request cmpl-4db43ef8869f43698f314aabb7e26354-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:34:45 [async_llm.py:261] Added request cmpl-4db43ef8869f43698f314aabb7e26354-0.
INFO 03-02 00:34:46 [logger.py:42] Received request cmpl-5f44da3ca7294a72b9e225d105b02357-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:34:46 [async_llm.py:261] Added request cmpl-5f44da3ca7294a72b9e225d105b02357-0.
INFO 03-02 00:34:47 [logger.py:42] Received request cmpl-3688ca0a8843438cb28f7a1b4d942b9f-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:34:47 [async_llm.py:261] Added request cmpl-3688ca0a8843438cb28f7a1b4d942b9f-0.
INFO 03-02 00:34:48 [loggers.py:116] Engine 000: Avg prompt throughput: 27.9 tokens/s, Avg generation throughput: 4.5 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.6%
INFO 03-02 00:34:49 [logger.py:42] Received request cmpl-061ce2b28aa143ed9ff806a265f5bdb0-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:34:49 [async_llm.py:261] Added request cmpl-061ce2b28aa143ed9ff806a265f5bdb0-0.
INFO 03-02 00:34:50 [logger.py:42] Received request cmpl-858d58f76ae94178b7b49b2bec10d313-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:34:50 [async_llm.py:261] Added request cmpl-858d58f76ae94178b7b49b2bec10d313-0.
INFO 03-02 00:34:51 [logger.py:42] Received request cmpl-ef80161e45a64d92b3fc8ead2c5b4cfd-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:34:51 [async_llm.py:261] Added request cmpl-ef80161e45a64d92b3fc8ead2c5b4cfd-0.
INFO 03-02 00:34:52 [logger.py:42] Received request cmpl-b5cd59b0f7984574b45bcdc39a2c6da0-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:34:52 [async_llm.py:261] Added request cmpl-b5cd59b0f7984574b45bcdc39a2c6da0-0.
INFO 03-02 00:34:53 [logger.py:42] Received request cmpl-e8755f3cf237471eb14f8c5838935151-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:34:53 [async_llm.py:261] Added request cmpl-e8755f3cf237471eb14f8c5838935151-0.
INFO 03-02 00:34:54 [logger.py:42] Received request cmpl-0e6c5f4f56cd4a4ba550ea2ec1d1f346-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:34:54 [async_llm.py:261] Added request cmpl-0e6c5f4f56cd4a4ba550ea2ec1d1f346-0.
INFO 03-02 00:34:55 [logger.py:42] Received request cmpl-087259b7f9c442bfb7030680185085da-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:34:55 [async_llm.py:261] Added request cmpl-087259b7f9c442bfb7030680185085da-0.
INFO 03-02 00:34:57 [logger.py:42] Received request cmpl-4ca0bace90a343958762cf457bccb5dc-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:34:57 [async_llm.py:261] Added request cmpl-4ca0bace90a343958762cf457bccb5dc-0.
INFO 03-02 00:34:58 [loggers.py:116] Engine 000: Avg prompt throughput: 24.8 tokens/s, Avg generation throughput: 4.0 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.6%
INFO 03-02 00:34:58 [logger.py:42] Received request cmpl-35b2b2ec03d747289bb521e127ba3a39-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:34:58 [async_llm.py:261] Added request cmpl-35b2b2ec03d747289bb521e127ba3a39-0.
INFO 03-02 00:34:59 [logger.py:42] Received request cmpl-5ab3a8d424ec40f89d88c9fb81c17279-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:34:59 [async_llm.py:261] Added request cmpl-5ab3a8d424ec40f89d88c9fb81c17279-0.
INFO 03-02 00:35:00 [logger.py:42] Received request cmpl-d86e00acce224ff089cf2240b1710933-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:35:00 [async_llm.py:261] Added request cmpl-d86e00acce224ff089cf2240b1710933-0.
INFO 03-02 00:35:01 [logger.py:42] Received request cmpl-18cda963cecd4e2a82d6cba25feed44d-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:35:01 [async_llm.py:261] Added request cmpl-18cda963cecd4e2a82d6cba25feed44d-0.
INFO 03-02 00:35:02 [logger.py:42] Received request cmpl-39c60adf991a48f0abd2210a0375eeed-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:35:02 [async_llm.py:261] Added request cmpl-39c60adf991a48f0abd2210a0375eeed-0.
INFO 03-02 00:35:04 [logger.py:42] Received request cmpl-359728a958544b62ad40e1cb158abc56-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:35:04 [async_llm.py:261] Added request cmpl-359728a958544b62ad40e1cb158abc56-0.
INFO 03-02 00:35:05 [logger.py:42] Received request cmpl-27c8d171d2e443d3b676df71dd76345e-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:35:05 [async_llm.py:261] Added request cmpl-27c8d171d2e443d3b676df71dd76345e-0.
INFO 03-02 00:35:06 [logger.py:42] Received request cmpl-2afeab07fa87479d8a017ca6b8abd793-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:35:06 [async_llm.py:261] Added request cmpl-2afeab07fa87479d8a017ca6b8abd793-0.
INFO 03-02 00:35:07 [logger.py:42] Received request cmpl-6a60e88684ec4af79895d08afabc34d4-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:35:07 [async_llm.py:261] Added request cmpl-6a60e88684ec4af79895d08afabc34d4-0.
INFO 03-02 00:35:08 [loggers.py:116] Engine 000: Avg prompt throughput: 27.9 tokens/s, Avg generation throughput: 4.5 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.6%
INFO 03-02 00:35:08 [logger.py:42] Received request cmpl-336958039bc34b30a6cb7c08948aa3c5-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:35:08 [async_llm.py:261] Added request cmpl-336958039bc34b30a6cb7c08948aa3c5-0.
INFO 03-02 00:35:09 [logger.py:42] Received request cmpl-8bc2c2bcb2d140b38ed7bf1ed05fe693-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:35:09 [async_llm.py:261] Added request cmpl-8bc2c2bcb2d140b38ed7bf1ed05fe693-0.
INFO 03-02 00:35:10 [logger.py:42] Received request cmpl-f8272670cebf474ab36ef909d26421a8-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:35:10 [async_llm.py:261] Added request cmpl-f8272670cebf474ab36ef909d26421a8-0.
INFO 03-02 00:35:12 [logger.py:42] Received request cmpl-afd6e0fe229b4e36bf1dfb728e76133e-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:35:12 [async_llm.py:261] Added request cmpl-afd6e0fe229b4e36bf1dfb728e76133e-0.
INFO 03-02 00:35:13 [logger.py:42] Received request cmpl-cafdc4b0e08d4516bbcf6c9031cac67c-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:35:13 [async_llm.py:261] Added request cmpl-cafdc4b0e08d4516bbcf6c9031cac67c-0.
INFO 03-02 00:35:14 [logger.py:42] Received request cmpl-6b9dd9ea38124c6f8ebc9e90462f52cb-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:35:14 [async_llm.py:261] Added request cmpl-6b9dd9ea38124c6f8ebc9e90462f52cb-0.
INFO 03-02 00:35:15 [logger.py:42] Received request cmpl-c6021152a30e44318c4c927afaed9056-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:35:15 [async_llm.py:261] Added request cmpl-c6021152a30e44318c4c927afaed9056-0.
INFO 03-02 00:35:16 [logger.py:42] Received request cmpl-5cf81984d0b049deaed60e532ff07f39-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:35:16 [async_llm.py:261] Added request cmpl-5cf81984d0b049deaed60e532ff07f39-0.
INFO 03-02 00:35:17 [logger.py:42] Received request cmpl-01e9b3b268c04e6c98794df5b485ea3a-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:35:17 [async_llm.py:261] Added request cmpl-01e9b3b268c04e6c98794df5b485ea3a-0.
INFO 03-02 00:35:18 [loggers.py:116] Engine 000: Avg prompt throughput: 27.9 tokens/s, Avg generation throughput: 4.5 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.6%
INFO 03-02 00:35:19 [logger.py:42] Received request cmpl-2cb47130adb444bbb992fb54aed9cbd0-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:35:19 [async_llm.py:261] Added request cmpl-2cb47130adb444bbb992fb54aed9cbd0-0.
INFO 03-02 00:35:20 [logger.py:42] Received request cmpl-9178578a5e554268a9f5dc02e7b0c6cd-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:35:20 [async_llm.py:261] Added request cmpl-9178578a5e554268a9f5dc02e7b0c6cd-0.
INFO 03-02 00:35:21 [logger.py:42] Received request cmpl-8016ec176ef7497f8284376b535d0d42-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:35:21 [async_llm.py:261] Added request cmpl-8016ec176ef7497f8284376b535d0d42-0.
INFO 03-02 00:35:22 [logger.py:42] Received request cmpl-004f1206b69a45df9d79faaac3c027a8-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:35:22 [async_llm.py:261] Added request cmpl-004f1206b69a45df9d79faaac3c027a8-0.
INFO 03-02 00:35:23 [logger.py:42] Received request cmpl-1c941131b7624da28ef1d1d6d9c3e963-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:35:23 [async_llm.py:261] Added request cmpl-1c941131b7624da28ef1d1d6d9c3e963-0.
INFO 03-02 00:35:24 [logger.py:42] Received request cmpl-385f01fcc2884741b104c5569cb3d3c5-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:35:24 [async_llm.py:261] Added request cmpl-385f01fcc2884741b104c5569cb3d3c5-0.
INFO 03-02 00:35:25 [logger.py:42] Received request cmpl-4969e3ddf2944071b5cdb1a6b6c2eb41-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:35:25 [async_llm.py:261] Added request cmpl-4969e3ddf2944071b5cdb1a6b6c2eb41-0.
INFO 03-02 00:35:27 [logger.py:42] Received request cmpl-4a8249443c934a9280ea2264c5e89c41-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:35:27 [async_llm.py:261] Added request cmpl-4a8249443c934a9280ea2264c5e89c41-0.
INFO 03-02 00:35:28 [loggers.py:116] Engine 000: Avg prompt throughput: 24.8 tokens/s, Avg generation throughput: 4.0 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.6%
INFO 03-02 00:35:28 [logger.py:42] Received request cmpl-9736d3ba7bdd439b9ab2004543251eb1-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:35:28 [async_llm.py:261] Added request cmpl-9736d3ba7bdd439b9ab2004543251eb1-0.
INFO 03-02 00:35:29 [logger.py:42] Received request cmpl-8bfa5505ce7c429d8e5e512f68887559-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:35:29 [async_llm.py:261] Added request cmpl-8bfa5505ce7c429d8e5e512f68887559-0.
INFO 03-02 00:35:30 [logger.py:42] Received request cmpl-4b251c1823be4f6182012d381c793237-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:35:30 [async_llm.py:261] Added request cmpl-4b251c1823be4f6182012d381c793237-0.
INFO 03-02 00:35:31 [logger.py:42] Received request cmpl-ba832dc2ce104213a719ee936b265a91-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:35:31 [async_llm.py:261] Added request cmpl-ba832dc2ce104213a719ee936b265a91-0.
INFO 03-02 00:35:32 [logger.py:42] Received request cmpl-7e683c6f7e184afb98651e5040ff1673-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:35:32 [async_llm.py:261] Added request cmpl-7e683c6f7e184afb98651e5040ff1673-0.
INFO 03-02 00:35:34 [logger.py:42] Received request cmpl-91f6cb8c180a4f35b6cdb4f2d4502b31-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:35:34 [async_llm.py:261] Added request cmpl-91f6cb8c180a4f35b6cdb4f2d4502b31-0.
INFO 03-02 00:35:35 [logger.py:42] Received request cmpl-4c572aac6788410c9dc0168245672a1f-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:35:35 [async_llm.py:261] Added request cmpl-4c572aac6788410c9dc0168245672a1f-0.
INFO 03-02 00:35:36 [logger.py:42] Received request cmpl-2a57bdc31bd846f58449cb093ee1ca01-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:35:36 [async_llm.py:261] Added request cmpl-2a57bdc31bd846f58449cb093ee1ca01-0.
INFO 03-02 00:35:37 [logger.py:42] Received request cmpl-f70cb3d931e34c90aa358da9e4a72ab7-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:35:37 [async_llm.py:261] Added request cmpl-f70cb3d931e34c90aa358da9e4a72ab7-0.
INFO 03-02 00:35:38 [loggers.py:116] Engine 000: Avg prompt throughput: 27.9 tokens/s, Avg generation throughput: 4.5 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.6%
INFO 03-02 00:35:38 [logger.py:42] Received request cmpl-5174013b2ff341c9ae6a982ae37b9da6-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:35:38 [async_llm.py:261] Added request cmpl-5174013b2ff341c9ae6a982ae37b9da6-0.
INFO 03-02 00:35:39 [logger.py:42] Received request cmpl-0b8b1deda5e044d2b23a1b01c298671e-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:35:39 [async_llm.py:261] Added request cmpl-0b8b1deda5e044d2b23a1b01c298671e-0.
INFO 03-02 00:35:40 [logger.py:42] Received request cmpl-3cb1550204834bd2a0179f2a847e162c-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:35:40 [async_llm.py:261] Added request cmpl-3cb1550204834bd2a0179f2a847e162c-0.
INFO 03-02 00:35:42 [logger.py:42] Received request cmpl-d26ec20a574b45f28522ddbb10cca52c-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:35:42 [async_llm.py:261] Added request cmpl-d26ec20a574b45f28522ddbb10cca52c-0.
INFO 03-02 00:35:43 [logger.py:42] Received request cmpl-f9beb9ecdec646df83565655bb6787e3-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:35:43 [async_llm.py:261] Added request cmpl-f9beb9ecdec646df83565655bb6787e3-0.
INFO 03-02 00:35:44 [logger.py:42] Received request cmpl-c04370433cf54fec989bdfa6112065e5-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:35:44 [async_llm.py:261] Added request cmpl-c04370433cf54fec989bdfa6112065e5-0.
INFO 03-02 00:35:45 [logger.py:42] Received request cmpl-e7530030e4834c61a3cac73eaeee9f19-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:35:45 [async_llm.py:261] Added request cmpl-e7530030e4834c61a3cac73eaeee9f19-0.
INFO 03-02 00:35:46 [logger.py:42] Received request cmpl-ac489e8f80a94e3ebd9c7bd5131a6a1d-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:35:46 [async_llm.py:261] Added request cmpl-ac489e8f80a94e3ebd9c7bd5131a6a1d-0.
INFO 03-02 00:35:47 [logger.py:42] Received request cmpl-2d0e055320f845a3b862c5a418782b7e-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:35:47 [async_llm.py:261] Added request cmpl-2d0e055320f845a3b862c5a418782b7e-0.
INFO 03-02 00:35:48 [loggers.py:116] Engine 000: Avg prompt throughput: 27.9 tokens/s, Avg generation throughput: 4.5 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.6%
INFO 03-02 00:35:49 [logger.py:42] Received request cmpl-d8a22d2483be467a98c9c63fcc5ce171-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:35:49 [async_llm.py:261] Added request cmpl-d8a22d2483be467a98c9c63fcc5ce171-0.
INFO 03-02 00:35:50 [logger.py:42] Received request cmpl-7f2e50539dc748e3b2a1f7822850ea9d-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:35:50 [async_llm.py:261] Added request cmpl-7f2e50539dc748e3b2a1f7822850ea9d-0.
INFO 03-02 00:35:51 [logger.py:42] Received request cmpl-eb6b68382b59493fb66c3073e95d2e2d-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:35:51 [async_llm.py:261] Added request cmpl-eb6b68382b59493fb66c3073e95d2e2d-0.
INFO 03-02 00:35:52 [logger.py:42] Received request cmpl-0465d444b8e14e699de59ccadc67d731-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:35:52 [async_llm.py:261] Added request cmpl-0465d444b8e14e699de59ccadc67d731-0.
INFO 03-02 00:35:53 [logger.py:42] Received request cmpl-a08bddd4f228461da1fcf292c50c57f8-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:35:53 [async_llm.py:261] Added request cmpl-a08bddd4f228461da1fcf292c50c57f8-0.
INFO 03-02 00:35:54 [logger.py:42] Received request cmpl-06642de348d44dc48cfec178b06293c5-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:35:54 [async_llm.py:261] Added request cmpl-06642de348d44dc48cfec178b06293c5-0.
INFO 03-02 00:35:55 [logger.py:42] Received request cmpl-8d856a16eb294fd781cf193cca0c3fb1-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:35:55 [async_llm.py:261] Added request cmpl-8d856a16eb294fd781cf193cca0c3fb1-0.
INFO 03-02 00:35:57 [logger.py:42] Received request cmpl-1a6b905ade4c4bf3abcdb1e89ce3e6c3-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:35:57 [async_llm.py:261] Added request cmpl-1a6b905ade4c4bf3abcdb1e89ce3e6c3-0.
INFO 03-02 00:35:58 [loggers.py:116] Engine 000: Avg prompt throughput: 24.8 tokens/s, Avg generation throughput: 4.0 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.6%
INFO 03-02 00:35:58 [logger.py:42] Received request cmpl-2bde166a4ebf45b2af24f2bd82918280-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:35:58 [async_llm.py:261] Added request cmpl-2bde166a4ebf45b2af24f2bd82918280-0.
INFO 03-02 00:35:59 [logger.py:42] Received request cmpl-89b888a8e560467cb82bc7ffa965bca7-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:35:59 [async_llm.py:261] Added request cmpl-89b888a8e560467cb82bc7ffa965bca7-0.
INFO 03-02 00:36:00 [logger.py:42] Received request cmpl-81eb8925ec554845b5d0ab8377b6514f-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:36:00 [async_llm.py:261] Added request cmpl-81eb8925ec554845b5d0ab8377b6514f-0.
INFO 03-02 00:36:01 [logger.py:42] Received request cmpl-62fa9b70256b4a33812e5013f952282f-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:36:01 [async_llm.py:261] Added request cmpl-62fa9b70256b4a33812e5013f952282f-0.
INFO 03-02 00:36:02 [logger.py:42] Received request cmpl-09bad9d4339943ddba85b4859ea25849-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:36:02 [async_llm.py:261] Added request cmpl-09bad9d4339943ddba85b4859ea25849-0.
INFO 03-02 00:36:04 [logger.py:42] Received request cmpl-53375832ca744364930d055e899c4052-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:36:04 [async_llm.py:261] Added request cmpl-53375832ca744364930d055e899c4052-0.
INFO 03-02 00:36:05 [logger.py:42] Received request cmpl-7092d72e22a14cb3add4317319810669-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:36:05 [async_llm.py:261] Added request cmpl-7092d72e22a14cb3add4317319810669-0.
INFO 03-02 00:36:06 [logger.py:42] Received request cmpl-67afe27eaa034010ad9f21ebd46aa961-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:36:06 [async_llm.py:261] Added request cmpl-67afe27eaa034010ad9f21ebd46aa961-0.
INFO 03-02 00:36:07 [logger.py:42] Received request cmpl-71054551ca614281b6d000d6dbf8ea7b-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:36:07 [async_llm.py:261] Added request cmpl-71054551ca614281b6d000d6dbf8ea7b-0.
INFO 03-02 00:36:08 [loggers.py:116] Engine 000: Avg prompt throughput: 27.9 tokens/s, Avg generation throughput: 4.5 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.6%
INFO 03-02 00:36:08 [logger.py:42] Received request cmpl-43de5dad75d842858fefd108f2c2f317-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:36:08 [async_llm.py:261] Added request cmpl-43de5dad75d842858fefd108f2c2f317-0.
INFO 03-02 00:36:09 [logger.py:42] Received request cmpl-c09ba12b7e8444baaecf0d23a37901ae-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:36:09 [async_llm.py:261] Added request cmpl-c09ba12b7e8444baaecf0d23a37901ae-0.
INFO 03-02 00:36:10 [logger.py:42] Received request cmpl-14b96f91d88043dca5e3fbbc355a32fe-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:36:10 [async_llm.py:261] Added request cmpl-14b96f91d88043dca5e3fbbc355a32fe-0.
INFO 03-02 00:36:12 [logger.py:42] Received request cmpl-8fffbabf70d94bfb8e0f047ea6dafd23-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:36:12 [async_llm.py:261] Added request cmpl-8fffbabf70d94bfb8e0f047ea6dafd23-0.
INFO 03-02 00:36:13 [logger.py:42] Received request cmpl-1be32552b45c4127bf483f6ff95bb902-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:36:13 [async_llm.py:261] Added request cmpl-1be32552b45c4127bf483f6ff95bb902-0.
INFO 03-02 00:36:14 [logger.py:42] Received request cmpl-cabb2520a25848ed823a5c493734a3b1-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:36:14 [async_llm.py:261] Added request cmpl-cabb2520a25848ed823a5c493734a3b1-0.
INFO 03-02 00:36:15 [logger.py:42] Received request cmpl-bc6cf6633ab548a4a703cc5d6a9ffe58-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:36:15 [async_llm.py:261] Added request cmpl-bc6cf6633ab548a4a703cc5d6a9ffe58-0.
INFO 03-02 00:36:16 [logger.py:42] Received request cmpl-abbe58b7278047b1af9d0473713359db-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:36:16 [async_llm.py:261] Added request cmpl-abbe58b7278047b1af9d0473713359db-0.
INFO 03-02 00:36:17 [logger.py:42] Received request cmpl-c688adb81ca7491da44d9f8b561d66ac-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:36:17 [async_llm.py:261] Added request cmpl-c688adb81ca7491da44d9f8b561d66ac-0.
INFO 03-02 00:36:18 [loggers.py:116] Engine 000: Avg prompt throughput: 27.9 tokens/s, Avg generation throughput: 4.5 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.6%
INFO 03-02 00:36:19 [logger.py:42] Received request cmpl-5aa78798435040ca9df68b5949c0f615-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:36:19 [async_llm.py:261] Added request cmpl-5aa78798435040ca9df68b5949c0f615-0.
INFO 03-02 00:36:20 [logger.py:42] Received request cmpl-d2a11d87ecc0484d85149b9aa88d2a0f-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:36:20 [async_llm.py:261] Added request cmpl-d2a11d87ecc0484d85149b9aa88d2a0f-0.
INFO 03-02 00:36:21 [logger.py:42] Received request cmpl-60512c01dffe409c85861c5974379526-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:36:21 [async_llm.py:261] Added request cmpl-60512c01dffe409c85861c5974379526-0.
INFO 03-02 00:36:22 [logger.py:42] Received request cmpl-57b237df63d8464bbc165bf19365a1d7-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:36:22 [async_llm.py:261] Added request cmpl-57b237df63d8464bbc165bf19365a1d7-0.
INFO 03-02 00:36:23 [logger.py:42] Received request cmpl-9450c9f579814a3f90a237f9ddbb2492-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:36:23 [async_llm.py:261] Added request cmpl-9450c9f579814a3f90a237f9ddbb2492-0.
INFO 03-02 00:36:24 [logger.py:42] Received request cmpl-1ca8296afd6943fe96bcc2ec94bb2bfb-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:36:24 [async_llm.py:261] Added request cmpl-1ca8296afd6943fe96bcc2ec94bb2bfb-0.
INFO 03-02 00:36:25 [logger.py:42] Received request cmpl-6b465509f4de43ffbc5497ad4faeb91c-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:36:25 [async_llm.py:261] Added request cmpl-6b465509f4de43ffbc5497ad4faeb91c-0.
INFO 03-02 00:36:27 [logger.py:42] Received request cmpl-566717f602554e0a8b47ac25754b7ab9-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:36:27 [async_llm.py:261] Added request cmpl-566717f602554e0a8b47ac25754b7ab9-0.
INFO 03-02 00:36:28 [loggers.py:116] Engine 000: Avg prompt throughput: 24.8 tokens/s, Avg generation throughput: 4.0 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.6%
INFO 03-02 00:36:28 [logger.py:42] Received request cmpl-ec275017e7d54d1391f80ac68993b718-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:36:28 [async_llm.py:261] Added request cmpl-ec275017e7d54d1391f80ac68993b718-0.
INFO 03-02 00:36:29 [logger.py:42] Received request cmpl-cab43d5255a840768466cbaece139fcc-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:36:29 [async_llm.py:261] Added request cmpl-cab43d5255a840768466cbaece139fcc-0.
INFO 03-02 00:36:30 [logger.py:42] Received request cmpl-1f9d076f9ced46a7b4e561b054d08245-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:36:30 [async_llm.py:261] Added request cmpl-1f9d076f9ced46a7b4e561b054d08245-0.
INFO 03-02 00:36:31 [logger.py:42] Received request cmpl-b551dbcc62ac45efa71685776afe239e-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:36:31 [async_llm.py:261] Added request cmpl-b551dbcc62ac45efa71685776afe239e-0.
INFO 03-02 00:36:32 [logger.py:42] Received request cmpl-50dcf314ac6b426f85b831291e6afbe2-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:36:32 [async_llm.py:261] Added request cmpl-50dcf314ac6b426f85b831291e6afbe2-0.
INFO 03-02 00:36:34 [logger.py:42] Received request cmpl-e94169c2bc2e4c398a2f09a862589be7-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:36:34 [async_llm.py:261] Added request cmpl-e94169c2bc2e4c398a2f09a862589be7-0.
INFO 03-02 00:36:35 [logger.py:42] Received request cmpl-eb3705809043406db4e526d66e5167fa-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:36:35 [async_llm.py:261] Added request cmpl-eb3705809043406db4e526d66e5167fa-0.
INFO 03-02 00:36:36 [logger.py:42] Received request cmpl-7ff149bed64145ee927d6e486df76c55-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:36:36 [async_llm.py:261] Added request cmpl-7ff149bed64145ee927d6e486df76c55-0.
INFO 03-02 00:36:37 [logger.py:42] Received request cmpl-add7627de97f4a1792e8d457fd31aa71-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:36:37 [async_llm.py:261] Added request cmpl-add7627de97f4a1792e8d457fd31aa71-0.
INFO 03-02 00:36:38 [loggers.py:116] Engine 000: Avg prompt throughput: 27.9 tokens/s, Avg generation throughput: 4.5 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.6%
INFO 03-02 00:36:38 [logger.py:42] Received request cmpl-363339d728084a6eb666c2aef4a5db82-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:36:38 [async_llm.py:261] Added request cmpl-363339d728084a6eb666c2aef4a5db82-0.
INFO 03-02 00:36:39 [logger.py:42] Received request cmpl-89ce19ebab1a49daa8dca19d7b1e8456-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:36:39 [async_llm.py:261] Added request cmpl-89ce19ebab1a49daa8dca19d7b1e8456-0.
INFO 03-02 00:36:40 [logger.py:42] Received request cmpl-6a98289cbdd74dd2be14657cfafcca5e-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:36:40 [async_llm.py:261] Added request cmpl-6a98289cbdd74dd2be14657cfafcca5e-0.
INFO 03-02 00:36:42 [logger.py:42] Received request cmpl-db9620c7e58342c58aec89ec2edcb77b-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:36:42 [async_llm.py:261] Added request cmpl-db9620c7e58342c58aec89ec2edcb77b-0.
INFO 03-02 00:36:43 [logger.py:42] Received request cmpl-0464dd1eab4244bb9560710c41e50de0-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:36:43 [async_llm.py:261] Added request cmpl-0464dd1eab4244bb9560710c41e50de0-0.
INFO 03-02 00:36:44 [logger.py:42] Received request cmpl-f53eea59277b48229aab9713ba353cfa-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:36:44 [async_llm.py:261] Added request cmpl-f53eea59277b48229aab9713ba353cfa-0.
INFO 03-02 00:36:45 [logger.py:42] Received request cmpl-ff8330da6033411980534f74b6d7d172-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:36:45 [async_llm.py:261] Added request cmpl-ff8330da6033411980534f74b6d7d172-0.
INFO 03-02 00:36:46 [logger.py:42] Received request cmpl-69fe9cfd8f2c432297dea6fa2a1b64fa-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:36:46 [async_llm.py:261] Added request cmpl-69fe9cfd8f2c432297dea6fa2a1b64fa-0.
INFO 03-02 00:36:47 [logger.py:42] Received request cmpl-f33c427482664772b0e15c9ed4f54f01-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:36:47 [async_llm.py:261] Added request cmpl-f33c427482664772b0e15c9ed4f54f01-0.
INFO 03-02 00:36:48 [loggers.py:116] Engine 000: Avg prompt throughput: 27.9 tokens/s, Avg generation throughput: 4.5 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.6%
INFO 03-02 00:36:49 [logger.py:42] Received request cmpl-e50bfe3d27f4449b83e00d446841576b-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:36:49 [async_llm.py:261] Added request cmpl-e50bfe3d27f4449b83e00d446841576b-0.
INFO 03-02 00:36:50 [logger.py:42] Received request cmpl-fc9d661fdc3f4af2845b4ebc77520e01-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:36:50 [async_llm.py:261] Added request cmpl-fc9d661fdc3f4af2845b4ebc77520e01-0.
INFO 03-02 00:36:51 [logger.py:42] Received request cmpl-3912ae89dd4041f78b7c459a6f1e9145-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:36:51 [async_llm.py:261] Added request cmpl-3912ae89dd4041f78b7c459a6f1e9145-0.
INFO 03-02 00:36:52 [logger.py:42] Received request cmpl-a79f14712db34f1c9eb682b08441e890-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:36:52 [async_llm.py:261] Added request cmpl-a79f14712db34f1c9eb682b08441e890-0.
INFO 03-02 00:36:53 [logger.py:42] Received request cmpl-53779768ae2e4d4aa628815b783d5515-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:36:53 [async_llm.py:261] Added request cmpl-53779768ae2e4d4aa628815b783d5515-0.
INFO 03-02 00:36:54 [logger.py:42] Received request cmpl-924afca4109244e7a12314c6195e74cd-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:36:54 [async_llm.py:261] Added request cmpl-924afca4109244e7a12314c6195e74cd-0.
INFO 03-02 00:36:55 [logger.py:42] Received request cmpl-5f36c1c268e643048f5f157e16313534-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:36:55 [async_llm.py:261] Added request cmpl-5f36c1c268e643048f5f157e16313534-0.
INFO 03-02 00:36:57 [logger.py:42] Received request cmpl-c8f4b7509d754821a3cc6539c51e991b-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:36:57 [async_llm.py:261] Added request cmpl-c8f4b7509d754821a3cc6539c51e991b-0.
INFO 03-02 00:36:58 [loggers.py:116] Engine 000: Avg prompt throughput: 24.8 tokens/s, Avg generation throughput: 4.0 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.6%
INFO 03-02 00:36:58 [logger.py:42] Received request cmpl-0f8c779b4090447991c192b2d7322928-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:36:58 [async_llm.py:261] Added request cmpl-0f8c779b4090447991c192b2d7322928-0.
INFO 03-02 00:36:59 [logger.py:42] Received request cmpl-f47b34b63ba44da7a75c2a6b8cc62f42-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:36:59 [async_llm.py:261] Added request cmpl-f47b34b63ba44da7a75c2a6b8cc62f42-0.
INFO 03-02 00:37:00 [logger.py:42] Received request cmpl-31b15698b17f4853943313678db631b9-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:37:00 [async_llm.py:261] Added request cmpl-31b15698b17f4853943313678db631b9-0.
INFO 03-02 00:37:01 [logger.py:42] Received request cmpl-0b3cda609d5d4a7b9ad96001176da293-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:37:01 [async_llm.py:261] Added request cmpl-0b3cda609d5d4a7b9ad96001176da293-0.
INFO 03-02 00:37:02 [logger.py:42] Received request cmpl-7827a044c41a4d4d8e9632388343d8f5-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:37:02 [async_llm.py:261] Added request cmpl-7827a044c41a4d4d8e9632388343d8f5-0.
INFO 03-02 00:37:04 [logger.py:42] Received request cmpl-ea74c2012ccc4b508f948cabdf41402d-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:37:04 [async_llm.py:261] Added request cmpl-ea74c2012ccc4b508f948cabdf41402d-0.
INFO 03-02 00:37:05 [logger.py:42] Received request cmpl-bd844a73dafe4b5183b53edd74edd2b4-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:37:05 [async_llm.py:261] Added request cmpl-bd844a73dafe4b5183b53edd74edd2b4-0.
INFO 03-02 00:37:06 [logger.py:42] Received request cmpl-7a9a44039d9d41fa972fce6b552c4785-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:37:06 [async_llm.py:261] Added request cmpl-7a9a44039d9d41fa972fce6b552c4785-0.
INFO 03-02 00:37:07 [logger.py:42] Received request cmpl-b84b48f6257f43b58202ad85576e83ac-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:37:07 [async_llm.py:261] Added request cmpl-b84b48f6257f43b58202ad85576e83ac-0.
INFO 03-02 00:37:08 [loggers.py:116] Engine 000: Avg prompt throughput: 27.9 tokens/s, Avg generation throughput: 4.5 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.6%
INFO 03-02 00:37:08 [logger.py:42] Received request cmpl-cb90bdfef1b641a7a3a1836f4074a29f-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:37:08 [async_llm.py:261] Added request cmpl-cb90bdfef1b641a7a3a1836f4074a29f-0.
INFO 03-02 00:37:09 [logger.py:42] Received request cmpl-86d2cdbcb33641af9a8ec00113e31d72-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:37:09 [async_llm.py:261] Added request cmpl-86d2cdbcb33641af9a8ec00113e31d72-0.
INFO 03-02 00:37:10 [logger.py:42] Received request cmpl-f612edc6e290487c8752ac611a8b0bfc-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:37:10 [async_llm.py:261] Added request cmpl-f612edc6e290487c8752ac611a8b0bfc-0.
INFO 03-02 00:37:12 [logger.py:42] Received request cmpl-4aade86b04694a429cf04825f4dbd423-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:37:12 [async_llm.py:261] Added request cmpl-4aade86b04694a429cf04825f4dbd423-0.
INFO 03-02 00:37:13 [logger.py:42] Received request cmpl-9c8a15818f284015a51eba69feb4db0e-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:37:13 [async_llm.py:261] Added request cmpl-9c8a15818f284015a51eba69feb4db0e-0.
INFO 03-02 00:37:14 [logger.py:42] Received request cmpl-308e1afe3fb54bf3b9722db5544bac73-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:37:14 [async_llm.py:261] Added request cmpl-308e1afe3fb54bf3b9722db5544bac73-0.
INFO 03-02 00:37:15 [logger.py:42] Received request cmpl-776351758ab948f4bc4f2942c277d960-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:37:15 [async_llm.py:261] Added request cmpl-776351758ab948f4bc4f2942c277d960-0.
INFO 03-02 00:37:16 [logger.py:42] Received request cmpl-66388e8d976a4833afe1de77c835356f-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:37:16 [async_llm.py:261] Added request cmpl-66388e8d976a4833afe1de77c835356f-0.
INFO 03-02 00:37:17 [logger.py:42] Received request cmpl-7125627af3b44b3597f88e266eb01a02-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:37:17 [async_llm.py:261] Added request cmpl-7125627af3b44b3597f88e266eb01a02-0.
INFO 03-02 00:37:18 [loggers.py:116] Engine 000: Avg prompt throughput: 27.9 tokens/s, Avg generation throughput: 4.5 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.6%
INFO 03-02 00:37:19 [logger.py:42] Received request cmpl-5971acb258a242179e067db2057e590d-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:37:19 [async_llm.py:261] Added request cmpl-5971acb258a242179e067db2057e590d-0.
INFO 03-02 00:37:20 [logger.py:42] Received request cmpl-840968cf4a4141fdbc62d23ae1cacfd3-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:37:20 [async_llm.py:261] Added request cmpl-840968cf4a4141fdbc62d23ae1cacfd3-0.
INFO 03-02 00:37:21 [logger.py:42] Received request cmpl-b50af10e0ad64d40a703e21b8f93895e-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:37:21 [async_llm.py:261] Added request cmpl-b50af10e0ad64d40a703e21b8f93895e-0.
INFO 03-02 00:37:22 [logger.py:42] Received request cmpl-ebb597ea93724b8fb2e3b946e59ec52f-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:37:22 [async_llm.py:261] Added request cmpl-ebb597ea93724b8fb2e3b946e59ec52f-0.
INFO 03-02 00:37:23 [logger.py:42] Received request cmpl-dabd66811c2242098cc3f2afe3d50dc0-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:37:23 [async_llm.py:261] Added request cmpl-dabd66811c2242098cc3f2afe3d50dc0-0.
INFO 03-02 00:37:24 [logger.py:42] Received request cmpl-aa68041765234696bac12984ac2fc18b-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:37:24 [async_llm.py:261] Added request cmpl-aa68041765234696bac12984ac2fc18b-0.
INFO 03-02 00:37:25 [logger.py:42] Received request cmpl-0a6bab0fab5e47ac91d33bad415689c2-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:37:25 [async_llm.py:261] Added request cmpl-0a6bab0fab5e47ac91d33bad415689c2-0.
INFO 03-02 00:37:27 [logger.py:42] Received request cmpl-fa90f22b15d8458598d3106b0f453647-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:37:27 [async_llm.py:261] Added request cmpl-fa90f22b15d8458598d3106b0f453647-0.
INFO 03-02 00:37:28 [loggers.py:116] Engine 000: Avg prompt throughput: 24.8 tokens/s, Avg generation throughput: 4.0 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.6%
INFO 03-02 00:37:28 [logger.py:42] Received request cmpl-f783d8c4986d4f1c996e7889d638aa35-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:37:28 [async_llm.py:261] Added request cmpl-f783d8c4986d4f1c996e7889d638aa35-0.
INFO 03-02 00:37:29 [logger.py:42] Received request cmpl-373675cb545742508c3fa31e1aeb97b7-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:37:29 [async_llm.py:261] Added request cmpl-373675cb545742508c3fa31e1aeb97b7-0.
INFO 03-02 00:37:30 [logger.py:42] Received request cmpl-5c5b0d11fd6a467181f86ca51702ff84-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:37:30 [async_llm.py:261] Added request cmpl-5c5b0d11fd6a467181f86ca51702ff84-0.
INFO 03-02 00:37:31 [logger.py:42] Received request cmpl-199531ae1dfa4ba4bbf4e4af3cc549ea-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:37:31 [async_llm.py:261] Added request cmpl-199531ae1dfa4ba4bbf4e4af3cc549ea-0.
INFO 03-02 00:37:32 [logger.py:42] Received request cmpl-7248ba66beac45f2b4be1c88b6f58af1-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:37:32 [async_llm.py:261] Added request cmpl-7248ba66beac45f2b4be1c88b6f58af1-0.
INFO 03-02 00:37:34 [logger.py:42] Received request cmpl-2c6293c12c1f4089a254ee624085e64e-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:37:34 [async_llm.py:261] Added request cmpl-2c6293c12c1f4089a254ee624085e64e-0.
INFO 03-02 00:37:35 [logger.py:42] Received request cmpl-bc0c02ae38414147b8e04bb51116b1e1-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:37:35 [async_llm.py:261] Added request cmpl-bc0c02ae38414147b8e04bb51116b1e1-0.
INFO 03-02 00:37:36 [logger.py:42] Received request cmpl-44e8369dcadd4a0c9ee9bcabb77b7c87-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:37:36 [async_llm.py:261] Added request cmpl-44e8369dcadd4a0c9ee9bcabb77b7c87-0.
INFO 03-02 00:37:37 [logger.py:42] Received request cmpl-0c2f905ec0d64b7097ebd5b285120d80-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:37:37 [async_llm.py:261] Added request cmpl-0c2f905ec0d64b7097ebd5b285120d80-0.
INFO 03-02 00:37:38 [loggers.py:116] Engine 000: Avg prompt throughput: 27.9 tokens/s, Avg generation throughput: 4.5 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.6%
INFO 03-02 00:37:38 [logger.py:42] Received request cmpl-d6564213a4224b8c93c07f55c6e3ddec-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:37:38 [async_llm.py:261] Added request cmpl-d6564213a4224b8c93c07f55c6e3ddec-0.
INFO 03-02 00:37:39 [logger.py:42] Received request cmpl-6318ec664b3b4046a5a53d6b7b9f6938-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:37:39 [async_llm.py:261] Added request cmpl-6318ec664b3b4046a5a53d6b7b9f6938-0.
INFO 03-02 00:37:40 [logger.py:42] Received request cmpl-879f57ec5c4049b884a521d2ea7e7fee-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:37:40 [async_llm.py:261] Added request cmpl-879f57ec5c4049b884a521d2ea7e7fee-0.
INFO 03-02 00:37:42 [logger.py:42] Received request cmpl-8da72ffcb3b74731bd866a5e2390102e-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:37:42 [async_llm.py:261] Added request cmpl-8da72ffcb3b74731bd866a5e2390102e-0.
INFO 03-02 00:37:43 [logger.py:42] Received request cmpl-1dc0d67f32e14e619861fc255278dace-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:37:43 [async_llm.py:261] Added request cmpl-1dc0d67f32e14e619861fc255278dace-0.
INFO 03-02 00:37:44 [logger.py:42] Received request cmpl-cdabf8923b234324b78f5b6f0ff687b7-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:37:44 [async_llm.py:261] Added request cmpl-cdabf8923b234324b78f5b6f0ff687b7-0.
INFO 03-02 00:37:45 [logger.py:42] Received request cmpl-b6ac0272050044d0a8408eb1180d61c7-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:37:45 [async_llm.py:261] Added request cmpl-b6ac0272050044d0a8408eb1180d61c7-0.
INFO 03-02 00:37:46 [logger.py:42] Received request cmpl-91739923d6384845b1f99121dae9b220-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:37:46 [async_llm.py:261] Added request cmpl-91739923d6384845b1f99121dae9b220-0.
INFO 03-02 00:37:47 [logger.py:42] Received request cmpl-eb8ba333057c4e6eaa59e65c85b1c8b1-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:37:47 [async_llm.py:261] Added request cmpl-eb8ba333057c4e6eaa59e65c85b1c8b1-0.
INFO 03-02 00:37:48 [loggers.py:116] Engine 000: Avg prompt throughput: 27.9 tokens/s, Avg generation throughput: 4.5 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.6%
INFO 03-02 00:37:49 [logger.py:42] Received request cmpl-d351f6559bef4844b1bdfd559c790a59-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:37:49 [async_llm.py:261] Added request cmpl-d351f6559bef4844b1bdfd559c790a59-0.
INFO 03-02 00:37:50 [logger.py:42] Received request cmpl-fc256f8310734b48acfc2ed6f724f44f-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:37:50 [async_llm.py:261] Added request cmpl-fc256f8310734b48acfc2ed6f724f44f-0.
INFO 03-02 00:37:51 [logger.py:42] Received request cmpl-b21d4b248b104d0bb32c7354bd0509a0-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:37:51 [async_llm.py:261] Added request cmpl-b21d4b248b104d0bb32c7354bd0509a0-0.
INFO 03-02 00:37:52 [logger.py:42] Received request cmpl-bdb39fc38ee34f508bc0a983ae54798b-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:37:52 [async_llm.py:261] Added request cmpl-bdb39fc38ee34f508bc0a983ae54798b-0.
INFO 03-02 00:37:53 [logger.py:42] Received request cmpl-b722ad8add084cfdb9521cdc644f2c4e-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:37:53 [async_llm.py:261] Added request cmpl-b722ad8add084cfdb9521cdc644f2c4e-0.
INFO 03-02 00:37:54 [logger.py:42] Received request cmpl-6f456d8026c248ae88aca70a69eb6773-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:37:54 [async_llm.py:261] Added request cmpl-6f456d8026c248ae88aca70a69eb6773-0.
INFO 03-02 00:37:56 [logger.py:42] Received request cmpl-4825d942ffe44acabefe68572bc03581-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:37:56 [async_llm.py:261] Added request cmpl-4825d942ffe44acabefe68572bc03581-0.
INFO 03-02 00:37:57 [logger.py:42] Received request cmpl-412a7296887a4017a386c539fc07ea64-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:37:57 [async_llm.py:261] Added request cmpl-412a7296887a4017a386c539fc07ea64-0.
INFO 03-02 00:37:58 [loggers.py:116] Engine 000: Avg prompt throughput: 24.8 tokens/s, Avg generation throughput: 4.0 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.6%
INFO 03-02 00:37:58 [logger.py:42] Received request cmpl-89e7fcbe897941219a64ad3f01c82e39-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:37:58 [async_llm.py:261] Added request cmpl-89e7fcbe897941219a64ad3f01c82e39-0.
INFO 03-02 00:37:59 [logger.py:42] Received request cmpl-a568f55446894af4be52e1a45e19ccfc-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:37:59 [async_llm.py:261] Added request cmpl-a568f55446894af4be52e1a45e19ccfc-0.
INFO 03-02 00:38:00 [logger.py:42] Received request cmpl-554d421497504947926ce572de72c97c-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:38:00 [async_llm.py:261] Added request cmpl-554d421497504947926ce572de72c97c-0.
INFO 03-02 00:38:01 [logger.py:42] Received request cmpl-38960748654c4b52b295c9c34256c7ca-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:38:01 [async_llm.py:261] Added request cmpl-38960748654c4b52b295c9c34256c7ca-0.
INFO 03-02 00:38:02 [logger.py:42] Received request cmpl-9f59f87015c1441582d45374834dcca9-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:38:02 [async_llm.py:261] Added request cmpl-9f59f87015c1441582d45374834dcca9-0.
INFO 03-02 00:38:04 [logger.py:42] Received request cmpl-603185b7550848d1b803f77c0d76dbf6-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:38:04 [async_llm.py:261] Added request cmpl-603185b7550848d1b803f77c0d76dbf6-0.
INFO 03-02 00:38:05 [logger.py:42] Received request cmpl-6b36e95a966e444ebfe3592ed3cb7f15-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:38:05 [async_llm.py:261] Added request cmpl-6b36e95a966e444ebfe3592ed3cb7f15-0.
INFO 03-02 00:38:06 [logger.py:42] Received request cmpl-0c4578652368426e9d9cf653228ff062-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:38:06 [async_llm.py:261] Added request cmpl-0c4578652368426e9d9cf653228ff062-0.
INFO 03-02 00:38:07 [logger.py:42] Received request cmpl-589314d7b1ea48e5a454352de390e54c-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:38:07 [async_llm.py:261] Added request cmpl-589314d7b1ea48e5a454352de390e54c-0.
INFO 03-02 00:38:08 [loggers.py:116] Engine 000: Avg prompt throughput: 27.9 tokens/s, Avg generation throughput: 4.5 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.6%
INFO 03-02 00:38:08 [logger.py:42] Received request cmpl-3b6b760ccb364563b61ad8da156a5700-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:38:08 [async_llm.py:261] Added request cmpl-3b6b760ccb364563b61ad8da156a5700-0.
INFO 03-02 00:38:09 [logger.py:42] Received request cmpl-68375c31859a4980aa898d144ffe87f3-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:38:09 [async_llm.py:261] Added request cmpl-68375c31859a4980aa898d144ffe87f3-0.
INFO 03-02 00:38:11 [logger.py:42] Received request cmpl-473167c0ba7c4ba4ba7fe349c14881df-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:38:11 [async_llm.py:261] Added request cmpl-473167c0ba7c4ba4ba7fe349c14881df-0.
INFO 03-02 00:38:12 [logger.py:42] Received request cmpl-92dba15b0f5a4452abb0efae73302bc5-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:38:12 [async_llm.py:261] Added request cmpl-92dba15b0f5a4452abb0efae73302bc5-0.
INFO 03-02 00:38:13 [logger.py:42] Received request cmpl-20e227427eed4b78967ee41f40770066-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:38:13 [async_llm.py:261] Added request cmpl-20e227427eed4b78967ee41f40770066-0.
INFO 03-02 00:38:14 [logger.py:42] Received request cmpl-63588192ee524c0b90f6fdd14f6b8c0a-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:38:14 [async_llm.py:261] Added request cmpl-63588192ee524c0b90f6fdd14f6b8c0a-0.
INFO 03-02 00:38:15 [logger.py:42] Received request cmpl-b1117f09c1af40d8a8165a73df34e1ae-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:38:15 [async_llm.py:261] Added request cmpl-b1117f09c1af40d8a8165a73df34e1ae-0.
INFO 03-02 00:38:16 [logger.py:42] Received request cmpl-f47b286892d6441cafebc718045fc1a8-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:38:16 [async_llm.py:261] Added request cmpl-f47b286892d6441cafebc718045fc1a8-0.
INFO 03-02 00:38:17 [logger.py:42] Received request cmpl-decfa42e5c544252b0ad565a747240b0-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:38:17 [async_llm.py:261] Added request cmpl-decfa42e5c544252b0ad565a747240b0-0.
INFO 03-02 00:38:18 [loggers.py:116] Engine 000: Avg prompt throughput: 27.9 tokens/s, Avg generation throughput: 4.5 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.6%
INFO 03-02 00:38:19 [logger.py:42] Received request cmpl-6136a9166b9f4835bb18c03ac2920752-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:38:19 [async_llm.py:261] Added request cmpl-6136a9166b9f4835bb18c03ac2920752-0.
INFO 03-02 00:38:20 [logger.py:42] Received request cmpl-e36f045820da4e73a5a694697dae77b4-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:38:20 [async_llm.py:261] Added request cmpl-e36f045820da4e73a5a694697dae77b4-0.
INFO 03-02 00:38:21 [logger.py:42] Received request cmpl-a9fcdecb9cc84783b16fd75c1ddef155-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:38:21 [async_llm.py:261] Added request cmpl-a9fcdecb9cc84783b16fd75c1ddef155-0.
INFO 03-02 00:38:22 [logger.py:42] Received request cmpl-3b568d8ed2e9474198f18ba5f803ea1e-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:38:22 [async_llm.py:261] Added request cmpl-3b568d8ed2e9474198f18ba5f803ea1e-0.
INFO 03-02 00:38:23 [logger.py:42] Received request cmpl-ae103c061c2a478a9bad3db995815c8f-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:38:23 [async_llm.py:261] Added request cmpl-ae103c061c2a478a9bad3db995815c8f-0.
INFO 03-02 00:38:24 [logger.py:42] Received request cmpl-d989b2cd2daa4f478da15a21474c643d-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:38:24 [async_llm.py:261] Added request cmpl-d989b2cd2daa4f478da15a21474c643d-0.
INFO 03-02 00:38:26 [logger.py:42] Received request cmpl-080432eac1c745ef9d502d493e975d10-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:38:26 [async_llm.py:261] Added request cmpl-080432eac1c745ef9d502d493e975d10-0.
INFO 03-02 00:38:27 [logger.py:42] Received request cmpl-266055fbfa6c45b69cab3c988202d4de-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:38:27 [async_llm.py:261] Added request cmpl-266055fbfa6c45b69cab3c988202d4de-0.
INFO 03-02 00:38:28 [loggers.py:116] Engine 000: Avg prompt throughput: 24.8 tokens/s, Avg generation throughput: 4.0 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.6%
INFO 03-02 00:38:28 [logger.py:42] Received request cmpl-0a069add82da46b995cd2f4546ce43d8-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:38:28 [async_llm.py:261] Added request cmpl-0a069add82da46b995cd2f4546ce43d8-0.
INFO 03-02 00:38:29 [logger.py:42] Received request cmpl-fd4365747aec471dacc569a175a92eaa-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:38:29 [async_llm.py:261] Added request cmpl-fd4365747aec471dacc569a175a92eaa-0.
INFO 03-02 00:38:30 [logger.py:42] Received request cmpl-4cd7596b95c24cc8b16b95475dde259b-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:38:30 [async_llm.py:261] Added request cmpl-4cd7596b95c24cc8b16b95475dde259b-0.
INFO 03-02 00:38:31 [logger.py:42] Received request cmpl-dfe2df713ebb4ec49f528e7ed915ee86-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:38:31 [async_llm.py:261] Added request cmpl-dfe2df713ebb4ec49f528e7ed915ee86-0.
INFO 03-02 00:38:32 [logger.py:42] Received request cmpl-e1e240d3a97740329febe80d7884a31b-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:38:32 [async_llm.py:261] Added request cmpl-e1e240d3a97740329febe80d7884a31b-0.
INFO 03-02 00:38:34 [logger.py:42] Received request cmpl-c15f73ded227448b908e38b6af05fb06-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:38:34 [async_llm.py:261] Added request cmpl-c15f73ded227448b908e38b6af05fb06-0.
INFO 03-02 00:38:35 [logger.py:42] Received request cmpl-51da55e3ac324962baba338052811cf0-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:38:35 [async_llm.py:261] Added request cmpl-51da55e3ac324962baba338052811cf0-0.
INFO 03-02 00:38:36 [logger.py:42] Received request cmpl-86c767bab40f48959046d3b445fad12c-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:38:36 [async_llm.py:261] Added request cmpl-86c767bab40f48959046d3b445fad12c-0.
INFO 03-02 00:38:37 [logger.py:42] Received request cmpl-9a29d87b672e40cbba612b38e1e6a632-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:38:37 [async_llm.py:261] Added request cmpl-9a29d87b672e40cbba612b38e1e6a632-0.
INFO 03-02 00:38:38 [loggers.py:116] Engine 000: Avg prompt throughput: 27.9 tokens/s, Avg generation throughput: 4.5 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.6%
INFO 03-02 00:38:38 [logger.py:42] Received request cmpl-70edb4164f924d7ab8460bd8392588ec-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:38:38 [async_llm.py:261] Added request cmpl-70edb4164f924d7ab8460bd8392588ec-0.
INFO 03-02 00:38:39 [logger.py:42] Received request cmpl-1bdefd375d264392a891782662cb4eb6-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:38:39 [async_llm.py:261] Added request cmpl-1bdefd375d264392a891782662cb4eb6-0.
INFO 03-02 00:38:41 [logger.py:42] Received request cmpl-87993af089eb4d308ba858e59e45ee4d-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:38:41 [async_llm.py:261] Added request cmpl-87993af089eb4d308ba858e59e45ee4d-0.
INFO 03-02 00:38:42 [logger.py:42] Received request cmpl-5276ec72d2694222a7aeb5bb84d6338e-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:38:42 [async_llm.py:261] Added request cmpl-5276ec72d2694222a7aeb5bb84d6338e-0.
INFO 03-02 00:38:43 [logger.py:42] Received request cmpl-280616470c3847e782f39e5eff288ec2-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:38:43 [async_llm.py:261] Added request cmpl-280616470c3847e782f39e5eff288ec2-0.
INFO 03-02 00:38:44 [logger.py:42] Received request cmpl-06fd1a3e454d41c68954533f6e2ad2d4-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:38:44 [async_llm.py:261] Added request cmpl-06fd1a3e454d41c68954533f6e2ad2d4-0.
INFO 03-02 00:38:45 [logger.py:42] Received request cmpl-c69a42a4ea5c4e7fb33d719e7141cb81-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:38:45 [async_llm.py:261] Added request cmpl-c69a42a4ea5c4e7fb33d719e7141cb81-0.
INFO 03-02 00:38:46 [logger.py:42] Received request cmpl-762f53d2c8fe4cec83ecd6f7f735f696-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:38:46 [async_llm.py:261] Added request cmpl-762f53d2c8fe4cec83ecd6f7f735f696-0.
INFO 03-02 00:38:47 [logger.py:42] Received request cmpl-d2475c28d05849d2ac6197ec0e33542a-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:38:47 [async_llm.py:261] Added request cmpl-d2475c28d05849d2ac6197ec0e33542a-0.
INFO 03-02 00:38:48 [loggers.py:116] Engine 000: Avg prompt throughput: 27.9 tokens/s, Avg generation throughput: 4.4 tokens/s, Running: 1 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.4%, Prefix cache hit rate: 51.6%
INFO 03-02 00:38:49 [logger.py:42] Received request cmpl-5b0a744c17404646b23a9bd72a28f770-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:38:49 [async_llm.py:261] Added request cmpl-5b0a744c17404646b23a9bd72a28f770-0.
INFO 03-02 00:38:50 [logger.py:42] Received request cmpl-aeacf98ebac84f419f8f123302552fac-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:38:50 [async_llm.py:261] Added request cmpl-aeacf98ebac84f419f8f123302552fac-0.
INFO 03-02 00:38:51 [logger.py:42] Received request cmpl-38592e1e5bce49b6955cafdaf223878c-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:38:51 [async_llm.py:261] Added request cmpl-38592e1e5bce49b6955cafdaf223878c-0.
INFO 03-02 00:38:52 [logger.py:42] Received request cmpl-fdea08fecdb048f1a2a5cdfce5d7ee7f-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:38:52 [async_llm.py:261] Added request cmpl-fdea08fecdb048f1a2a5cdfce5d7ee7f-0.
INFO 03-02 00:38:53 [logger.py:42] Received request cmpl-b549766c9bf34724bdd1feaa9283d268-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:38:53 [async_llm.py:261] Added request cmpl-b549766c9bf34724bdd1feaa9283d268-0.
INFO 03-02 00:38:54 [logger.py:42] Received request cmpl-0c4c0a9bbff4483894a9764d520a55ad-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:38:54 [async_llm.py:261] Added request cmpl-0c4c0a9bbff4483894a9764d520a55ad-0.
INFO 03-02 00:38:56 [logger.py:42] Received request cmpl-a138a43ed7e84ffa9263ef48b40a5073-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:38:56 [async_llm.py:261] Added request cmpl-a138a43ed7e84ffa9263ef48b40a5073-0.
INFO 03-02 00:38:57 [logger.py:42] Received request cmpl-d3747e7927414c0fa5685f72c99ddeb2-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:38:57 [async_llm.py:261] Added request cmpl-d3747e7927414c0fa5685f72c99ddeb2-0.
INFO 03-02 00:38:58 [loggers.py:116] Engine 000: Avg prompt throughput: 24.8 tokens/s, Avg generation throughput: 4.1 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.6%
INFO 03-02 00:38:58 [logger.py:42] Received request cmpl-b43a6d2eb216466cb9e636f7246faafd-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:38:58 [async_llm.py:261] Added request cmpl-b43a6d2eb216466cb9e636f7246faafd-0.
INFO 03-02 00:38:59 [logger.py:42] Received request cmpl-c0eed9f0d39945699d7efe993add5d4b-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:38:59 [async_llm.py:261] Added request cmpl-c0eed9f0d39945699d7efe993add5d4b-0.
INFO 03-02 00:39:00 [logger.py:42] Received request cmpl-535b8ebd80c041c49ffb111f75f79cd1-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:39:00 [async_llm.py:261] Added request cmpl-535b8ebd80c041c49ffb111f75f79cd1-0.
INFO 03-02 00:39:01 [logger.py:42] Received request cmpl-4299f31c6a02411ca4e012c02b900e13-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:39:01 [async_llm.py:261] Added request cmpl-4299f31c6a02411ca4e012c02b900e13-0.
INFO 03-02 00:39:02 [logger.py:42] Received request cmpl-a606159826d2479895a3339e40af18ea-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:39:02 [async_llm.py:261] Added request cmpl-a606159826d2479895a3339e40af18ea-0.
INFO 03-02 00:39:04 [logger.py:42] Received request cmpl-6ed7d663962d47c79ffd38960edfe43f-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:39:04 [async_llm.py:261] Added request cmpl-6ed7d663962d47c79ffd38960edfe43f-0.
INFO 03-02 00:39:05 [logger.py:42] Received request cmpl-8cbb22acc824485ebb3a860e6260638b-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:39:05 [async_llm.py:261] Added request cmpl-8cbb22acc824485ebb3a860e6260638b-0.
INFO 03-02 00:39:06 [logger.py:42] Received request cmpl-8913e38474d04c0993ce484c12363956-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:39:06 [async_llm.py:261] Added request cmpl-8913e38474d04c0993ce484c12363956-0.
INFO 03-02 00:39:07 [logger.py:42] Received request cmpl-3b109c4c4ee64b469feb4f2b4f84812c-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:39:07 [async_llm.py:261] Added request cmpl-3b109c4c4ee64b469feb4f2b4f84812c-0.
INFO 03-02 00:39:08 [loggers.py:116] Engine 000: Avg prompt throughput: 27.9 tokens/s, Avg generation throughput: 4.5 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.6%
INFO 03-02 00:39:08 [logger.py:42] Received request cmpl-d69bce9b59b5410f984cc4c3285d395e-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:39:08 [async_llm.py:261] Added request cmpl-d69bce9b59b5410f984cc4c3285d395e-0.
INFO 03-02 00:39:09 [logger.py:42] Received request cmpl-fb4af98ad79e49b4aa18c9175932280e-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:39:09 [async_llm.py:261] Added request cmpl-fb4af98ad79e49b4aa18c9175932280e-0.
INFO 03-02 00:39:11 [logger.py:42] Received request cmpl-171cd5a572a64dc4bad97fc18c608a07-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:39:11 [async_llm.py:261] Added request cmpl-171cd5a572a64dc4bad97fc18c608a07-0.
INFO 03-02 00:39:12 [logger.py:42] Received request cmpl-e670c5ab37104107a92ea4bffda8821f-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:39:12 [async_llm.py:261] Added request cmpl-e670c5ab37104107a92ea4bffda8821f-0.
INFO 03-02 00:39:13 [logger.py:42] Received request cmpl-fee26e9fac5f478b8bf82f5ad412dffb-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:39:13 [async_llm.py:261] Added request cmpl-fee26e9fac5f478b8bf82f5ad412dffb-0.
INFO 03-02 00:39:14 [logger.py:42] Received request cmpl-c18fc6f7827a4d3bb31103be1df61668-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:39:14 [async_llm.py:261] Added request cmpl-c18fc6f7827a4d3bb31103be1df61668-0.
INFO 03-02 00:39:15 [logger.py:42] Received request cmpl-e15ceb6edd9547aebd7ec184524567af-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:39:15 [async_llm.py:261] Added request cmpl-e15ceb6edd9547aebd7ec184524567af-0.
INFO 03-02 00:39:16 [logger.py:42] Received request cmpl-383af373ff78455aaa01457585b17ce2-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:39:16 [async_llm.py:261] Added request cmpl-383af373ff78455aaa01457585b17ce2-0.
INFO 03-02 00:39:17 [logger.py:42] Received request cmpl-67b9939cfda6472eb3726c4c2efaa978-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:39:17 [async_llm.py:261] Added request cmpl-67b9939cfda6472eb3726c4c2efaa978-0.
INFO 03-02 00:39:18 [loggers.py:116] Engine 000: Avg prompt throughput: 27.9 tokens/s, Avg generation throughput: 4.5 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.6%
INFO 03-02 00:39:19 [logger.py:42] Received request cmpl-2c2c939f4a734047bd2240b85790a3b3-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:39:19 [async_llm.py:261] Added request cmpl-2c2c939f4a734047bd2240b85790a3b3-0.
INFO 03-02 00:39:20 [logger.py:42] Received request cmpl-53c2178d0ea343a9ab33ed388c36e4a3-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:39:20 [async_llm.py:261] Added request cmpl-53c2178d0ea343a9ab33ed388c36e4a3-0.
INFO 03-02 00:39:21 [logger.py:42] Received request cmpl-ef86ef5afee54a3a9bc412763c048560-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:39:21 [async_llm.py:261] Added request cmpl-ef86ef5afee54a3a9bc412763c048560-0.
INFO 03-02 00:39:22 [logger.py:42] Received request cmpl-1df36d172b6c4baba10336dbf57edc06-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:39:22 [async_llm.py:261] Added request cmpl-1df36d172b6c4baba10336dbf57edc06-0.
INFO 03-02 00:39:23 [logger.py:42] Received request cmpl-160e8cdfc2a04e5cae12c2802f8b4a6c-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:39:23 [async_llm.py:261] Added request cmpl-160e8cdfc2a04e5cae12c2802f8b4a6c-0.
INFO 03-02 00:39:24 [logger.py:42] Received request cmpl-f49546e3e52c440bad752c9bba7013a0-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:39:24 [async_llm.py:261] Added request cmpl-f49546e3e52c440bad752c9bba7013a0-0.
INFO 03-02 00:39:26 [logger.py:42] Received request cmpl-b506569e8c93455db7b6d50c806e1849-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:39:26 [async_llm.py:261] Added request cmpl-b506569e8c93455db7b6d50c806e1849-0.
INFO 03-02 00:39:27 [logger.py:42] Received request cmpl-a35969a222cb419ca5b32df576fd109d-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:39:27 [async_llm.py:261] Added request cmpl-a35969a222cb419ca5b32df576fd109d-0.
INFO 03-02 00:39:28 [loggers.py:116] Engine 000: Avg prompt throughput: 24.8 tokens/s, Avg generation throughput: 4.0 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.6%
INFO 03-02 00:39:28 [logger.py:42] Received request cmpl-f3302a334016454795e946d21e6e5d08-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:39:28 [async_llm.py:261] Added request cmpl-f3302a334016454795e946d21e6e5d08-0.
INFO 03-02 00:39:29 [logger.py:42] Received request cmpl-ce0d7e12534f4997af7eb1d66d320b84-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:39:29 [async_llm.py:261] Added request cmpl-ce0d7e12534f4997af7eb1d66d320b84-0.
INFO 03-02 00:39:30 [logger.py:42] Received request cmpl-55b1adea1c7041c2a37b2073a99c91bb-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:39:30 [async_llm.py:261] Added request cmpl-55b1adea1c7041c2a37b2073a99c91bb-0.
INFO 03-02 00:39:31 [logger.py:42] Received request cmpl-5e57f83aa52344c095828c6563c1db05-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:39:31 [async_llm.py:261] Added request cmpl-5e57f83aa52344c095828c6563c1db05-0.
INFO 03-02 00:39:32 [logger.py:42] Received request cmpl-138151ba6dc44ee7b11986fe57d7fa07-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:39:32 [async_llm.py:261] Added request cmpl-138151ba6dc44ee7b11986fe57d7fa07-0.
INFO 03-02 00:39:34 [logger.py:42] Received request cmpl-62034853595741349e9b7de4d4a626b6-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:39:34 [async_llm.py:261] Added request cmpl-62034853595741349e9b7de4d4a626b6-0.
INFO 03-02 00:39:35 [logger.py:42] Received request cmpl-aa4dcf87860c4c978299ab83efdef84f-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:39:35 [async_llm.py:261] Added request cmpl-aa4dcf87860c4c978299ab83efdef84f-0.
INFO 03-02 00:39:36 [logger.py:42] Received request cmpl-3bbbc6ef0bd149be8c5be8834ed04937-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:39:36 [async_llm.py:261] Added request cmpl-3bbbc6ef0bd149be8c5be8834ed04937-0.
INFO 03-02 00:39:37 [logger.py:42] Received request cmpl-b97141928fad4bca838d9816be422754-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:39:37 [async_llm.py:261] Added request cmpl-b97141928fad4bca838d9816be422754-0.
INFO 03-02 00:39:38 [loggers.py:116] Engine 000: Avg prompt throughput: 27.9 tokens/s, Avg generation throughput: 4.5 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.6%
INFO 03-02 00:39:38 [logger.py:42] Received request cmpl-5d740727dc864c428c72660e50d90136-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:39:38 [async_llm.py:261] Added request cmpl-5d740727dc864c428c72660e50d90136-0.
INFO 03-02 00:39:39 [logger.py:42] Received request cmpl-9a8d39ca7af241da8876c097f9f61a89-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:39:39 [async_llm.py:261] Added request cmpl-9a8d39ca7af241da8876c097f9f61a89-0.
INFO 03-02 00:39:40 [logger.py:42] Received request cmpl-07fff84ae5cf43fabbe15be280f9663e-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:39:40 [async_llm.py:261] Added request cmpl-07fff84ae5cf43fabbe15be280f9663e-0.
INFO 03-02 00:39:42 [logger.py:42] Received request cmpl-3e1b63ac0b7146e9920053e9272552eb-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:39:42 [async_llm.py:261] Added request cmpl-3e1b63ac0b7146e9920053e9272552eb-0.
INFO 03-02 00:39:43 [logger.py:42] Received request cmpl-a46344b50a134f829a52ed2143b10c1c-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:39:43 [async_llm.py:261] Added request cmpl-a46344b50a134f829a52ed2143b10c1c-0.
INFO 03-02 00:39:44 [logger.py:42] Received request cmpl-c95cd32ea300400baf9d1a6f9646b22b-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:39:44 [async_llm.py:261] Added request cmpl-c95cd32ea300400baf9d1a6f9646b22b-0.
INFO 03-02 00:39:45 [logger.py:42] Received request cmpl-7506803b23544f728b86873a78ba0e8d-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:39:45 [async_llm.py:261] Added request cmpl-7506803b23544f728b86873a78ba0e8d-0.
INFO 03-02 00:39:46 [logger.py:42] Received request cmpl-ad06db0768e64cba82923b5678f0ccc6-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:39:46 [async_llm.py:261] Added request cmpl-ad06db0768e64cba82923b5678f0ccc6-0.
INFO 03-02 00:39:47 [logger.py:42] Received request cmpl-23668d11db2344569b806c9070d51531-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:39:47 [async_llm.py:261] Added request cmpl-23668d11db2344569b806c9070d51531-0.
INFO 03-02 00:39:48 [loggers.py:116] Engine 000: Avg prompt throughput: 27.9 tokens/s, Avg generation throughput: 4.5 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.6%
INFO 03-02 00:39:49 [logger.py:42] Received request cmpl-df498c60891f4eb894078bda870270d1-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:39:49 [async_llm.py:261] Added request cmpl-df498c60891f4eb894078bda870270d1-0.
INFO 03-02 00:39:50 [logger.py:42] Received request cmpl-526fe4f1921c4deea66f0523afb04255-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:39:50 [async_llm.py:261] Added request cmpl-526fe4f1921c4deea66f0523afb04255-0.
INFO 03-02 00:39:51 [logger.py:42] Received request cmpl-bd4add73f9f34775a33eb93deda9e547-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:39:51 [async_llm.py:261] Added request cmpl-bd4add73f9f34775a33eb93deda9e547-0.
INFO 03-02 00:39:52 [logger.py:42] Received request cmpl-21b7f52f02e0492dbb86ce4b0000879f-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:39:52 [async_llm.py:261] Added request cmpl-21b7f52f02e0492dbb86ce4b0000879f-0.
INFO 03-02 00:39:53 [logger.py:42] Received request cmpl-27093ba5caf14b1f897a3c7f96eb02c8-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:39:53 [async_llm.py:261] Added request cmpl-27093ba5caf14b1f897a3c7f96eb02c8-0.
INFO 03-02 00:39:54 [logger.py:42] Received request cmpl-4ab46f9a8b0a4c8494fce704f052cbbc-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:39:54 [async_llm.py:261] Added request cmpl-4ab46f9a8b0a4c8494fce704f052cbbc-0.
INFO 03-02 00:39:55 [logger.py:42] Received request cmpl-41b28be140214cb08d162efcb6865a3d-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:39:55 [async_llm.py:261] Added request cmpl-41b28be140214cb08d162efcb6865a3d-0.
INFO 03-02 00:39:57 [logger.py:42] Received request cmpl-9b3d72a4f6fd4227bb6b445c97c51421-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:39:57 [async_llm.py:261] Added request cmpl-9b3d72a4f6fd4227bb6b445c97c51421-0.
INFO 03-02 00:39:58 [loggers.py:116] Engine 000: Avg prompt throughput: 24.8 tokens/s, Avg generation throughput: 4.0 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.6%
INFO 03-02 00:39:58 [logger.py:42] Received request cmpl-c454db90493243b198ec6c6f37e0c448-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:39:58 [async_llm.py:261] Added request cmpl-c454db90493243b198ec6c6f37e0c448-0.
INFO 03-02 00:39:59 [logger.py:42] Received request cmpl-942320375c0b415eadc272de388ed9c4-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:39:59 [async_llm.py:261] Added request cmpl-942320375c0b415eadc272de388ed9c4-0.
INFO 03-02 00:40:00 [logger.py:42] Received request cmpl-0143dcc1594c4e519d1b8716e4443d28-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:40:00 [async_llm.py:261] Added request cmpl-0143dcc1594c4e519d1b8716e4443d28-0.
INFO 03-02 00:40:01 [logger.py:42] Received request cmpl-827819aeea2e4c55aa342882a0ed977c-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:40:01 [async_llm.py:261] Added request cmpl-827819aeea2e4c55aa342882a0ed977c-0.
INFO 03-02 00:40:02 [logger.py:42] Received request cmpl-7ffe1bafc5274d94b184a7c88645d7e6-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:40:02 [async_llm.py:261] Added request cmpl-7ffe1bafc5274d94b184a7c88645d7e6-0.
INFO 03-02 00:40:04 [logger.py:42] Received request cmpl-9ddee18d8b8b452ba9eba7a37c54624b-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:40:04 [async_llm.py:261] Added request cmpl-9ddee18d8b8b452ba9eba7a37c54624b-0.
INFO 03-02 00:40:05 [logger.py:42] Received request cmpl-ded227c71e5d4bbca48dd04d44bb7e0f-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:40:05 [async_llm.py:261] Added request cmpl-ded227c71e5d4bbca48dd04d44bb7e0f-0.
INFO 03-02 00:40:06 [logger.py:42] Received request cmpl-c7b4c43479564ed1becdeb9061fe1721-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:40:06 [async_llm.py:261] Added request cmpl-c7b4c43479564ed1becdeb9061fe1721-0.
INFO 03-02 00:40:07 [logger.py:42] Received request cmpl-ac6bcc3d8c494ba7b9270e6e306b0cb7-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:40:07 [async_llm.py:261] Added request cmpl-ac6bcc3d8c494ba7b9270e6e306b0cb7-0.
INFO 03-02 00:40:08 [loggers.py:116] Engine 000: Avg prompt throughput: 27.9 tokens/s, Avg generation throughput: 4.5 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.6%
INFO 03-02 00:40:08 [logger.py:42] Received request cmpl-5fdcb9de578343dab40044ed6f1323ff-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:40:08 [async_llm.py:261] Added request cmpl-5fdcb9de578343dab40044ed6f1323ff-0.
INFO 03-02 00:40:09 [logger.py:42] Received request cmpl-8ec952b499274ebd9cac4a82a3584897-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:40:09 [async_llm.py:261] Added request cmpl-8ec952b499274ebd9cac4a82a3584897-0.
INFO 03-02 00:40:10 [logger.py:42] Received request cmpl-489b169e6b274bdc85d8604a0965eb48-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:40:10 [async_llm.py:261] Added request cmpl-489b169e6b274bdc85d8604a0965eb48-0.
INFO 03-02 00:40:12 [logger.py:42] Received request cmpl-35e3ef9e6d3e4ab5b6d52b7b633b9287-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:40:12 [async_llm.py:261] Added request cmpl-35e3ef9e6d3e4ab5b6d52b7b633b9287-0.
INFO 03-02 00:40:13 [logger.py:42] Received request cmpl-fbd6eee0ff8c4977ac15dc689cdae542-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:40:13 [async_llm.py:261] Added request cmpl-fbd6eee0ff8c4977ac15dc689cdae542-0.
INFO 03-02 00:40:14 [logger.py:42] Received request cmpl-89d33c6955b84b9fb8bfcf3a95e51623-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:40:14 [async_llm.py:261] Added request cmpl-89d33c6955b84b9fb8bfcf3a95e51623-0.
INFO 03-02 00:40:15 [logger.py:42] Received request cmpl-212babfaf59344a483b540e71e47db49-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:40:15 [async_llm.py:261] Added request cmpl-212babfaf59344a483b540e71e47db49-0.
INFO 03-02 00:40:16 [logger.py:42] Received request cmpl-c44a3ca751d846afad88fa1524bdd67c-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:40:16 [async_llm.py:261] Added request cmpl-c44a3ca751d846afad88fa1524bdd67c-0.
INFO 03-02 00:40:17 [logger.py:42] Received request cmpl-f112a256e9fa43bf91c1deeadf31a545-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:40:17 [async_llm.py:261] Added request cmpl-f112a256e9fa43bf91c1deeadf31a545-0.
INFO 03-02 00:40:18 [loggers.py:116] Engine 000: Avg prompt throughput: 27.9 tokens/s, Avg generation throughput: 4.5 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.6%
INFO 03-02 00:40:19 [logger.py:42] Received request cmpl-6e844031782247a3ae510451eec180b1-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:40:19 [async_llm.py:261] Added request cmpl-6e844031782247a3ae510451eec180b1-0.
INFO 03-02 00:40:20 [logger.py:42] Received request cmpl-3485277de19f455f8730f20370cbc12f-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:40:20 [async_llm.py:261] Added request cmpl-3485277de19f455f8730f20370cbc12f-0.
INFO 03-02 00:40:21 [logger.py:42] Received request cmpl-6cbbfff877494bf2acbc03fedbd56c83-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:40:21 [async_llm.py:261] Added request cmpl-6cbbfff877494bf2acbc03fedbd56c83-0.
INFO 03-02 00:40:22 [logger.py:42] Received request cmpl-f53ba077a2d04787bb5be30b22087dc2-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:40:22 [async_llm.py:261] Added request cmpl-f53ba077a2d04787bb5be30b22087dc2-0.
INFO 03-02 00:40:23 [logger.py:42] Received request cmpl-eedde45b0b504b0a913b80ef85f7613b-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:40:23 [async_llm.py:261] Added request cmpl-eedde45b0b504b0a913b80ef85f7613b-0.
INFO 03-02 00:40:24 [logger.py:42] Received request cmpl-f16f07ed47ca4088bf055b7473fba8d8-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:40:24 [async_llm.py:261] Added request cmpl-f16f07ed47ca4088bf055b7473fba8d8-0.
INFO 03-02 00:40:25 [logger.py:42] Received request cmpl-5f54ea39b77349e6a0b4d9634785d5e7-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:40:25 [async_llm.py:261] Added request cmpl-5f54ea39b77349e6a0b4d9634785d5e7-0.
INFO 03-02 00:40:27 [logger.py:42] Received request cmpl-5818801d64fc457a8ffc05d61a8daa3c-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:40:27 [async_llm.py:261] Added request cmpl-5818801d64fc457a8ffc05d61a8daa3c-0.
INFO 03-02 00:40:28 [loggers.py:116] Engine 000: Avg prompt throughput: 24.8 tokens/s, Avg generation throughput: 4.0 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.6%
INFO 03-02 00:40:28 [logger.py:42] Received request cmpl-8e491c1cc0134f689a36bc8cea7682d1-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:40:28 [async_llm.py:261] Added request cmpl-8e491c1cc0134f689a36bc8cea7682d1-0.
INFO 03-02 00:40:29 [logger.py:42] Received request cmpl-73168ee12d7d42d29c5e639fc1e0c17a-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:40:29 [async_llm.py:261] Added request cmpl-73168ee12d7d42d29c5e639fc1e0c17a-0.
INFO 03-02 00:40:30 [logger.py:42] Received request cmpl-0876119d670d47a9ba5deafbe12e161a-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:40:30 [async_llm.py:261] Added request cmpl-0876119d670d47a9ba5deafbe12e161a-0.
INFO 03-02 00:40:31 [logger.py:42] Received request cmpl-82094ca2b93244718c94219ca61badc1-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:40:31 [async_llm.py:261] Added request cmpl-82094ca2b93244718c94219ca61badc1-0.
INFO 03-02 00:40:32 [logger.py:42] Received request cmpl-44319491a468478c976a2f01284bb840-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:40:32 [async_llm.py:261] Added request cmpl-44319491a468478c976a2f01284bb840-0.
INFO 03-02 00:40:34 [logger.py:42] Received request cmpl-39f7f580ccdb4f90a28871599468beab-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:40:34 [async_llm.py:261] Added request cmpl-39f7f580ccdb4f90a28871599468beab-0.
INFO 03-02 00:40:35 [logger.py:42] Received request cmpl-03f7c8b700e245ecacd0e71d1af12779-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:40:35 [async_llm.py:261] Added request cmpl-03f7c8b700e245ecacd0e71d1af12779-0.
INFO 03-02 00:40:36 [logger.py:42] Received request cmpl-111ed209a00a4be895ce72dade5af45e-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:40:36 [async_llm.py:261] Added request cmpl-111ed209a00a4be895ce72dade5af45e-0.
INFO 03-02 00:40:37 [logger.py:42] Received request cmpl-74578e2bb94c45f48b109f1a9a9f4e99-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:40:37 [async_llm.py:261] Added request cmpl-74578e2bb94c45f48b109f1a9a9f4e99-0.
INFO 03-02 00:40:38 [loggers.py:116] Engine 000: Avg prompt throughput: 27.9 tokens/s, Avg generation throughput: 4.5 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.6%
INFO 03-02 00:40:38 [logger.py:42] Received request cmpl-7ca4cc74ad9d4100a3aec130eb5a39f3-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:40:38 [async_llm.py:261] Added request cmpl-7ca4cc74ad9d4100a3aec130eb5a39f3-0.
INFO 03-02 00:40:39 [logger.py:42] Received request cmpl-299e1205f2b84fb0ab4746096e7d0329-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:40:39 [async_llm.py:261] Added request cmpl-299e1205f2b84fb0ab4746096e7d0329-0.
INFO 03-02 00:40:40 [logger.py:42] Received request cmpl-90f5cb522649460f8458f705b3289d11-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:40:40 [async_llm.py:261] Added request cmpl-90f5cb522649460f8458f705b3289d11-0.
INFO 03-02 00:40:42 [logger.py:42] Received request cmpl-bf58a5da52f54272aca0c57ae810eddb-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:40:42 [async_llm.py:261] Added request cmpl-bf58a5da52f54272aca0c57ae810eddb-0.
INFO 03-02 00:40:43 [logger.py:42] Received request cmpl-11540b5f10be498e8241e455223b0f15-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:40:43 [async_llm.py:261] Added request cmpl-11540b5f10be498e8241e455223b0f15-0.
INFO 03-02 00:40:44 [logger.py:42] Received request cmpl-fa4b9bd80c55445ea28f39c615945b2a-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:40:44 [async_llm.py:261] Added request cmpl-fa4b9bd80c55445ea28f39c615945b2a-0.
INFO 03-02 00:40:45 [logger.py:42] Received request cmpl-70af9d2f44c84e719d7b9b134eb01273-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:40:45 [async_llm.py:261] Added request cmpl-70af9d2f44c84e719d7b9b134eb01273-0.
INFO 03-02 00:40:46 [logger.py:42] Received request cmpl-c6c1ea8f1aaa4bbab309717b915f76f7-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:40:46 [async_llm.py:261] Added request cmpl-c6c1ea8f1aaa4bbab309717b915f76f7-0.
INFO 03-02 00:40:47 [logger.py:42] Received request cmpl-64b6882dc32a4a92bb1e5d2d51ad07c6-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:40:47 [async_llm.py:261] Added request cmpl-64b6882dc32a4a92bb1e5d2d51ad07c6-0.
INFO 03-02 00:40:48 [loggers.py:116] Engine 000: Avg prompt throughput: 27.9 tokens/s, Avg generation throughput: 4.5 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.6%
INFO 03-02 00:40:49 [logger.py:42] Received request cmpl-f56c3f0edd034ee690547a94e6533a9a-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:40:49 [async_llm.py:261] Added request cmpl-f56c3f0edd034ee690547a94e6533a9a-0.
INFO 03-02 00:40:50 [logger.py:42] Received request cmpl-98f5e25357754ead87829ce0d4999930-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:40:50 [async_llm.py:261] Added request cmpl-98f5e25357754ead87829ce0d4999930-0.
INFO 03-02 00:40:51 [logger.py:42] Received request cmpl-2d55ac23ae6547f098d98dcd828fbb86-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:40:51 [async_llm.py:261] Added request cmpl-2d55ac23ae6547f098d98dcd828fbb86-0.
INFO 03-02 00:40:52 [logger.py:42] Received request cmpl-10e26d1861544d02b50530977c031cd6-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:40:52 [async_llm.py:261] Added request cmpl-10e26d1861544d02b50530977c031cd6-0.
INFO 03-02 00:40:53 [logger.py:42] Received request cmpl-c8c13a6a2a424ed4a438213552cdb095-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:40:53 [async_llm.py:261] Added request cmpl-c8c13a6a2a424ed4a438213552cdb095-0.
INFO 03-02 00:40:54 [logger.py:42] Received request cmpl-bd9ef6d98ce848eca54cc1958dacbba6-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:40:54 [async_llm.py:261] Added request cmpl-bd9ef6d98ce848eca54cc1958dacbba6-0.
INFO 03-02 00:40:55 [logger.py:42] Received request cmpl-2ede73a2e1cc475a9a492ceb3dae151c-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:40:55 [async_llm.py:261] Added request cmpl-2ede73a2e1cc475a9a492ceb3dae151c-0.
INFO 03-02 00:40:57 [logger.py:42] Received request cmpl-346155f8b5f14c88905771fd3eeb6461-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:40:57 [async_llm.py:261] Added request cmpl-346155f8b5f14c88905771fd3eeb6461-0.
INFO 03-02 00:40:58 [loggers.py:116] Engine 000: Avg prompt throughput: 24.8 tokens/s, Avg generation throughput: 4.0 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.6%
INFO 03-02 00:40:58 [logger.py:42] Received request cmpl-ffb912d4b13c45149f4f6b07c8755cb7-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:40:58 [async_llm.py:261] Added request cmpl-ffb912d4b13c45149f4f6b07c8755cb7-0.
INFO 03-02 00:40:59 [logger.py:42] Received request cmpl-2bc313a1919048999e8c016f6b69a05a-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:40:59 [async_llm.py:261] Added request cmpl-2bc313a1919048999e8c016f6b69a05a-0.
INFO 03-02 00:41:00 [logger.py:42] Received request cmpl-4f3e5a20eba64a539569ea0279515135-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:41:00 [async_llm.py:261] Added request cmpl-4f3e5a20eba64a539569ea0279515135-0.
INFO 03-02 00:41:01 [logger.py:42] Received request cmpl-2b4010d020e84076a64dfee4257f2ce5-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:41:01 [async_llm.py:261] Added request cmpl-2b4010d020e84076a64dfee4257f2ce5-0.
INFO 03-02 00:41:02 [logger.py:42] Received request cmpl-53bc2c40b0f44e8faa1202a14bea71a0-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:41:02 [async_llm.py:261] Added request cmpl-53bc2c40b0f44e8faa1202a14bea71a0-0.
INFO 03-02 00:41:04 [logger.py:42] Received request cmpl-29b32df91f8f4633b5ee60e707b6b6a7-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:41:04 [async_llm.py:261] Added request cmpl-29b32df91f8f4633b5ee60e707b6b6a7-0.
INFO 03-02 00:41:05 [logger.py:42] Received request cmpl-ebb0e4383d8a49498d7fda64ea443299-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:41:05 [async_llm.py:261] Added request cmpl-ebb0e4383d8a49498d7fda64ea443299-0.
INFO 03-02 00:41:06 [logger.py:42] Received request cmpl-cdac7badcd8b49b39bce6afe727e8c61-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:41:06 [async_llm.py:261] Added request cmpl-cdac7badcd8b49b39bce6afe727e8c61-0.
INFO 03-02 00:41:07 [logger.py:42] Received request cmpl-9e93a84dae61418da0bef72d9b359e52-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:41:07 [async_llm.py:261] Added request cmpl-9e93a84dae61418da0bef72d9b359e52-0.
INFO 03-02 00:41:08 [loggers.py:116] Engine 000: Avg prompt throughput: 27.9 tokens/s, Avg generation throughput: 4.5 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.6%
INFO 03-02 00:41:08 [logger.py:42] Received request cmpl-76b42e685acb421da1501770a840518b-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:41:08 [async_llm.py:261] Added request cmpl-76b42e685acb421da1501770a840518b-0.
INFO 03-02 00:41:09 [logger.py:42] Received request cmpl-7e35522cd73a4c03b75101e7619048d4-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:41:09 [async_llm.py:261] Added request cmpl-7e35522cd73a4c03b75101e7619048d4-0.
INFO 03-02 00:41:11 [logger.py:42] Received request cmpl-530b11b0d9a74b57aa35b41ff791f97d-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:41:11 [async_llm.py:261] Added request cmpl-530b11b0d9a74b57aa35b41ff791f97d-0.
INFO 03-02 00:41:12 [logger.py:42] Received request cmpl-a77a26bcdf8941fab57a73c236405b1d-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:41:12 [async_llm.py:261] Added request cmpl-a77a26bcdf8941fab57a73c236405b1d-0.
INFO 03-02 00:41:13 [logger.py:42] Received request cmpl-04b678189ed94842bc64823ab1967b7c-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:41:13 [async_llm.py:261] Added request cmpl-04b678189ed94842bc64823ab1967b7c-0.
INFO 03-02 00:41:14 [logger.py:42] Received request cmpl-cceaeada23894a28a96f8348891d5c22-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:41:14 [async_llm.py:261] Added request cmpl-cceaeada23894a28a96f8348891d5c22-0.
INFO 03-02 00:41:15 [logger.py:42] Received request cmpl-dc873534f03740dc8e6fd72b98c0388c-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:41:15 [async_llm.py:261] Added request cmpl-dc873534f03740dc8e6fd72b98c0388c-0.
INFO 03-02 00:41:16 [logger.py:42] Received request cmpl-44043d860f094545a7949efab5631533-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:41:16 [async_llm.py:261] Added request cmpl-44043d860f094545a7949efab5631533-0.
INFO 03-02 00:41:17 [logger.py:42] Received request cmpl-b43ddd6ec80f436fb70f731d0da10d7d-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:41:17 [async_llm.py:261] Added request cmpl-b43ddd6ec80f436fb70f731d0da10d7d-0.
INFO 03-02 00:41:18 [loggers.py:116] Engine 000: Avg prompt throughput: 27.9 tokens/s, Avg generation throughput: 4.5 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.6%
INFO 03-02 00:41:19 [logger.py:42] Received request cmpl-bdebc5ce37b842c19145d1273c43778f-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:41:19 [async_llm.py:261] Added request cmpl-bdebc5ce37b842c19145d1273c43778f-0.
INFO 03-02 00:41:20 [logger.py:42] Received request cmpl-4424057a0d284983926e32fc8abc8d21-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:41:20 [async_llm.py:261] Added request cmpl-4424057a0d284983926e32fc8abc8d21-0.
INFO 03-02 00:41:21 [logger.py:42] Received request cmpl-3c96545e28924c50a737d0b8dbaeded5-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:41:21 [async_llm.py:261] Added request cmpl-3c96545e28924c50a737d0b8dbaeded5-0.
INFO 03-02 00:41:22 [logger.py:42] Received request cmpl-fbf9fa3de02c4e5b83b1ceb02eb3e244-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:41:22 [async_llm.py:261] Added request cmpl-fbf9fa3de02c4e5b83b1ceb02eb3e244-0.
INFO 03-02 00:41:23 [logger.py:42] Received request cmpl-566aa7d590944695bcb875ce7d958029-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:41:23 [async_llm.py:261] Added request cmpl-566aa7d590944695bcb875ce7d958029-0.
INFO 03-02 00:41:24 [logger.py:42] Received request cmpl-b8c8994a20c7411988f7b489c24c0c25-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:41:24 [async_llm.py:261] Added request cmpl-b8c8994a20c7411988f7b489c24c0c25-0.
INFO 03-02 00:41:26 [logger.py:42] Received request cmpl-b29b1000ff9e435cb7f3f6fc97a2b1f8-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:41:26 [async_llm.py:261] Added request cmpl-b29b1000ff9e435cb7f3f6fc97a2b1f8-0.
INFO 03-02 00:41:27 [logger.py:42] Received request cmpl-bd4405bc818644fa8d8ca80910feb6ee-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:41:27 [async_llm.py:261] Added request cmpl-bd4405bc818644fa8d8ca80910feb6ee-0.
INFO 03-02 00:41:28 [loggers.py:116] Engine 000: Avg prompt throughput: 24.8 tokens/s, Avg generation throughput: 4.0 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.6%
INFO 03-02 00:41:28 [logger.py:42] Received request cmpl-e1ee7e60b1af4a0cb98a6e92be14e976-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:41:28 [async_llm.py:261] Added request cmpl-e1ee7e60b1af4a0cb98a6e92be14e976-0.
INFO 03-02 00:41:29 [logger.py:42] Received request cmpl-37ca42a164e34d1e9c66307dda114bbd-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:41:29 [async_llm.py:261] Added request cmpl-37ca42a164e34d1e9c66307dda114bbd-0.
INFO 03-02 00:41:30 [logger.py:42] Received request cmpl-772fc5d2ebc74f589d9c28ac6133078a-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:41:30 [async_llm.py:261] Added request cmpl-772fc5d2ebc74f589d9c28ac6133078a-0.
INFO 03-02 00:41:31 [logger.py:42] Received request cmpl-1b5bb498f7ab4c828e4d8698223ac82a-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:41:31 [async_llm.py:261] Added request cmpl-1b5bb498f7ab4c828e4d8698223ac82a-0.
INFO 03-02 00:41:32 [logger.py:42] Received request cmpl-8285841632c4437dafaa0ab3ccb6cd8f-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:41:32 [async_llm.py:261] Added request cmpl-8285841632c4437dafaa0ab3ccb6cd8f-0.
INFO 03-02 00:41:34 [logger.py:42] Received request cmpl-e37a557c10ea423eae1f606650540277-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:41:34 [async_llm.py:261] Added request cmpl-e37a557c10ea423eae1f606650540277-0.
INFO 03-02 00:41:35 [logger.py:42] Received request cmpl-2702108607ef48818f895cca5f995a11-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:41:35 [async_llm.py:261] Added request cmpl-2702108607ef48818f895cca5f995a11-0.
INFO 03-02 00:41:36 [logger.py:42] Received request cmpl-7a01fa110ee24d58bccd2a277f77d4e4-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:41:36 [async_llm.py:261] Added request cmpl-7a01fa110ee24d58bccd2a277f77d4e4-0.
INFO 03-02 00:41:37 [logger.py:42] Received request cmpl-d54afe849a2d47eaaa617f47dab72459-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:41:37 [async_llm.py:261] Added request cmpl-d54afe849a2d47eaaa617f47dab72459-0.
INFO 03-02 00:41:38 [loggers.py:116] Engine 000: Avg prompt throughput: 27.9 tokens/s, Avg generation throughput: 4.5 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.6%
INFO 03-02 00:41:38 [logger.py:42] Received request cmpl-fcb5af4f875c4cdb9118cf1d06f70809-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:41:38 [async_llm.py:261] Added request cmpl-fcb5af4f875c4cdb9118cf1d06f70809-0.
INFO 03-02 00:41:39 [logger.py:42] Received request cmpl-3bd57d3ab4da4a95b0360b301c5659d1-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:41:39 [async_llm.py:261] Added request cmpl-3bd57d3ab4da4a95b0360b301c5659d1-0.
INFO 03-02 00:41:41 [logger.py:42] Received request cmpl-e3b9b25d9a8244698648ef7939add1e2-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:41:41 [async_llm.py:261] Added request cmpl-e3b9b25d9a8244698648ef7939add1e2-0.
INFO 03-02 00:41:42 [logger.py:42] Received request cmpl-8903e4c80de342cebf596859990bfc3b-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:41:42 [async_llm.py:261] Added request cmpl-8903e4c80de342cebf596859990bfc3b-0.
INFO 03-02 00:41:43 [logger.py:42] Received request cmpl-d9dd9ca906b2459899e04d3b3a28446f-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:41:43 [async_llm.py:261] Added request cmpl-d9dd9ca906b2459899e04d3b3a28446f-0.
INFO 03-02 00:41:44 [logger.py:42] Received request cmpl-ec5b08d1a8c449e396a3d284b7d7964d-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:41:44 [async_llm.py:261] Added request cmpl-ec5b08d1a8c449e396a3d284b7d7964d-0.
INFO 03-02 00:41:45 [logger.py:42] Received request cmpl-b484418edd5f4fe48e70dc7d4f661641-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:41:45 [async_llm.py:261] Added request cmpl-b484418edd5f4fe48e70dc7d4f661641-0.
INFO 03-02 00:41:46 [logger.py:42] Received request cmpl-9cb73da60cad4c83bed6c1c686e43e6f-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:41:46 [async_llm.py:261] Added request cmpl-9cb73da60cad4c83bed6c1c686e43e6f-0.
INFO 03-02 00:41:48 [logger.py:42] Received request cmpl-4d0f5d1050824270a377b46fbd9f3df9-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:41:48 [async_llm.py:261] Added request cmpl-4d0f5d1050824270a377b46fbd9f3df9-0.
INFO 03-02 00:41:48 [loggers.py:116] Engine 000: Avg prompt throughput: 27.9 tokens/s, Avg generation throughput: 4.2 tokens/s, Running: 1 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.3%, Prefix cache hit rate: 51.6%
INFO 03-02 00:41:49 [logger.py:42] Received request cmpl-b96393b5f28242e98fd2b988d93f6756-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:41:49 [async_llm.py:261] Added request cmpl-b96393b5f28242e98fd2b988d93f6756-0.
INFO 03-02 00:41:50 [logger.py:42] Received request cmpl-b68575a8f69f467e91fc4a553b1c8b98-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:41:50 [async_llm.py:261] Added request cmpl-b68575a8f69f467e91fc4a553b1c8b98-0.
INFO 03-02 00:41:51 [logger.py:42] Received request cmpl-1c599381aa8a46bb865e6ca77968c798-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:41:51 [async_llm.py:261] Added request cmpl-1c599381aa8a46bb865e6ca77968c798-0.
INFO 03-02 00:41:52 [logger.py:42] Received request cmpl-f52f5ee50ef243e8ab947653743add2b-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:41:52 [async_llm.py:261] Added request cmpl-f52f5ee50ef243e8ab947653743add2b-0.
INFO 03-02 00:41:53 [logger.py:42] Received request cmpl-09a197bab2824baaaa7362824df576a0-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:41:53 [async_llm.py:261] Added request cmpl-09a197bab2824baaaa7362824df576a0-0.
INFO 03-02 00:41:54 [logger.py:42] Received request cmpl-ee3dfec52c8942a4a2d85e3c239e87c0-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:41:54 [async_llm.py:261] Added request cmpl-ee3dfec52c8942a4a2d85e3c239e87c0-0.
INFO 03-02 00:41:56 [logger.py:42] Received request cmpl-90a4382566274603806cd124e8901cbd-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:41:56 [async_llm.py:261] Added request cmpl-90a4382566274603806cd124e8901cbd-0.
INFO 03-02 00:41:57 [logger.py:42] Received request cmpl-d38c47c87018448fb0bab748e5b12e35-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:41:57 [async_llm.py:261] Added request cmpl-d38c47c87018448fb0bab748e5b12e35-0.
INFO 03-02 00:41:58 [loggers.py:116] Engine 000: Avg prompt throughput: 24.8 tokens/s, Avg generation throughput: 4.3 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.6%
INFO 03-02 00:41:58 [logger.py:42] Received request cmpl-c17b46b5567a4a5aae4d141e4a5b6022-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:41:58 [async_llm.py:261] Added request cmpl-c17b46b5567a4a5aae4d141e4a5b6022-0.
INFO 03-02 00:41:59 [logger.py:42] Received request cmpl-9b0567dfa96547e8be2f2e15c89a729a-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:41:59 [async_llm.py:261] Added request cmpl-9b0567dfa96547e8be2f2e15c89a729a-0.
INFO 03-02 00:42:00 [logger.py:42] Received request cmpl-46f03094b141436d8e4986f33f9d318b-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:42:00 [async_llm.py:261] Added request cmpl-46f03094b141436d8e4986f33f9d318b-0.
INFO 03-02 00:42:01 [logger.py:42] Received request cmpl-f0bc27dafe8442e9b66e889dc9786741-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:42:01 [async_llm.py:261] Added request cmpl-f0bc27dafe8442e9b66e889dc9786741-0.
INFO 03-02 00:42:03 [logger.py:42] Received request cmpl-96e0c0075f7e46b2933401f75223640a-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:42:03 [async_llm.py:261] Added request cmpl-96e0c0075f7e46b2933401f75223640a-0.
INFO 03-02 00:42:04 [logger.py:42] Received request cmpl-bcd44e98410d4e4499ede5246452eedb-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:42:04 [async_llm.py:261] Added request cmpl-bcd44e98410d4e4499ede5246452eedb-0.
INFO 03-02 00:42:05 [logger.py:42] Received request cmpl-43d90de3787044c782b2699bf0850ac7-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:42:05 [async_llm.py:261] Added request cmpl-43d90de3787044c782b2699bf0850ac7-0.
INFO 03-02 00:42:06 [logger.py:42] Received request cmpl-4640b21db3cc4bda9612c7233ec30389-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:42:06 [async_llm.py:261] Added request cmpl-4640b21db3cc4bda9612c7233ec30389-0.
INFO 03-02 00:42:07 [logger.py:42] Received request cmpl-a7caaaccf82743bebd583a035e38b2fa-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:42:07 [async_llm.py:261] Added request cmpl-a7caaaccf82743bebd583a035e38b2fa-0.
INFO 03-02 00:42:08 [loggers.py:116] Engine 000: Avg prompt throughput: 27.9 tokens/s, Avg generation throughput: 4.5 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.6%
INFO 03-02 00:42:08 [logger.py:42] Received request cmpl-06770c5085484fe0a190963715cbc516-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:42:08 [async_llm.py:261] Added request cmpl-06770c5085484fe0a190963715cbc516-0.
INFO 03-02 00:42:09 [logger.py:42] Received request cmpl-497419fc4f78469db875dd92fc4e97e7-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:42:09 [async_llm.py:261] Added request cmpl-497419fc4f78469db875dd92fc4e97e7-0.
INFO 03-02 00:42:11 [logger.py:42] Received request cmpl-b7d3e5b5804040388df569956182e057-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:42:11 [async_llm.py:261] Added request cmpl-b7d3e5b5804040388df569956182e057-0.
INFO 03-02 00:42:12 [logger.py:42] Received request cmpl-18e70d79a49d47ebbf73a9e4f35a10c6-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:42:12 [async_llm.py:261] Added request cmpl-18e70d79a49d47ebbf73a9e4f35a10c6-0.
INFO 03-02 00:42:13 [logger.py:42] Received request cmpl-16590e32e4394073884dfed540954c29-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:42:13 [async_llm.py:261] Added request cmpl-16590e32e4394073884dfed540954c29-0.
INFO 03-02 00:42:14 [logger.py:42] Received request cmpl-85ac1e25e06540868dd55c85ace26ef5-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:42:14 [async_llm.py:261] Added request cmpl-85ac1e25e06540868dd55c85ace26ef5-0.
INFO 03-02 00:42:15 [logger.py:42] Received request cmpl-9546a04573f543f2bf0dba9914becfde-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:42:15 [async_llm.py:261] Added request cmpl-9546a04573f543f2bf0dba9914becfde-0.
INFO 03-02 00:42:16 [logger.py:42] Received request cmpl-1ba8d23507924ba0ab06e5a8210832bf-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:42:16 [async_llm.py:261] Added request cmpl-1ba8d23507924ba0ab06e5a8210832bf-0.
INFO 03-02 00:42:18 [logger.py:42] Received request cmpl-b356d72d08f1472a9cec28aaa7c49294-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:42:18 [async_llm.py:261] Added request cmpl-b356d72d08f1472a9cec28aaa7c49294-0.
INFO 03-02 00:42:18 [loggers.py:116] Engine 000: Avg prompt throughput: 27.9 tokens/s, Avg generation throughput: 4.3 tokens/s, Running: 1 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.4%, Prefix cache hit rate: 51.6%
INFO 03-02 00:42:19 [logger.py:42] Received request cmpl-a6ad9af62bc847979efa957b1d4d45c1-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:42:19 [async_llm.py:261] Added request cmpl-a6ad9af62bc847979efa957b1d4d45c1-0.
INFO 03-02 00:42:20 [logger.py:42] Received request cmpl-bc9ef36464bb4cf5b95d0b3520a5874f-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:42:20 [async_llm.py:261] Added request cmpl-bc9ef36464bb4cf5b95d0b3520a5874f-0.
INFO 03-02 00:42:21 [logger.py:42] Received request cmpl-069c29c5813a47cbac754d86b2dd07af-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:42:21 [async_llm.py:261] Added request cmpl-069c29c5813a47cbac754d86b2dd07af-0.
INFO 03-02 00:42:22 [logger.py:42] Received request cmpl-9e98f889488d48b4a8f803e53b4866cc-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:42:22 [async_llm.py:261] Added request cmpl-9e98f889488d48b4a8f803e53b4866cc-0.
INFO 03-02 00:42:23 [logger.py:42] Received request cmpl-d44fb6587a55448790532a132743a9db-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:42:23 [async_llm.py:261] Added request cmpl-d44fb6587a55448790532a132743a9db-0.
INFO 03-02 00:42:24 [logger.py:42] Received request cmpl-cdbe7b82066240f68e56d3a1b8e01ebd-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:42:24 [async_llm.py:261] Added request cmpl-cdbe7b82066240f68e56d3a1b8e01ebd-0.
INFO 03-02 00:42:26 [logger.py:42] Received request cmpl-9387fbbb5f854d48b8511d65cf7dd388-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:42:26 [async_llm.py:261] Added request cmpl-9387fbbb5f854d48b8511d65cf7dd388-0.
INFO 03-02 00:42:27 [logger.py:42] Received request cmpl-15704bd93ab34b2ab2d0c9f95aae8862-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:42:27 [async_llm.py:261] Added request cmpl-15704bd93ab34b2ab2d0c9f95aae8862-0.
INFO 03-02 00:42:28 [loggers.py:116] Engine 000: Avg prompt throughput: 24.8 tokens/s, Avg generation throughput: 4.2 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.6%
INFO 03-02 00:42:28 [logger.py:42] Received request cmpl-87f99bcf1a4842c29a807f669de8a216-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:42:28 [async_llm.py:261] Added request cmpl-87f99bcf1a4842c29a807f669de8a216-0.
INFO 03-02 00:42:29 [logger.py:42] Received request cmpl-6716bb5ea80e4f0b9c6ee53907eecb05-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:42:29 [async_llm.py:261] Added request cmpl-6716bb5ea80e4f0b9c6ee53907eecb05-0.
INFO 03-02 00:42:30 [logger.py:42] Received request cmpl-e41749bba2254aa89de0a6aa88f64fd4-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:42:30 [async_llm.py:261] Added request cmpl-e41749bba2254aa89de0a6aa88f64fd4-0.
INFO 03-02 00:42:31 [logger.py:42] Received request cmpl-fc33dfcbf73548d386b4ae2946bb65bd-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:42:31 [async_llm.py:261] Added request cmpl-fc33dfcbf73548d386b4ae2946bb65bd-0.
INFO 03-02 00:42:32 [logger.py:42] Received request cmpl-13be3793fe37443cbc868d444333c9e7-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:42:32 [async_llm.py:261] Added request cmpl-13be3793fe37443cbc868d444333c9e7-0.
INFO 03-02 00:42:34 [logger.py:42] Received request cmpl-36b3155b2054479aa7a8edd85cbdffe2-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:42:34 [async_llm.py:261] Added request cmpl-36b3155b2054479aa7a8edd85cbdffe2-0.
INFO 03-02 00:42:35 [logger.py:42] Received request cmpl-4b17f12d9b6e4ef7a79f7174bae7bbf7-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:42:35 [async_llm.py:261] Added request cmpl-4b17f12d9b6e4ef7a79f7174bae7bbf7-0.
INFO 03-02 00:42:36 [logger.py:42] Received request cmpl-2adfd8863f914cf8b71c0b6968987c47-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:42:36 [async_llm.py:261] Added request cmpl-2adfd8863f914cf8b71c0b6968987c47-0.
INFO 03-02 00:42:37 [logger.py:42] Received request cmpl-47d6c4a367dd416ab7ee2b69d02d3f2c-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:42:37 [async_llm.py:261] Added request cmpl-47d6c4a367dd416ab7ee2b69d02d3f2c-0.
INFO 03-02 00:42:38 [loggers.py:116] Engine 000: Avg prompt throughput: 27.9 tokens/s, Avg generation throughput: 4.5 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.6%
INFO 03-02 00:42:38 [logger.py:42] Received request cmpl-a2b4b33f29d742f6b3f1ee80a2607b04-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:42:38 [async_llm.py:261] Added request cmpl-a2b4b33f29d742f6b3f1ee80a2607b04-0.
INFO 03-02 00:42:39 [logger.py:42] Received request cmpl-cf70297488f54822a25fc2fe02a788a0-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:42:39 [async_llm.py:261] Added request cmpl-cf70297488f54822a25fc2fe02a788a0-0.
INFO 03-02 00:42:41 [logger.py:42] Received request cmpl-ba99165b73f24d869d5ee0b91ef05288-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:42:41 [async_llm.py:261] Added request cmpl-ba99165b73f24d869d5ee0b91ef05288-0.
INFO 03-02 00:42:42 [logger.py:42] Received request cmpl-d55251603e624adb925b83b600eadbf4-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:42:42 [async_llm.py:261] Added request cmpl-d55251603e624adb925b83b600eadbf4-0.
INFO 03-02 00:42:43 [logger.py:42] Received request cmpl-b1ed828d5f8342fc8c4fea9282fb1a6c-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:42:43 [async_llm.py:261] Added request cmpl-b1ed828d5f8342fc8c4fea9282fb1a6c-0.
INFO 03-02 00:42:44 [logger.py:42] Received request cmpl-326a95ccd02346bab27dc8f6c91847d3-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:42:44 [async_llm.py:261] Added request cmpl-326a95ccd02346bab27dc8f6c91847d3-0.
INFO 03-02 00:42:45 [logger.py:42] Received request cmpl-e875eba2000d4b9fa7406c61dd1065f0-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:42:45 [async_llm.py:261] Added request cmpl-e875eba2000d4b9fa7406c61dd1065f0-0.
INFO 03-02 00:42:46 [logger.py:42] Received request cmpl-99a4da4bfdf041b29f7b2a6d2ded6c28-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:42:46 [async_llm.py:261] Added request cmpl-99a4da4bfdf041b29f7b2a6d2ded6c28-0.
INFO 03-02 00:42:47 [logger.py:42] Received request cmpl-505f3415ce4c42a69be3fb17c774691b-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:42:47 [async_llm.py:261] Added request cmpl-505f3415ce4c42a69be3fb17c774691b-0.
INFO 03-02 00:42:48 [loggers.py:116] Engine 000: Avg prompt throughput: 27.9 tokens/s, Avg generation throughput: 4.3 tokens/s, Running: 1 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.4%, Prefix cache hit rate: 51.6%
INFO 03-02 00:42:49 [logger.py:42] Received request cmpl-165e45b48c874ce0b5a4917440038a1d-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:42:49 [async_llm.py:261] Added request cmpl-165e45b48c874ce0b5a4917440038a1d-0.
INFO 03-02 00:42:50 [logger.py:42] Received request cmpl-ecda4eec0f7145b5a6b5e78521fd0aa8-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:42:50 [async_llm.py:261] Added request cmpl-ecda4eec0f7145b5a6b5e78521fd0aa8-0.
INFO 03-02 00:42:51 [logger.py:42] Received request cmpl-777cd87066e14522a22070c12a9fd0ec-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:42:51 [async_llm.py:261] Added request cmpl-777cd87066e14522a22070c12a9fd0ec-0.
INFO 03-02 00:42:52 [logger.py:42] Received request cmpl-6588e4c36c804fa9a284489580874ef1-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:42:52 [async_llm.py:261] Added request cmpl-6588e4c36c804fa9a284489580874ef1-0.
INFO 03-02 00:42:53 [logger.py:42] Received request cmpl-1be52eaa7f4440f48a9f6acefd5b6fc3-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:42:53 [async_llm.py:261] Added request cmpl-1be52eaa7f4440f48a9f6acefd5b6fc3-0.
INFO 03-02 00:42:54 [logger.py:42] Received request cmpl-c2d1cd676a9d49b6871a2025ddc2ff03-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:42:54 [async_llm.py:261] Added request cmpl-c2d1cd676a9d49b6871a2025ddc2ff03-0.
INFO 03-02 00:42:56 [logger.py:42] Received request cmpl-9dabb595991c4a32b5a228bb7de68738-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:42:56 [async_llm.py:261] Added request cmpl-9dabb595991c4a32b5a228bb7de68738-0.
INFO 03-02 00:42:57 [logger.py:42] Received request cmpl-fe459dcfb1b6421b9ad5dea3e2f227e3-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:42:57 [async_llm.py:261] Added request cmpl-fe459dcfb1b6421b9ad5dea3e2f227e3-0.
INFO 03-02 00:42:58 [loggers.py:116] Engine 000: Avg prompt throughput: 24.8 tokens/s, Avg generation throughput: 4.2 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.6%
INFO 03-02 00:42:58 [logger.py:42] Received request cmpl-04709423940941bfa8196fc89e1ee5f7-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:42:58 [async_llm.py:261] Added request cmpl-04709423940941bfa8196fc89e1ee5f7-0.
INFO 03-02 00:42:59 [logger.py:42] Received request cmpl-7cb38d1884d7462190d0e29c8a67d018-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:42:59 [async_llm.py:261] Added request cmpl-7cb38d1884d7462190d0e29c8a67d018-0.
INFO 03-02 00:43:00 [logger.py:42] Received request cmpl-0d1154456f394521ba09330e377ef214-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:43:00 [async_llm.py:261] Added request cmpl-0d1154456f394521ba09330e377ef214-0.
INFO 03-02 00:43:01 [logger.py:42] Received request cmpl-391f2e902e8e41bcbd7a6f91e3ac3080-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:43:01 [async_llm.py:261] Added request cmpl-391f2e902e8e41bcbd7a6f91e3ac3080-0.
INFO 03-02 00:43:02 [logger.py:42] Received request cmpl-f3879421f2f94971924376ff0e4e752c-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:43:02 [async_llm.py:261] Added request cmpl-f3879421f2f94971924376ff0e4e752c-0.
INFO 03-02 00:43:04 [logger.py:42] Received request cmpl-12dcd8abbdd64df697dbf8c17d521ccd-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:43:04 [async_llm.py:261] Added request cmpl-12dcd8abbdd64df697dbf8c17d521ccd-0.
INFO 03-02 00:43:05 [logger.py:42] Received request cmpl-1eb315fa08ae4312bc60f785866c94ef-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:43:05 [async_llm.py:261] Added request cmpl-1eb315fa08ae4312bc60f785866c94ef-0.
INFO 03-02 00:43:06 [logger.py:42] Received request cmpl-d6c00ec49e6a45e08777936c242dd428-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:43:06 [async_llm.py:261] Added request cmpl-d6c00ec49e6a45e08777936c242dd428-0.
INFO 03-02 00:43:07 [logger.py:42] Received request cmpl-022b9032bcfa4c98915008debd41b623-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:43:07 [async_llm.py:261] Added request cmpl-022b9032bcfa4c98915008debd41b623-0.
INFO 03-02 00:43:08 [loggers.py:116] Engine 000: Avg prompt throughput: 27.9 tokens/s, Avg generation throughput: 4.5 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.6%
INFO 03-02 00:43:08 [logger.py:42] Received request cmpl-34ffba964a6046a389e255fb4439f526-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:43:08 [async_llm.py:261] Added request cmpl-34ffba964a6046a389e255fb4439f526-0.
INFO 03-02 00:43:09 [logger.py:42] Received request cmpl-3d3320991e9a4423866c7a69f10c8449-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:43:09 [async_llm.py:261] Added request cmpl-3d3320991e9a4423866c7a69f10c8449-0.
INFO 03-02 00:43:11 [logger.py:42] Received request cmpl-c114bdd645d74aab92fbd468fe1b2a1f-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:43:11 [async_llm.py:261] Added request cmpl-c114bdd645d74aab92fbd468fe1b2a1f-0.
INFO 03-02 00:43:12 [logger.py:42] Received request cmpl-5b236213c0ba4ea4975819572a99c7e2-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:43:12 [async_llm.py:261] Added request cmpl-5b236213c0ba4ea4975819572a99c7e2-0.
INFO 03-02 00:43:13 [logger.py:42] Received request cmpl-9879111e753a43b4bd0507ac93a6e735-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:43:13 [async_llm.py:261] Added request cmpl-9879111e753a43b4bd0507ac93a6e735-0.
INFO 03-02 00:43:14 [logger.py:42] Received request cmpl-c3a58862fca3492c98710419c925af2a-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:43:14 [async_llm.py:261] Added request cmpl-c3a58862fca3492c98710419c925af2a-0.
INFO 03-02 00:43:15 [logger.py:42] Received request cmpl-93dd8988676243f5b55186a4aae8e3ee-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:43:15 [async_llm.py:261] Added request cmpl-93dd8988676243f5b55186a4aae8e3ee-0.
INFO 03-02 00:43:16 [logger.py:42] Received request cmpl-c0d4ebd418a6429c81179baafcdf1f4a-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:43:16 [async_llm.py:261] Added request cmpl-c0d4ebd418a6429c81179baafcdf1f4a-0.
INFO 03-02 00:43:17 [logger.py:42] Received request cmpl-e5cbcf741097405c8b4305fd9fc8a09a-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:43:17 [async_llm.py:261] Added request cmpl-e5cbcf741097405c8b4305fd9fc8a09a-0.
INFO 03-02 00:43:18 [loggers.py:116] Engine 000: Avg prompt throughput: 27.9 tokens/s, Avg generation throughput: 4.4 tokens/s, Running: 1 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.4%, Prefix cache hit rate: 51.6%
INFO 03-02 00:43:19 [logger.py:42] Received request cmpl-18356299637b4ee4bc7c7858728dcd19-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:43:19 [async_llm.py:261] Added request cmpl-18356299637b4ee4bc7c7858728dcd19-0.
INFO 03-02 00:43:20 [logger.py:42] Received request cmpl-e4f18805a2ee467fbcb89ad7784926be-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:43:20 [async_llm.py:261] Added request cmpl-e4f18805a2ee467fbcb89ad7784926be-0.
INFO 03-02 00:43:21 [logger.py:42] Received request cmpl-d49bbe520143443a95968a561e214dbd-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:43:21 [async_llm.py:261] Added request cmpl-d49bbe520143443a95968a561e214dbd-0.
INFO 03-02 00:43:22 [logger.py:42] Received request cmpl-77b63065dc7c4ce3ac28c06abebb13b3-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:43:22 [async_llm.py:261] Added request cmpl-77b63065dc7c4ce3ac28c06abebb13b3-0.
INFO 03-02 00:43:23 [logger.py:42] Received request cmpl-9ab40e929c1846f4ae3e13674996d36c-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:43:23 [async_llm.py:261] Added request cmpl-9ab40e929c1846f4ae3e13674996d36c-0.
INFO 03-02 00:43:24 [logger.py:42] Received request cmpl-0b478b0552874f80a354a9ac60c9ce44-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:43:24 [async_llm.py:261] Added request cmpl-0b478b0552874f80a354a9ac60c9ce44-0.
INFO 03-02 00:43:26 [logger.py:42] Received request cmpl-b32753846f584f01b0a3ddf284c4d11f-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:43:26 [async_llm.py:261] Added request cmpl-b32753846f584f01b0a3ddf284c4d11f-0.
INFO 03-02 00:43:27 [logger.py:42] Received request cmpl-375e56793fa240059af30ae5736eff80-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:43:27 [async_llm.py:261] Added request cmpl-375e56793fa240059af30ae5736eff80-0.
INFO 03-02 00:43:28 [loggers.py:116] Engine 000: Avg prompt throughput: 24.8 tokens/s, Avg generation throughput: 4.1 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.6%
INFO 03-02 00:43:28 [logger.py:42] Received request cmpl-213c25caf1bb400fa4ee03648bef7955-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:43:28 [async_llm.py:261] Added request cmpl-213c25caf1bb400fa4ee03648bef7955-0.
INFO 03-02 00:43:29 [logger.py:42] Received request cmpl-5978cc79239045809fd7f0ca4f656b76-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:43:29 [async_llm.py:261] Added request cmpl-5978cc79239045809fd7f0ca4f656b76-0.
INFO 03-02 00:43:30 [logger.py:42] Received request cmpl-24dea26711c443f49fe37697500e5efd-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:43:30 [async_llm.py:261] Added request cmpl-24dea26711c443f49fe37697500e5efd-0.
INFO 03-02 00:43:31 [logger.py:42] Received request cmpl-bdb1c0c2506f4d4cbddac037283436ba-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:43:31 [async_llm.py:261] Added request cmpl-bdb1c0c2506f4d4cbddac037283436ba-0.
INFO 03-02 00:43:32 [logger.py:42] Received request cmpl-f3bfaffe6d0c40108130725c3ff5cf65-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:43:32 [async_llm.py:261] Added request cmpl-f3bfaffe6d0c40108130725c3ff5cf65-0.
INFO 03-02 00:43:34 [logger.py:42] Received request cmpl-858bb53bd0b349babe3c86600092fd34-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:43:34 [async_llm.py:261] Added request cmpl-858bb53bd0b349babe3c86600092fd34-0.
INFO 03-02 00:43:35 [logger.py:42] Received request cmpl-2cf4539586a940ad8abad544c47a3695-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:43:35 [async_llm.py:261] Added request cmpl-2cf4539586a940ad8abad544c47a3695-0.
INFO 03-02 00:43:36 [logger.py:42] Received request cmpl-8909779d356740158edec1a74f7c1d51-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:43:36 [async_llm.py:261] Added request cmpl-8909779d356740158edec1a74f7c1d51-0.
INFO 03-02 00:43:37 [logger.py:42] Received request cmpl-c8503c7e362147b19b4c9678138b0ae6-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:43:37 [async_llm.py:261] Added request cmpl-c8503c7e362147b19b4c9678138b0ae6-0.
INFO 03-02 00:43:38 [loggers.py:116] Engine 000: Avg prompt throughput: 27.9 tokens/s, Avg generation throughput: 4.5 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.6%
INFO 03-02 00:43:38 [logger.py:42] Received request cmpl-1f292025b6e843c2b3cd1e23a8550f78-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:43:38 [async_llm.py:261] Added request cmpl-1f292025b6e843c2b3cd1e23a8550f78-0.
INFO 03-02 00:43:39 [logger.py:42] Received request cmpl-e983f72fc1c74ca2a2593c53cd9cf941-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:43:39 [async_llm.py:261] Added request cmpl-e983f72fc1c74ca2a2593c53cd9cf941-0.
INFO 03-02 00:43:41 [logger.py:42] Received request cmpl-d33a19758f60423b91012bd3d628d30a-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:43:41 [async_llm.py:261] Added request cmpl-d33a19758f60423b91012bd3d628d30a-0.
INFO 03-02 00:43:42 [logger.py:42] Received request cmpl-7676bb89a7614e1c88249f064f36e907-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:43:42 [async_llm.py:261] Added request cmpl-7676bb89a7614e1c88249f064f36e907-0.
INFO 03-02 00:43:43 [logger.py:42] Received request cmpl-5b557f6297dc400392bbea4d46d8aa1b-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:43:43 [async_llm.py:261] Added request cmpl-5b557f6297dc400392bbea4d46d8aa1b-0.
INFO 03-02 00:43:44 [logger.py:42] Received request cmpl-d5a546fd1f524b1b974cb80cbff356df-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:43:44 [async_llm.py:261] Added request cmpl-d5a546fd1f524b1b974cb80cbff356df-0.
INFO 03-02 00:43:45 [logger.py:42] Received request cmpl-b928334f80f74058841684d53a8690bf-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:43:45 [async_llm.py:261] Added request cmpl-b928334f80f74058841684d53a8690bf-0.
INFO 03-02 00:43:46 [logger.py:42] Received request cmpl-19e332ad513c4643937cda822a69956f-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:43:46 [async_llm.py:261] Added request cmpl-19e332ad513c4643937cda822a69956f-0.
INFO 03-02 00:43:47 [logger.py:42] Received request cmpl-c0aea50a02fe4c7a87d386eec3099de3-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:43:47 [async_llm.py:261] Added request cmpl-c0aea50a02fe4c7a87d386eec3099de3-0.
INFO 03-02 00:43:48 [loggers.py:116] Engine 000: Avg prompt throughput: 27.9 tokens/s, Avg generation throughput: 4.5 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.6%
INFO 03-02 00:43:49 [logger.py:42] Received request cmpl-f9bb1ce9ca274ad18aaae0a08872bcc2-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:43:49 [async_llm.py:261] Added request cmpl-f9bb1ce9ca274ad18aaae0a08872bcc2-0.
INFO 03-02 00:43:50 [logger.py:42] Received request cmpl-ef932a928fb64b6ea3389ffe9ea4990b-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:43:50 [async_llm.py:261] Added request cmpl-ef932a928fb64b6ea3389ffe9ea4990b-0.
INFO 03-02 00:43:51 [logger.py:42] Received request cmpl-d972f09bc5744907aef1004834619b80-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:43:51 [async_llm.py:261] Added request cmpl-d972f09bc5744907aef1004834619b80-0.
INFO 03-02 00:43:52 [logger.py:42] Received request cmpl-5d6c6e1f32534912a0931bb253384cbb-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:43:52 [async_llm.py:261] Added request cmpl-5d6c6e1f32534912a0931bb253384cbb-0.
INFO 03-02 00:43:53 [logger.py:42] Received request cmpl-8862630a8b6146fb82cbfaa6b4f77910-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:43:53 [async_llm.py:261] Added request cmpl-8862630a8b6146fb82cbfaa6b4f77910-0.
INFO 03-02 00:43:54 [logger.py:42] Received request cmpl-d98f485761d045f5b47458b11ae4b5d1-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:43:54 [async_llm.py:261] Added request cmpl-d98f485761d045f5b47458b11ae4b5d1-0.
INFO 03-02 00:43:56 [logger.py:42] Received request cmpl-d5b1304bd3834929a1426d6bb27193b1-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:43:56 [async_llm.py:261] Added request cmpl-d5b1304bd3834929a1426d6bb27193b1-0.
INFO 03-02 00:43:57 [logger.py:42] Received request cmpl-9824762a5ed74055a137311175503252-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:43:57 [async_llm.py:261] Added request cmpl-9824762a5ed74055a137311175503252-0.
INFO 03-02 00:43:58 [loggers.py:116] Engine 000: Avg prompt throughput: 24.8 tokens/s, Avg generation throughput: 4.0 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.6%
INFO 03-02 00:43:58 [logger.py:42] Received request cmpl-79a7e191b16e4959b41b2d96268e3613-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:43:58 [async_llm.py:261] Added request cmpl-79a7e191b16e4959b41b2d96268e3613-0.
INFO 03-02 00:43:59 [logger.py:42] Received request cmpl-8694d69ebc7148baa7ea4d644dc16939-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:43:59 [async_llm.py:261] Added request cmpl-8694d69ebc7148baa7ea4d644dc16939-0.
INFO 03-02 00:44:00 [logger.py:42] Received request cmpl-362ff9a383d64558a81608744c6e76f3-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:44:00 [async_llm.py:261] Added request cmpl-362ff9a383d64558a81608744c6e76f3-0.
INFO 03-02 00:44:01 [logger.py:42] Received request cmpl-e169fa3e5fd04f728d53e6ea1998231f-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:44:01 [async_llm.py:261] Added request cmpl-e169fa3e5fd04f728d53e6ea1998231f-0.
INFO 03-02 00:44:02 [logger.py:42] Received request cmpl-d308bd868cc04b94bc1e1836b69349f9-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:44:02 [async_llm.py:261] Added request cmpl-d308bd868cc04b94bc1e1836b69349f9-0.
INFO 03-02 00:44:04 [logger.py:42] Received request cmpl-f1bc539d3af842c29e939df1804d98b5-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:44:04 [async_llm.py:261] Added request cmpl-f1bc539d3af842c29e939df1804d98b5-0.
INFO 03-02 00:44:05 [logger.py:42] Received request cmpl-a64e92dcacbd403d9c63ad2951381caf-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:44:05 [async_llm.py:261] Added request cmpl-a64e92dcacbd403d9c63ad2951381caf-0.
INFO 03-02 00:44:06 [logger.py:42] Received request cmpl-ddde9969c8ef44d9bb8c86e3f3c06999-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:44:06 [async_llm.py:261] Added request cmpl-ddde9969c8ef44d9bb8c86e3f3c06999-0.
INFO 03-02 00:44:07 [logger.py:42] Received request cmpl-caa2a1530b0e4f87a6b4a3dd367a6fcf-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:44:07 [async_llm.py:261] Added request cmpl-caa2a1530b0e4f87a6b4a3dd367a6fcf-0.
INFO 03-02 00:44:08 [loggers.py:116] Engine 000: Avg prompt throughput: 27.9 tokens/s, Avg generation throughput: 4.5 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.6%
INFO 03-02 00:44:08 [logger.py:42] Received request cmpl-104eb7ccedc4448eb40cbf16ff103b0e-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:44:08 [async_llm.py:261] Added request cmpl-104eb7ccedc4448eb40cbf16ff103b0e-0.
INFO 03-02 00:44:09 [logger.py:42] Received request cmpl-84b04edacac54afa9c65327f0a3975ff-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:44:09 [async_llm.py:261] Added request cmpl-84b04edacac54afa9c65327f0a3975ff-0.
INFO 03-02 00:44:11 [logger.py:42] Received request cmpl-a32b98d3d8854a559d42f0182ce86b88-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:44:11 [async_llm.py:261] Added request cmpl-a32b98d3d8854a559d42f0182ce86b88-0.
INFO 03-02 00:44:12 [logger.py:42] Received request cmpl-89a79275bf524823ba1162c6a93bd817-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:44:12 [async_llm.py:261] Added request cmpl-89a79275bf524823ba1162c6a93bd817-0.
INFO 03-02 00:44:13 [logger.py:42] Received request cmpl-348837b19c204ed89fa17a5f4327265e-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:44:13 [async_llm.py:261] Added request cmpl-348837b19c204ed89fa17a5f4327265e-0.
INFO 03-02 00:44:14 [logger.py:42] Received request cmpl-e1028da2a7fc40d6bf7e39d8e7a94f46-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:44:14 [async_llm.py:261] Added request cmpl-e1028da2a7fc40d6bf7e39d8e7a94f46-0.
INFO 03-02 00:44:15 [logger.py:42] Received request cmpl-d04ad7e5d71e489cae2dc648749d2ed7-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:44:15 [async_llm.py:261] Added request cmpl-d04ad7e5d71e489cae2dc648749d2ed7-0.
INFO 03-02 00:44:16 [logger.py:42] Received request cmpl-b1d28e18ee2047778f97d962ef893d35-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:44:16 [async_llm.py:261] Added request cmpl-b1d28e18ee2047778f97d962ef893d35-0.
INFO 03-02 00:44:17 [logger.py:42] Received request cmpl-fbbd6a1a7db24033bd8439c0bc9dc5c8-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:44:17 [async_llm.py:261] Added request cmpl-fbbd6a1a7db24033bd8439c0bc9dc5c8-0.
INFO 03-02 00:44:18 [loggers.py:116] Engine 000: Avg prompt throughput: 27.9 tokens/s, Avg generation throughput: 4.5 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.6%
INFO 03-02 00:44:19 [logger.py:42] Received request cmpl-eb40dd6e1a384ab8ba4efa75994a03c0-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:44:19 [async_llm.py:261] Added request cmpl-eb40dd6e1a384ab8ba4efa75994a03c0-0.
INFO 03-02 00:44:20 [logger.py:42] Received request cmpl-d8c75e2694b94b839f908bcf198e4322-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:44:20 [async_llm.py:261] Added request cmpl-d8c75e2694b94b839f908bcf198e4322-0.
INFO 03-02 00:44:21 [logger.py:42] Received request cmpl-473ff6c65f6b470d9d3f4db5182e510e-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:44:21 [async_llm.py:261] Added request cmpl-473ff6c65f6b470d9d3f4db5182e510e-0.
INFO 03-02 00:44:22 [logger.py:42] Received request cmpl-77f0589fb71b472694a3c636e6bc7ee7-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:44:22 [async_llm.py:261] Added request cmpl-77f0589fb71b472694a3c636e6bc7ee7-0.
INFO 03-02 00:44:23 [logger.py:42] Received request cmpl-600ef8c0ce2f46c7af7dc964f674ac77-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:44:23 [async_llm.py:261] Added request cmpl-600ef8c0ce2f46c7af7dc964f674ac77-0.
INFO 03-02 00:44:24 [logger.py:42] Received request cmpl-c9d8f156d92245e59576961f759d3e66-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:44:24 [async_llm.py:261] Added request cmpl-c9d8f156d92245e59576961f759d3e66-0.
INFO 03-02 00:44:26 [logger.py:42] Received request cmpl-9b835f62bf0f4b6fa403d51da2021e82-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:44:26 [async_llm.py:261] Added request cmpl-9b835f62bf0f4b6fa403d51da2021e82-0.
INFO 03-02 00:44:27 [logger.py:42] Received request cmpl-e5103f4880394b1e9c4ac3e09edd9fe8-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:44:27 [async_llm.py:261] Added request cmpl-e5103f4880394b1e9c4ac3e09edd9fe8-0.
INFO 03-02 00:44:28 [loggers.py:116] Engine 000: Avg prompt throughput: 24.8 tokens/s, Avg generation throughput: 4.0 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.6%
INFO 03-02 00:44:28 [logger.py:42] Received request cmpl-8571fcfe0fa042dea775e3a4ff2b7dc4-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:44:28 [async_llm.py:261] Added request cmpl-8571fcfe0fa042dea775e3a4ff2b7dc4-0.
INFO 03-02 00:44:29 [logger.py:42] Received request cmpl-8419877c293b4401bef6118cbc2bf952-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:44:29 [async_llm.py:261] Added request cmpl-8419877c293b4401bef6118cbc2bf952-0.
INFO 03-02 00:44:30 [logger.py:42] Received request cmpl-9ba552ef99f348f3a48c8e08ae54a2fe-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:44:30 [async_llm.py:261] Added request cmpl-9ba552ef99f348f3a48c8e08ae54a2fe-0.
INFO 03-02 00:44:31 [logger.py:42] Received request cmpl-be9f71abf2524063aa95284e234da1a5-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:44:31 [async_llm.py:261] Added request cmpl-be9f71abf2524063aa95284e234da1a5-0.
INFO 03-02 00:44:32 [logger.py:42] Received request cmpl-aa57fb643f0247c5ab39262dbebea06e-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:44:32 [async_llm.py:261] Added request cmpl-aa57fb643f0247c5ab39262dbebea06e-0.
INFO 03-02 00:44:34 [logger.py:42] Received request cmpl-0026f72ec74f4e90bd63b28ebce7c655-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:44:34 [async_llm.py:261] Added request cmpl-0026f72ec74f4e90bd63b28ebce7c655-0.
INFO 03-02 00:44:35 [logger.py:42] Received request cmpl-d4b0395b9afd41e4b34dde6e9a016c03-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:44:35 [async_llm.py:261] Added request cmpl-d4b0395b9afd41e4b34dde6e9a016c03-0.
INFO 03-02 00:44:36 [logger.py:42] Received request cmpl-5007aab9cf014522b294fc5138af8f0b-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:44:36 [async_llm.py:261] Added request cmpl-5007aab9cf014522b294fc5138af8f0b-0.
INFO 03-02 00:44:37 [logger.py:42] Received request cmpl-b9026efa2432414a8d6a054e3fcd3490-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:44:37 [async_llm.py:261] Added request cmpl-b9026efa2432414a8d6a054e3fcd3490-0.
INFO 03-02 00:44:38 [loggers.py:116] Engine 000: Avg prompt throughput: 27.9 tokens/s, Avg generation throughput: 4.5 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.6%
INFO 03-02 00:44:38 [logger.py:42] Received request cmpl-e2cd44f3b98444a0b9954bef8b6070ac-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:44:38 [async_llm.py:261] Added request cmpl-e2cd44f3b98444a0b9954bef8b6070ac-0.
INFO 03-02 00:44:39 [logger.py:42] Received request cmpl-ae561f5edc2144a8bd897489b6029871-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:44:39 [async_llm.py:261] Added request cmpl-ae561f5edc2144a8bd897489b6029871-0.
INFO 03-02 00:44:41 [logger.py:42] Received request cmpl-e044b3550bd4478c92e20f4d198e146c-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:44:41 [async_llm.py:261] Added request cmpl-e044b3550bd4478c92e20f4d198e146c-0.
INFO 03-02 00:44:42 [logger.py:42] Received request cmpl-00320f4719634a1392ead84701a7d33e-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:44:42 [async_llm.py:261] Added request cmpl-00320f4719634a1392ead84701a7d33e-0.
INFO 03-02 00:44:43 [logger.py:42] Received request cmpl-8dd62187503d4381943a3985e826d486-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:44:43 [async_llm.py:261] Added request cmpl-8dd62187503d4381943a3985e826d486-0.
INFO 03-02 00:44:44 [logger.py:42] Received request cmpl-24b59561f3a54e3b945b2f5fb03f6d4b-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:44:44 [async_llm.py:261] Added request cmpl-24b59561f3a54e3b945b2f5fb03f6d4b-0.
INFO 03-02 00:44:45 [logger.py:42] Received request cmpl-a10c686117e24f9eb28d8361367e1197-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:44:45 [async_llm.py:261] Added request cmpl-a10c686117e24f9eb28d8361367e1197-0.
INFO 03-02 00:44:46 [logger.py:42] Received request cmpl-6010a582d8f0479ca9a964d2966739dd-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:44:46 [async_llm.py:261] Added request cmpl-6010a582d8f0479ca9a964d2966739dd-0.
INFO 03-02 00:44:47 [logger.py:42] Received request cmpl-876ad5f718a440849084094c7f801e6d-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:44:47 [async_llm.py:261] Added request cmpl-876ad5f718a440849084094c7f801e6d-0.
INFO 03-02 00:44:48 [loggers.py:116] Engine 000: Avg prompt throughput: 27.9 tokens/s, Avg generation throughput: 4.5 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.6%
INFO 03-02 00:44:49 [logger.py:42] Received request cmpl-a8bf0080fa1c4d35ad909c5d52ec3cde-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:44:49 [async_llm.py:261] Added request cmpl-a8bf0080fa1c4d35ad909c5d52ec3cde-0.
INFO 03-02 00:44:50 [logger.py:42] Received request cmpl-8f9915f40a024158b0d56f9a70b54f3b-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:44:50 [async_llm.py:261] Added request cmpl-8f9915f40a024158b0d56f9a70b54f3b-0.
INFO 03-02 00:44:51 [logger.py:42] Received request cmpl-b38253f9ae5f4ac6b978d9aedc461503-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:44:51 [async_llm.py:261] Added request cmpl-b38253f9ae5f4ac6b978d9aedc461503-0.
INFO 03-02 00:44:52 [logger.py:42] Received request cmpl-fa8211d6c9104732b870c07cad7d36da-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:44:52 [async_llm.py:261] Added request cmpl-fa8211d6c9104732b870c07cad7d36da-0.
INFO 03-02 00:44:53 [logger.py:42] Received request cmpl-14cba1e576be48a48d826b5314d4ef9f-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:44:53 [async_llm.py:261] Added request cmpl-14cba1e576be48a48d826b5314d4ef9f-0.
INFO 03-02 00:44:54 [logger.py:42] Received request cmpl-1d70d440efca4fc6a56e244c796b41e2-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:44:54 [async_llm.py:261] Added request cmpl-1d70d440efca4fc6a56e244c796b41e2-0.
INFO 03-02 00:44:55 [logger.py:42] Received request cmpl-475e35cacc81448bbeb049af171f24af-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:44:55 [async_llm.py:261] Added request cmpl-475e35cacc81448bbeb049af171f24af-0.
INFO 03-02 00:44:57 [logger.py:42] Received request cmpl-d150573b30c14c0885b7f5084180d006-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:44:57 [async_llm.py:261] Added request cmpl-d150573b30c14c0885b7f5084180d006-0.
INFO 03-02 00:44:58 [loggers.py:116] Engine 000: Avg prompt throughput: 24.8 tokens/s, Avg generation throughput: 4.0 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.6%
INFO 03-02 00:44:58 [logger.py:42] Received request cmpl-e1f3a403d8c841ca81c8bd021d462a44-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:44:58 [async_llm.py:261] Added request cmpl-e1f3a403d8c841ca81c8bd021d462a44-0.
INFO 03-02 00:44:59 [logger.py:42] Received request cmpl-233dff663b584c5eaebe67156794e978-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:44:59 [async_llm.py:261] Added request cmpl-233dff663b584c5eaebe67156794e978-0.
INFO 03-02 00:45:00 [logger.py:42] Received request cmpl-e18ed3f1afb74025ab9bed4d278274aa-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:45:00 [async_llm.py:261] Added request cmpl-e18ed3f1afb74025ab9bed4d278274aa-0.
INFO 03-02 00:45:01 [logger.py:42] Received request cmpl-97fdf1accb9d49599a92da6b41812b0e-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:45:01 [async_llm.py:261] Added request cmpl-97fdf1accb9d49599a92da6b41812b0e-0.
INFO 03-02 00:45:02 [logger.py:42] Received request cmpl-94649c9eb22c41049e6379c453ce2e20-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:45:02 [async_llm.py:261] Added request cmpl-94649c9eb22c41049e6379c453ce2e20-0.
INFO 03-02 00:45:04 [logger.py:42] Received request cmpl-6ee0e6056a64426dbbfb1021d8b3adaa-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:45:04 [async_llm.py:261] Added request cmpl-6ee0e6056a64426dbbfb1021d8b3adaa-0.
INFO 03-02 00:45:05 [logger.py:42] Received request cmpl-9900229f3fcc45faa78081ea4c007d27-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:45:05 [async_llm.py:261] Added request cmpl-9900229f3fcc45faa78081ea4c007d27-0.
INFO 03-02 00:45:06 [logger.py:42] Received request cmpl-9e29e5d0e89445a3ac57095e0a4d8858-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:45:06 [async_llm.py:261] Added request cmpl-9e29e5d0e89445a3ac57095e0a4d8858-0.
INFO 03-02 00:45:07 [logger.py:42] Received request cmpl-e11d38876e5d49c08f0f4a220eed2bcc-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:45:07 [async_llm.py:261] Added request cmpl-e11d38876e5d49c08f0f4a220eed2bcc-0.
INFO 03-02 00:45:08 [loggers.py:116] Engine 000: Avg prompt throughput: 27.9 tokens/s, Avg generation throughput: 4.5 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.6%
INFO 03-02 00:45:08 [logger.py:42] Received request cmpl-da059684942a4147b13c5770da426896-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:45:08 [async_llm.py:261] Added request cmpl-da059684942a4147b13c5770da426896-0.
INFO 03-02 00:45:09 [logger.py:42] Received request cmpl-5fde074fa3f94175a17c7ef01682d006-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:45:09 [async_llm.py:261] Added request cmpl-5fde074fa3f94175a17c7ef01682d006-0.
INFO 03-02 00:45:10 [logger.py:42] Received request cmpl-791d99ce114c4f08994ee0d4f9b17316-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:45:10 [async_llm.py:261] Added request cmpl-791d99ce114c4f08994ee0d4f9b17316-0.
INFO 03-02 00:45:12 [logger.py:42] Received request cmpl-6edfab3d84bc406f9ebab01d719638a4-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:45:12 [async_llm.py:261] Added request cmpl-6edfab3d84bc406f9ebab01d719638a4-0.
INFO 03-02 00:45:13 [logger.py:42] Received request cmpl-41b1d4287cf249b48a781e5a61866f5e-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:45:13 [async_llm.py:261] Added request cmpl-41b1d4287cf249b48a781e5a61866f5e-0.
INFO 03-02 00:45:14 [logger.py:42] Received request cmpl-823c9b8eea8a4ecf89f7c4cd7f5f1318-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:45:14 [async_llm.py:261] Added request cmpl-823c9b8eea8a4ecf89f7c4cd7f5f1318-0.
INFO 03-02 00:45:15 [logger.py:42] Received request cmpl-3c6902db78574af59476b8cbe7f5f081-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:45:15 [async_llm.py:261] Added request cmpl-3c6902db78574af59476b8cbe7f5f081-0.
INFO 03-02 00:45:16 [logger.py:42] Received request cmpl-8d62326ac9f44f6cabd9f3d2c26480f5-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:45:16 [async_llm.py:261] Added request cmpl-8d62326ac9f44f6cabd9f3d2c26480f5-0.
INFO 03-02 00:45:17 [logger.py:42] Received request cmpl-d157ed39589241f2b5fbe661bf5273ee-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:45:17 [async_llm.py:261] Added request cmpl-d157ed39589241f2b5fbe661bf5273ee-0.
INFO 03-02 00:45:18 [loggers.py:116] Engine 000: Avg prompt throughput: 27.9 tokens/s, Avg generation throughput: 4.5 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.6%
INFO 03-02 00:45:19 [logger.py:42] Received request cmpl-f619b7bc962344d3bdcdf6408407eb51-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:45:19 [async_llm.py:261] Added request cmpl-f619b7bc962344d3bdcdf6408407eb51-0.
INFO 03-02 00:45:20 [logger.py:42] Received request cmpl-12ee32dde3664ea7b65d0daf3a255a7e-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:45:20 [async_llm.py:261] Added request cmpl-12ee32dde3664ea7b65d0daf3a255a7e-0.
INFO 03-02 00:45:21 [logger.py:42] Received request cmpl-a58957112afc4f43923add0fd8e2dd50-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:45:21 [async_llm.py:261] Added request cmpl-a58957112afc4f43923add0fd8e2dd50-0.
INFO 03-02 00:45:22 [logger.py:42] Received request cmpl-8fef6bde18d14b9cbe974e0717bf9255-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:45:22 [async_llm.py:261] Added request cmpl-8fef6bde18d14b9cbe974e0717bf9255-0.
INFO 03-02 00:45:23 [logger.py:42] Received request cmpl-3998f4df238c4eedb60aab059569beb8-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:45:23 [async_llm.py:261] Added request cmpl-3998f4df238c4eedb60aab059569beb8-0.
INFO 03-02 00:45:24 [logger.py:42] Received request cmpl-0f7a75295ca746948dcc115ca588f7bb-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:45:24 [async_llm.py:261] Added request cmpl-0f7a75295ca746948dcc115ca588f7bb-0.
INFO 03-02 00:45:25 [logger.py:42] Received request cmpl-e4b869db81944536a74d895d36eb632a-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:45:25 [async_llm.py:261] Added request cmpl-e4b869db81944536a74d895d36eb632a-0.
INFO 03-02 00:45:27 [logger.py:42] Received request cmpl-a930bc90eb004cef934074ec19a70148-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:45:27 [async_llm.py:261] Added request cmpl-a930bc90eb004cef934074ec19a70148-0.
INFO 03-02 00:45:28 [loggers.py:116] Engine 000: Avg prompt throughput: 24.8 tokens/s, Avg generation throughput: 4.0 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.6%
INFO 03-02 00:45:28 [logger.py:42] Received request cmpl-7d104bf17fd248529e71a4082f18e017-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:45:28 [async_llm.py:261] Added request cmpl-7d104bf17fd248529e71a4082f18e017-0.
INFO 03-02 00:45:29 [logger.py:42] Received request cmpl-be2b48a653354aa898c39af433f4ba76-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:45:29 [async_llm.py:261] Added request cmpl-be2b48a653354aa898c39af433f4ba76-0.
INFO 03-02 00:45:30 [logger.py:42] Received request cmpl-032f686af007464c89e7bcc3a5622e22-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:45:30 [async_llm.py:261] Added request cmpl-032f686af007464c89e7bcc3a5622e22-0.
INFO 03-02 00:45:31 [logger.py:42] Received request cmpl-fca489da01374b17bc890d9c46b89fa5-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:45:31 [async_llm.py:261] Added request cmpl-fca489da01374b17bc890d9c46b89fa5-0.
INFO 03-02 00:45:32 [logger.py:42] Received request cmpl-fb08cc0196214cb0b4cbf9ee3facfb63-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:45:32 [async_llm.py:261] Added request cmpl-fb08cc0196214cb0b4cbf9ee3facfb63-0.
INFO 03-02 00:45:34 [logger.py:42] Received request cmpl-423102b665ff46939b4341f316eea8ea-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:45:34 [async_llm.py:261] Added request cmpl-423102b665ff46939b4341f316eea8ea-0.
INFO 03-02 00:45:35 [logger.py:42] Received request cmpl-ff43c5476fe24c4595caa4b980af9ed7-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:45:35 [async_llm.py:261] Added request cmpl-ff43c5476fe24c4595caa4b980af9ed7-0.
INFO 03-02 00:45:36 [logger.py:42] Received request cmpl-b8d49113498a499480db0c4cf0b47023-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:45:36 [async_llm.py:261] Added request cmpl-b8d49113498a499480db0c4cf0b47023-0.
INFO 03-02 00:45:37 [logger.py:42] Received request cmpl-4bff3659028c4da390e7817acbbb0df7-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:45:37 [async_llm.py:261] Added request cmpl-4bff3659028c4da390e7817acbbb0df7-0.
INFO 03-02 00:45:38 [loggers.py:116] Engine 000: Avg prompt throughput: 27.9 tokens/s, Avg generation throughput: 4.5 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.6%
INFO 03-02 00:45:38 [logger.py:42] Received request cmpl-7528ee291e6149da8986c376648dc447-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:45:38 [async_llm.py:261] Added request cmpl-7528ee291e6149da8986c376648dc447-0.
INFO 03-02 00:45:39 [logger.py:42] Received request cmpl-36eaf1cc28fd404cb2ca7eb4d2468deb-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:45:39 [async_llm.py:261] Added request cmpl-36eaf1cc28fd404cb2ca7eb4d2468deb-0.
INFO 03-02 00:45:40 [logger.py:42] Received request cmpl-d0ce5bc6a6944c0680d32f3309e5cf59-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:45:40 [async_llm.py:261] Added request cmpl-d0ce5bc6a6944c0680d32f3309e5cf59-0.
INFO 03-02 00:45:42 [logger.py:42] Received request cmpl-ea60fb0df2e24768b207bb6ce6683c42-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:45:42 [async_llm.py:261] Added request cmpl-ea60fb0df2e24768b207bb6ce6683c42-0.
INFO 03-02 00:45:43 [logger.py:42] Received request cmpl-da830f488e50468ab95cb278d7689898-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:45:43 [async_llm.py:261] Added request cmpl-da830f488e50468ab95cb278d7689898-0.
INFO 03-02 00:45:44 [logger.py:42] Received request cmpl-58b64f6ed9004c09b405a806c336b521-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:45:44 [async_llm.py:261] Added request cmpl-58b64f6ed9004c09b405a806c336b521-0.
INFO 03-02 00:45:45 [logger.py:42] Received request cmpl-4706c0ade68c442b87cf56ecbdb1edfd-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:45:45 [async_llm.py:261] Added request cmpl-4706c0ade68c442b87cf56ecbdb1edfd-0.
INFO 03-02 00:45:46 [logger.py:42] Received request cmpl-87435d80a0824c4da3029904a28ec3c5-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:45:46 [async_llm.py:261] Added request cmpl-87435d80a0824c4da3029904a28ec3c5-0.
INFO 03-02 00:45:47 [logger.py:42] Received request cmpl-b6e4859394af4d1daf798494aaf33855-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:45:47 [async_llm.py:261] Added request cmpl-b6e4859394af4d1daf798494aaf33855-0.
INFO 03-02 00:45:48 [loggers.py:116] Engine 000: Avg prompt throughput: 27.9 tokens/s, Avg generation throughput: 4.5 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.6%
INFO 03-02 00:45:49 [logger.py:42] Received request cmpl-3c74a56617644949868c43c7f5547806-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:45:49 [async_llm.py:261] Added request cmpl-3c74a56617644949868c43c7f5547806-0.
INFO 03-02 00:45:50 [logger.py:42] Received request cmpl-5dfa444b259249b88032aff30dc0e04d-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:45:50 [async_llm.py:261] Added request cmpl-5dfa444b259249b88032aff30dc0e04d-0.
INFO 03-02 00:45:51 [logger.py:42] Received request cmpl-e3f715787ef94783a184261cdd9a67ad-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:45:51 [async_llm.py:261] Added request cmpl-e3f715787ef94783a184261cdd9a67ad-0.
INFO 03-02 00:45:52 [logger.py:42] Received request cmpl-77af3c9ebeb34ae0894a412cdcc44cde-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:45:52 [async_llm.py:261] Added request cmpl-77af3c9ebeb34ae0894a412cdcc44cde-0.
INFO 03-02 00:45:53 [logger.py:42] Received request cmpl-fa89e759ccc9466e81d102bc4d363416-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:45:53 [async_llm.py:261] Added request cmpl-fa89e759ccc9466e81d102bc4d363416-0.
INFO 03-02 00:45:54 [logger.py:42] Received request cmpl-4a67b60522a34708977843042227b42e-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:45:54 [async_llm.py:261] Added request cmpl-4a67b60522a34708977843042227b42e-0.
INFO 03-02 00:45:55 [logger.py:42] Received request cmpl-809bd2526ec54b5bb9736d38702a8fad-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:45:55 [async_llm.py:261] Added request cmpl-809bd2526ec54b5bb9736d38702a8fad-0.
INFO 03-02 00:45:57 [logger.py:42] Received request cmpl-3c7c0be7f7624fd49a7d4ef3864306bc-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:45:57 [async_llm.py:261] Added request cmpl-3c7c0be7f7624fd49a7d4ef3864306bc-0.
INFO 03-02 00:45:58 [loggers.py:116] Engine 000: Avg prompt throughput: 24.8 tokens/s, Avg generation throughput: 4.0 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.6%
INFO 03-02 00:45:58 [logger.py:42] Received request cmpl-a59328e06afb460e83bf25c594aba4e1-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:45:58 [async_llm.py:261] Added request cmpl-a59328e06afb460e83bf25c594aba4e1-0.
INFO 03-02 00:45:59 [logger.py:42] Received request cmpl-823d6d1bebd64b25a0378c69eec4f0a9-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:45:59 [async_llm.py:261] Added request cmpl-823d6d1bebd64b25a0378c69eec4f0a9-0.
INFO 03-02 00:46:00 [logger.py:42] Received request cmpl-a32fbed49e734ffea8a684d2898f9355-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:46:00 [async_llm.py:261] Added request cmpl-a32fbed49e734ffea8a684d2898f9355-0.
INFO 03-02 00:46:01 [logger.py:42] Received request cmpl-0c80a9b6124b4d81ad87a6afd2063a71-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:46:01 [async_llm.py:261] Added request cmpl-0c80a9b6124b4d81ad87a6afd2063a71-0.
INFO 03-02 00:46:02 [logger.py:42] Received request cmpl-2f790d1f67a74491a2ef87537362e854-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:46:02 [async_llm.py:261] Added request cmpl-2f790d1f67a74491a2ef87537362e854-0.
INFO 03-02 00:46:04 [logger.py:42] Received request cmpl-f1b856c69fc74fb7afd6ff788851e58a-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:46:04 [async_llm.py:261] Added request cmpl-f1b856c69fc74fb7afd6ff788851e58a-0.
INFO 03-02 00:46:05 [logger.py:42] Received request cmpl-cc4329e0480341ac98c85c45628d0534-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:46:05 [async_llm.py:261] Added request cmpl-cc4329e0480341ac98c85c45628d0534-0.
INFO 03-02 00:46:06 [logger.py:42] Received request cmpl-63efee7d8bbe434882367a818246e84b-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:46:06 [async_llm.py:261] Added request cmpl-63efee7d8bbe434882367a818246e84b-0.
INFO 03-02 00:46:07 [logger.py:42] Received request cmpl-a13b5158b9e640909ebff3a3ee597b02-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:46:07 [async_llm.py:261] Added request cmpl-a13b5158b9e640909ebff3a3ee597b02-0.
INFO 03-02 00:46:08 [loggers.py:116] Engine 000: Avg prompt throughput: 27.9 tokens/s, Avg generation throughput: 4.5 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.6%
INFO 03-02 00:46:08 [logger.py:42] Received request cmpl-27b5199d4dd5492dadc1263b2046e99f-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:46:08 [async_llm.py:261] Added request cmpl-27b5199d4dd5492dadc1263b2046e99f-0.
INFO 03-02 00:46:09 [logger.py:42] Received request cmpl-c6622ca6eab34c03b62a4f70abdb37a1-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:46:09 [async_llm.py:261] Added request cmpl-c6622ca6eab34c03b62a4f70abdb37a1-0.
INFO 03-02 00:46:10 [logger.py:42] Received request cmpl-b471dbc319c64c46aa0e492bebf89d3b-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:46:10 [async_llm.py:261] Added request cmpl-b471dbc319c64c46aa0e492bebf89d3b-0.
INFO 03-02 00:46:12 [logger.py:42] Received request cmpl-b2195c099af14981a75475411fd38435-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:46:12 [async_llm.py:261] Added request cmpl-b2195c099af14981a75475411fd38435-0.
INFO 03-02 00:46:13 [logger.py:42] Received request cmpl-59fd80c14c4449d591ae65f7b4d1379e-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:46:13 [async_llm.py:261] Added request cmpl-59fd80c14c4449d591ae65f7b4d1379e-0.
INFO 03-02 00:46:14 [logger.py:42] Received request cmpl-dc2c8e943201482a826f71ca40d33733-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:46:14 [async_llm.py:261] Added request cmpl-dc2c8e943201482a826f71ca40d33733-0.
INFO 03-02 00:46:15 [logger.py:42] Received request cmpl-ff05450e2e38443fb6926ebc95602141-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:46:15 [async_llm.py:261] Added request cmpl-ff05450e2e38443fb6926ebc95602141-0.
INFO 03-02 00:46:16 [logger.py:42] Received request cmpl-24d28efee8a64ae29fb1b4bfe8308869-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:46:16 [async_llm.py:261] Added request cmpl-24d28efee8a64ae29fb1b4bfe8308869-0.
INFO 03-02 00:46:17 [logger.py:42] Received request cmpl-53c46101a3cb4005868bac56ffe6a844-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:46:17 [async_llm.py:261] Added request cmpl-53c46101a3cb4005868bac56ffe6a844-0.
INFO 03-02 00:46:18 [loggers.py:116] Engine 000: Avg prompt throughput: 27.9 tokens/s, Avg generation throughput: 4.5 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.6%
INFO 03-02 00:46:19 [logger.py:42] Received request cmpl-9eb6b5b191974ebcbbaf559ae7af2816-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:46:19 [async_llm.py:261] Added request cmpl-9eb6b5b191974ebcbbaf559ae7af2816-0.
INFO 03-02 00:46:20 [logger.py:42] Received request cmpl-23c050927c074b1795cbe16bbcbc9bd5-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:46:20 [async_llm.py:261] Added request cmpl-23c050927c074b1795cbe16bbcbc9bd5-0.
INFO 03-02 00:46:21 [logger.py:42] Received request cmpl-ccfa75206fdb48fe843353104af3d075-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:46:21 [async_llm.py:261] Added request cmpl-ccfa75206fdb48fe843353104af3d075-0.
INFO 03-02 00:46:22 [logger.py:42] Received request cmpl-cb51361b028c4691b6d4eed214993d9a-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:46:22 [async_llm.py:261] Added request cmpl-cb51361b028c4691b6d4eed214993d9a-0.
INFO 03-02 00:46:23 [logger.py:42] Received request cmpl-5c04bfe640a64b998cb6194af5de7861-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:46:23 [async_llm.py:261] Added request cmpl-5c04bfe640a64b998cb6194af5de7861-0.
INFO 03-02 00:46:24 [logger.py:42] Received request cmpl-f9212c75d4464fc3a3e98751def1578f-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:46:24 [async_llm.py:261] Added request cmpl-f9212c75d4464fc3a3e98751def1578f-0.
INFO 03-02 00:46:25 [logger.py:42] Received request cmpl-79ff23ed1bb94002bdc79880a08bdb36-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:46:25 [async_llm.py:261] Added request cmpl-79ff23ed1bb94002bdc79880a08bdb36-0.
INFO 03-02 00:46:27 [logger.py:42] Received request cmpl-322d76cc26f9437f952e297d507bbbe4-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:46:27 [async_llm.py:261] Added request cmpl-322d76cc26f9437f952e297d507bbbe4-0.
INFO 03-02 00:46:28 [loggers.py:116] Engine 000: Avg prompt throughput: 24.8 tokens/s, Avg generation throughput: 4.0 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.6%
INFO 03-02 00:46:28 [logger.py:42] Received request cmpl-542b0d2232b84900baed895846484086-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:46:28 [async_llm.py:261] Added request cmpl-542b0d2232b84900baed895846484086-0.
INFO 03-02 00:46:29 [logger.py:42] Received request cmpl-a80d2cfe65e445698fa64ef7a41d759f-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:46:29 [async_llm.py:261] Added request cmpl-a80d2cfe65e445698fa64ef7a41d759f-0.
INFO 03-02 00:46:30 [logger.py:42] Received request cmpl-75cfeb36f7ec4450a4b492a9372287a0-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:46:30 [async_llm.py:261] Added request cmpl-75cfeb36f7ec4450a4b492a9372287a0-0.
INFO 03-02 00:46:31 [logger.py:42] Received request cmpl-3ba364e7cedd4e6b92b8699bf9c90ea4-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:46:31 [async_llm.py:261] Added request cmpl-3ba364e7cedd4e6b92b8699bf9c90ea4-0.
INFO 03-02 00:46:32 [logger.py:42] Received request cmpl-c66db701a9954a3298176f881e5b73d6-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:46:32 [async_llm.py:261] Added request cmpl-c66db701a9954a3298176f881e5b73d6-0.
INFO 03-02 00:46:34 [logger.py:42] Received request cmpl-40da45c8ea66497987e62a6658868546-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:46:34 [async_llm.py:261] Added request cmpl-40da45c8ea66497987e62a6658868546-0.
INFO 03-02 00:46:35 [logger.py:42] Received request cmpl-556eaac40c584004898d97c2acee7843-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:46:35 [async_llm.py:261] Added request cmpl-556eaac40c584004898d97c2acee7843-0.
INFO 03-02 00:46:36 [logger.py:42] Received request cmpl-fb6ad4d0e53b4b45b2c9a20573ad2355-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:46:36 [async_llm.py:261] Added request cmpl-fb6ad4d0e53b4b45b2c9a20573ad2355-0.
INFO 03-02 00:46:37 [logger.py:42] Received request cmpl-2166d72116154029b05427011315df92-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:46:37 [async_llm.py:261] Added request cmpl-2166d72116154029b05427011315df92-0.
INFO 03-02 00:46:38 [loggers.py:116] Engine 000: Avg prompt throughput: 27.9 tokens/s, Avg generation throughput: 4.5 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.6%
INFO 03-02 00:46:38 [logger.py:42] Received request cmpl-e2e318182f604ef4a14851d58b128fde-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:46:38 [async_llm.py:261] Added request cmpl-e2e318182f604ef4a14851d58b128fde-0.
INFO 03-02 00:46:39 [logger.py:42] Received request cmpl-deeac2f5d4314e55948fea34d8478698-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:46:39 [async_llm.py:261] Added request cmpl-deeac2f5d4314e55948fea34d8478698-0.
INFO 03-02 00:46:40 [logger.py:42] Received request cmpl-5f3d5f10a4124993b14d19fd8fd1f868-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:46:40 [async_llm.py:261] Added request cmpl-5f3d5f10a4124993b14d19fd8fd1f868-0.
INFO 03-02 00:46:42 [logger.py:42] Received request cmpl-ec5a1c80bce345f8b0f07f63cff19594-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:46:42 [async_llm.py:261] Added request cmpl-ec5a1c80bce345f8b0f07f63cff19594-0.
INFO 03-02 00:46:43 [logger.py:42] Received request cmpl-a730897e16b94b798b8aa548fd0619ed-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:46:43 [async_llm.py:261] Added request cmpl-a730897e16b94b798b8aa548fd0619ed-0.
INFO 03-02 00:46:44 [logger.py:42] Received request cmpl-ff4b2d9e01884872a453426c3f3cfb80-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:46:44 [async_llm.py:261] Added request cmpl-ff4b2d9e01884872a453426c3f3cfb80-0.
INFO 03-02 00:46:45 [logger.py:42] Received request cmpl-7abcc21ca2df486994790dce54b2b0de-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:46:45 [async_llm.py:261] Added request cmpl-7abcc21ca2df486994790dce54b2b0de-0.
INFO 03-02 00:46:46 [logger.py:42] Received request cmpl-120ac8b94edb428f878e1af70a5b42c3-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:46:46 [async_llm.py:261] Added request cmpl-120ac8b94edb428f878e1af70a5b42c3-0.
INFO 03-02 00:46:47 [logger.py:42] Received request cmpl-5b3f2dc9606f4ea2826d9a44c265b757-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:46:47 [async_llm.py:261] Added request cmpl-5b3f2dc9606f4ea2826d9a44c265b757-0.
INFO 03-02 00:46:48 [loggers.py:116] Engine 000: Avg prompt throughput: 27.9 tokens/s, Avg generation throughput: 4.5 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.6%
INFO 03-02 00:46:49 [logger.py:42] Received request cmpl-c5ef78caf173440a88e81b1169cf8d5a-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:46:49 [async_llm.py:261] Added request cmpl-c5ef78caf173440a88e81b1169cf8d5a-0.
INFO 03-02 00:46:50 [logger.py:42] Received request cmpl-d6dee808597d43a98f4f20212da3782f-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:46:50 [async_llm.py:261] Added request cmpl-d6dee808597d43a98f4f20212da3782f-0.
INFO 03-02 00:46:51 [logger.py:42] Received request cmpl-c4c5de667d1c422b8c3b914684eb14bf-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:46:51 [async_llm.py:261] Added request cmpl-c4c5de667d1c422b8c3b914684eb14bf-0.
INFO 03-02 00:46:52 [logger.py:42] Received request cmpl-5e6d444d054444d7a20f7cd375b40835-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:46:52 [async_llm.py:261] Added request cmpl-5e6d444d054444d7a20f7cd375b40835-0.
INFO 03-02 00:46:53 [logger.py:42] Received request cmpl-bec10dcbd0354d0c87c223b47805ecba-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:46:53 [async_llm.py:261] Added request cmpl-bec10dcbd0354d0c87c223b47805ecba-0.
INFO 03-02 00:46:54 [logger.py:42] Received request cmpl-3d5f03f1970f41859ae987ee56e8eae5-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:46:54 [async_llm.py:261] Added request cmpl-3d5f03f1970f41859ae987ee56e8eae5-0.
INFO 03-02 00:46:55 [logger.py:42] Received request cmpl-950726672a364ddda18c141767314adc-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:46:55 [async_llm.py:261] Added request cmpl-950726672a364ddda18c141767314adc-0.
INFO 03-02 00:46:57 [logger.py:42] Received request cmpl-aa0baa9cc3e14357929e6de7b81b0d17-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:46:57 [async_llm.py:261] Added request cmpl-aa0baa9cc3e14357929e6de7b81b0d17-0.
INFO 03-02 00:46:58 [loggers.py:116] Engine 000: Avg prompt throughput: 24.8 tokens/s, Avg generation throughput: 4.0 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.6%
INFO 03-02 00:46:58 [logger.py:42] Received request cmpl-d2b507a814314355ab38fd26c2c115ce-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:46:58 [async_llm.py:261] Added request cmpl-d2b507a814314355ab38fd26c2c115ce-0.
INFO 03-02 00:46:59 [logger.py:42] Received request cmpl-831c3d799978409bbadd5e412aa54989-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:46:59 [async_llm.py:261] Added request cmpl-831c3d799978409bbadd5e412aa54989-0.
INFO 03-02 00:47:00 [logger.py:42] Received request cmpl-038dbb7cce5d4560bc4cbf1a46a74462-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:47:00 [async_llm.py:261] Added request cmpl-038dbb7cce5d4560bc4cbf1a46a74462-0.
INFO 03-02 00:47:01 [logger.py:42] Received request cmpl-2b1abc4fc575494eb44ef2ac1d23c3df-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:47:01 [async_llm.py:261] Added request cmpl-2b1abc4fc575494eb44ef2ac1d23c3df-0.
INFO 03-02 00:47:02 [logger.py:42] Received request cmpl-e7f574c1d85545e7949b2e4bb86cba17-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:47:02 [async_llm.py:261] Added request cmpl-e7f574c1d85545e7949b2e4bb86cba17-0.
INFO 03-02 00:47:04 [logger.py:42] Received request cmpl-1c483f266ec74fa9ac2f9279f63374c4-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:47:04 [async_llm.py:261] Added request cmpl-1c483f266ec74fa9ac2f9279f63374c4-0.
INFO 03-02 00:47:05 [logger.py:42] Received request cmpl-b5832873c5e645fcaaf2f141bbb566d5-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:47:05 [async_llm.py:261] Added request cmpl-b5832873c5e645fcaaf2f141bbb566d5-0.
INFO 03-02 00:47:06 [logger.py:42] Received request cmpl-6ea221080b104f6eafb638e592455831-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:47:06 [async_llm.py:261] Added request cmpl-6ea221080b104f6eafb638e592455831-0.
INFO 03-02 00:47:07 [logger.py:42] Received request cmpl-35cefde52fd54a839bf610a6397bbc8e-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:47:07 [async_llm.py:261] Added request cmpl-35cefde52fd54a839bf610a6397bbc8e-0.
INFO 03-02 00:47:08 [loggers.py:116] Engine 000: Avg prompt throughput: 27.9 tokens/s, Avg generation throughput: 4.5 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.6%
INFO 03-02 00:47:08 [logger.py:42] Received request cmpl-c2b1e831ca7440ebb7f1c4ca1ac76422-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:47:08 [async_llm.py:261] Added request cmpl-c2b1e831ca7440ebb7f1c4ca1ac76422-0.
INFO 03-02 00:47:09 [logger.py:42] Received request cmpl-1945c567eae447b29c98414a8d276d91-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:47:09 [async_llm.py:261] Added request cmpl-1945c567eae447b29c98414a8d276d91-0.
INFO 03-02 00:47:10 [logger.py:42] Received request cmpl-011c06b4973e4481992caa75006a872e-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:47:10 [async_llm.py:261] Added request cmpl-011c06b4973e4481992caa75006a872e-0.
INFO 03-02 00:47:12 [logger.py:42] Received request cmpl-54521778a5ae4440a65bc1bb97bd23db-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:47:12 [async_llm.py:261] Added request cmpl-54521778a5ae4440a65bc1bb97bd23db-0.
INFO 03-02 00:47:13 [logger.py:42] Received request cmpl-e2abdda3f5fd431b853c79f6525ac1f1-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:47:13 [async_llm.py:261] Added request cmpl-e2abdda3f5fd431b853c79f6525ac1f1-0.
INFO 03-02 00:47:14 [logger.py:42] Received request cmpl-60c297289db649d5bafb37534ae74060-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:47:14 [async_llm.py:261] Added request cmpl-60c297289db649d5bafb37534ae74060-0.
INFO 03-02 00:47:15 [logger.py:42] Received request cmpl-341d398fa1064c71aaa5648b592e5f09-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:47:15 [async_llm.py:261] Added request cmpl-341d398fa1064c71aaa5648b592e5f09-0.
INFO 03-02 00:47:16 [logger.py:42] Received request cmpl-89a0f1c34a4848428a0a1a1b06f7eaca-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:47:16 [async_llm.py:261] Added request cmpl-89a0f1c34a4848428a0a1a1b06f7eaca-0.
INFO 03-02 00:47:17 [logger.py:42] Received request cmpl-cdabb4d4fd834bcc86d8f220716a1bf1-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:47:17 [async_llm.py:261] Added request cmpl-cdabb4d4fd834bcc86d8f220716a1bf1-0.
INFO 03-02 00:47:18 [loggers.py:116] Engine 000: Avg prompt throughput: 27.9 tokens/s, Avg generation throughput: 4.5 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.6%
INFO 03-02 00:47:19 [logger.py:42] Received request cmpl-7a7c5731c96d4743a039b6fe0ade63db-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:47:19 [async_llm.py:261] Added request cmpl-7a7c5731c96d4743a039b6fe0ade63db-0.
INFO 03-02 00:47:20 [logger.py:42] Received request cmpl-f16bd727c9d94a2593f6d997fcb86c72-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:47:20 [async_llm.py:261] Added request cmpl-f16bd727c9d94a2593f6d997fcb86c72-0.
INFO 03-02 00:47:21 [logger.py:42] Received request cmpl-f718789f7b3441d79892f8b6266cad6f-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:47:21 [async_llm.py:261] Added request cmpl-f718789f7b3441d79892f8b6266cad6f-0.
INFO 03-02 00:47:22 [logger.py:42] Received request cmpl-e969210e32d94260af055b1426cc5439-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:47:22 [async_llm.py:261] Added request cmpl-e969210e32d94260af055b1426cc5439-0.
INFO 03-02 00:47:23 [logger.py:42] Received request cmpl-e79898a3b8254e668b8e093630e1cda9-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:47:23 [async_llm.py:261] Added request cmpl-e79898a3b8254e668b8e093630e1cda9-0.
INFO 03-02 00:47:24 [logger.py:42] Received request cmpl-956c40a87fda4eb292a5528a93dc456f-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:47:24 [async_llm.py:261] Added request cmpl-956c40a87fda4eb292a5528a93dc456f-0.
INFO 03-02 00:47:25 [logger.py:42] Received request cmpl-443c2f5586ac404f8c0aa00a65448655-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:47:25 [async_llm.py:261] Added request cmpl-443c2f5586ac404f8c0aa00a65448655-0.
INFO 03-02 00:47:27 [logger.py:42] Received request cmpl-0347e031ccae4e31979b497abd0ee520-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:47:27 [async_llm.py:261] Added request cmpl-0347e031ccae4e31979b497abd0ee520-0.
INFO 03-02 00:47:28 [loggers.py:116] Engine 000: Avg prompt throughput: 24.8 tokens/s, Avg generation throughput: 4.0 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.6%
INFO 03-02 00:47:28 [logger.py:42] Received request cmpl-6051f12712a44f6f9907ace8ca93e974-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:47:28 [async_llm.py:261] Added request cmpl-6051f12712a44f6f9907ace8ca93e974-0.
INFO 03-02 00:47:29 [logger.py:42] Received request cmpl-27ea17aa7de84da38f69215751c6a9ee-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:47:29 [async_llm.py:261] Added request cmpl-27ea17aa7de84da38f69215751c6a9ee-0.
INFO 03-02 00:47:30 [logger.py:42] Received request cmpl-29314b07ebdb435483ff65e154847cc0-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:47:30 [async_llm.py:261] Added request cmpl-29314b07ebdb435483ff65e154847cc0-0.
INFO 03-02 00:47:31 [logger.py:42] Received request cmpl-fa35f2fa86404473acbb57b77f411a87-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:47:31 [async_llm.py:261] Added request cmpl-fa35f2fa86404473acbb57b77f411a87-0.
INFO 03-02 00:47:32 [logger.py:42] Received request cmpl-a93fb9246b3e490a85319388c7c31981-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:47:32 [async_llm.py:261] Added request cmpl-a93fb9246b3e490a85319388c7c31981-0.
INFO 03-02 00:47:34 [logger.py:42] Received request cmpl-027b591310114ee98b19bd0472293106-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:47:34 [async_llm.py:261] Added request cmpl-027b591310114ee98b19bd0472293106-0.
INFO 03-02 00:47:35 [logger.py:42] Received request cmpl-8bce7671fefb418d9501ff26c274a6e1-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:47:35 [async_llm.py:261] Added request cmpl-8bce7671fefb418d9501ff26c274a6e1-0.
INFO 03-02 00:47:36 [logger.py:42] Received request cmpl-99e11854dc2f429db5d02854e02fadaf-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:47:36 [async_llm.py:261] Added request cmpl-99e11854dc2f429db5d02854e02fadaf-0.
INFO 03-02 00:47:37 [logger.py:42] Received request cmpl-db1c2e7c446041eb8216e0eace2d7a12-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:47:37 [async_llm.py:261] Added request cmpl-db1c2e7c446041eb8216e0eace2d7a12-0.
INFO 03-02 00:47:38 [loggers.py:116] Engine 000: Avg prompt throughput: 27.9 tokens/s, Avg generation throughput: 4.5 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.6%
INFO 03-02 00:47:38 [logger.py:42] Received request cmpl-c4b35d1d2db9490eb6d42bff0cfcbf5d-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:47:38 [async_llm.py:261] Added request cmpl-c4b35d1d2db9490eb6d42bff0cfcbf5d-0.
INFO 03-02 00:47:39 [logger.py:42] Received request cmpl-e79793445c1748e599c97ab4829b413e-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:47:39 [async_llm.py:261] Added request cmpl-e79793445c1748e599c97ab4829b413e-0.
INFO 03-02 00:47:40 [logger.py:42] Received request cmpl-34cb32d09d154d6ca81fc84a0d82f914-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:47:40 [async_llm.py:261] Added request cmpl-34cb32d09d154d6ca81fc84a0d82f914-0.
INFO 03-02 00:47:42 [logger.py:42] Received request cmpl-db26ac384cda4aa7965aa5603c53dfdf-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:47:42 [async_llm.py:261] Added request cmpl-db26ac384cda4aa7965aa5603c53dfdf-0.
INFO 03-02 00:47:43 [logger.py:42] Received request cmpl-a3e2a98470784a6aa3c6c02a44cbfd2f-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:47:43 [async_llm.py:261] Added request cmpl-a3e2a98470784a6aa3c6c02a44cbfd2f-0.
INFO 03-02 00:47:44 [logger.py:42] Received request cmpl-7dbb01628c454303ac043fa966f662e3-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:47:44 [async_llm.py:261] Added request cmpl-7dbb01628c454303ac043fa966f662e3-0.
INFO 03-02 00:47:45 [logger.py:42] Received request cmpl-23b0728cf2984b54af42f58625d026ab-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:47:45 [async_llm.py:261] Added request cmpl-23b0728cf2984b54af42f58625d026ab-0.
INFO 03-02 00:47:46 [logger.py:42] Received request cmpl-30c5fb54c52c4bc9a4145e5a704f8dc7-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:47:46 [async_llm.py:261] Added request cmpl-30c5fb54c52c4bc9a4145e5a704f8dc7-0.
INFO 03-02 00:47:47 [logger.py:42] Received request cmpl-77ae3bdadba948d885cfe1e490c0b7df-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:47:47 [async_llm.py:261] Added request cmpl-77ae3bdadba948d885cfe1e490c0b7df-0.
INFO 03-02 00:47:48 [loggers.py:116] Engine 000: Avg prompt throughput: 27.9 tokens/s, Avg generation throughput: 4.5 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.6%
INFO 03-02 00:47:49 [logger.py:42] Received request cmpl-15f6ada205884ae9915e6d899b61f74c-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:47:49 [async_llm.py:261] Added request cmpl-15f6ada205884ae9915e6d899b61f74c-0.
INFO 03-02 00:47:50 [logger.py:42] Received request cmpl-58408151f5c440e5ac691152fb8e1d90-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:47:50 [async_llm.py:261] Added request cmpl-58408151f5c440e5ac691152fb8e1d90-0.
INFO 03-02 00:47:51 [logger.py:42] Received request cmpl-97dede04b9ec4cdf97eb24934c1f261f-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:47:51 [async_llm.py:261] Added request cmpl-97dede04b9ec4cdf97eb24934c1f261f-0.
INFO 03-02 00:47:52 [logger.py:42] Received request cmpl-a219f42f8e344f8389e9fd90fa75f4ec-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:47:52 [async_llm.py:261] Added request cmpl-a219f42f8e344f8389e9fd90fa75f4ec-0.
INFO 03-02 00:47:53 [logger.py:42] Received request cmpl-56dc36e1c26f497ea1415455e154e219-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:47:53 [async_llm.py:261] Added request cmpl-56dc36e1c26f497ea1415455e154e219-0.
INFO 03-02 00:47:54 [logger.py:42] Received request cmpl-fdbaac383e8f4457bc0eafbaf8964804-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:47:54 [async_llm.py:261] Added request cmpl-fdbaac383e8f4457bc0eafbaf8964804-0.
INFO 03-02 00:47:55 [logger.py:42] Received request cmpl-ef33a11dcd4e4ee0bab2786ebefc520b-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:47:55 [async_llm.py:261] Added request cmpl-ef33a11dcd4e4ee0bab2786ebefc520b-0.
INFO 03-02 00:47:57 [logger.py:42] Received request cmpl-487e2ff1c3494551b7f12c707be5ad26-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:47:57 [async_llm.py:261] Added request cmpl-487e2ff1c3494551b7f12c707be5ad26-0.
INFO 03-02 00:47:58 [loggers.py:116] Engine 000: Avg prompt throughput: 24.8 tokens/s, Avg generation throughput: 4.0 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.6%
INFO 03-02 00:47:58 [logger.py:42] Received request cmpl-adc23637b8034821885ed3bed603f91c-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:47:58 [async_llm.py:261] Added request cmpl-adc23637b8034821885ed3bed603f91c-0.
INFO 03-02 00:47:59 [logger.py:42] Received request cmpl-e384cd9133da4b5ca27ed2648f3424a2-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:47:59 [async_llm.py:261] Added request cmpl-e384cd9133da4b5ca27ed2648f3424a2-0.
INFO 03-02 00:48:00 [logger.py:42] Received request cmpl-bfe58e7f96c54a1d9a28c11c4430eda9-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:48:00 [async_llm.py:261] Added request cmpl-bfe58e7f96c54a1d9a28c11c4430eda9-0.
INFO 03-02 00:48:01 [logger.py:42] Received request cmpl-950870f956bd443aabe82d367ae86796-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:48:01 [async_llm.py:261] Added request cmpl-950870f956bd443aabe82d367ae86796-0.
INFO 03-02 00:48:02 [logger.py:42] Received request cmpl-53ed7d6339d6497dad72d456cdba80ed-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:48:02 [async_llm.py:261] Added request cmpl-53ed7d6339d6497dad72d456cdba80ed-0.
INFO 03-02 00:48:04 [logger.py:42] Received request cmpl-69d25ceac2214e8ab1bbebd3dd8e9cab-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:48:04 [async_llm.py:261] Added request cmpl-69d25ceac2214e8ab1bbebd3dd8e9cab-0.
INFO 03-02 00:48:05 [logger.py:42] Received request cmpl-378c531176be4f49b01eb5577ffeb556-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:48:05 [async_llm.py:261] Added request cmpl-378c531176be4f49b01eb5577ffeb556-0.
INFO 03-02 00:48:06 [logger.py:42] Received request cmpl-df2158825dc94481b771227dfef8bc7e-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:48:06 [async_llm.py:261] Added request cmpl-df2158825dc94481b771227dfef8bc7e-0.
INFO 03-02 00:48:07 [logger.py:42] Received request cmpl-82cbb2458fd2457eada910cf8f0f4e4a-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:48:07 [async_llm.py:261] Added request cmpl-82cbb2458fd2457eada910cf8f0f4e4a-0.
INFO 03-02 00:48:08 [loggers.py:116] Engine 000: Avg prompt throughput: 27.9 tokens/s, Avg generation throughput: 4.5 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.6%
INFO 03-02 00:48:08 [logger.py:42] Received request cmpl-0c88772b8e75426883b6256abf653735-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:48:08 [async_llm.py:261] Added request cmpl-0c88772b8e75426883b6256abf653735-0.
INFO 03-02 00:48:09 [logger.py:42] Received request cmpl-c21591cdaba8401aaea6b34d9d849ec1-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:48:09 [async_llm.py:261] Added request cmpl-c21591cdaba8401aaea6b34d9d849ec1-0.
INFO 03-02 00:48:10 [logger.py:42] Received request cmpl-5f280a7a30fd46cf86292da1c74494b3-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:48:10 [async_llm.py:261] Added request cmpl-5f280a7a30fd46cf86292da1c74494b3-0.
INFO 03-02 00:48:12 [logger.py:42] Received request cmpl-4d97b7ffe6dd489bb9bb27ba184e53cc-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:48:12 [async_llm.py:261] Added request cmpl-4d97b7ffe6dd489bb9bb27ba184e53cc-0.
INFO 03-02 00:48:13 [logger.py:42] Received request cmpl-4d3787f55cf5407fad00b90fa534c283-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:48:13 [async_llm.py:261] Added request cmpl-4d3787f55cf5407fad00b90fa534c283-0.
INFO 03-02 00:48:14 [logger.py:42] Received request cmpl-ff96602e905b42fb96b51736a206f288-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:48:14 [async_llm.py:261] Added request cmpl-ff96602e905b42fb96b51736a206f288-0.
INFO 03-02 00:48:15 [logger.py:42] Received request cmpl-462069bc48994335b4ea99865c8c9f06-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:48:15 [async_llm.py:261] Added request cmpl-462069bc48994335b4ea99865c8c9f06-0.
INFO 03-02 00:48:16 [logger.py:42] Received request cmpl-489c10218ebf4030a528a84d4b7fad5d-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:48:16 [async_llm.py:261] Added request cmpl-489c10218ebf4030a528a84d4b7fad5d-0.
INFO 03-02 00:48:17 [logger.py:42] Received request cmpl-a2d9491bab16418b90d96afb92b7493f-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:48:17 [async_llm.py:261] Added request cmpl-a2d9491bab16418b90d96afb92b7493f-0.
INFO 03-02 00:48:18 [loggers.py:116] Engine 000: Avg prompt throughput: 27.9 tokens/s, Avg generation throughput: 4.5 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.6%
INFO 03-02 00:48:19 [logger.py:42] Received request cmpl-49fbedb5ebe34b85ae63fe97508582d3-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:48:19 [async_llm.py:261] Added request cmpl-49fbedb5ebe34b85ae63fe97508582d3-0.
INFO 03-02 00:48:20 [logger.py:42] Received request cmpl-0b15a3ce55a848f6acb178559ca3be26-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:48:20 [async_llm.py:261] Added request cmpl-0b15a3ce55a848f6acb178559ca3be26-0.
INFO 03-02 00:48:21 [logger.py:42] Received request cmpl-2a749c30ade342498af441d8d3631985-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:48:21 [async_llm.py:261] Added request cmpl-2a749c30ade342498af441d8d3631985-0.
INFO 03-02 00:48:22 [logger.py:42] Received request cmpl-06f4edf141d140e5bf981dc565da0ac7-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:48:22 [async_llm.py:261] Added request cmpl-06f4edf141d140e5bf981dc565da0ac7-0.
INFO 03-02 00:48:23 [logger.py:42] Received request cmpl-15ff094be7d44a5c9a981be4d0184f42-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:48:23 [async_llm.py:261] Added request cmpl-15ff094be7d44a5c9a981be4d0184f42-0.
INFO 03-02 00:48:24 [logger.py:42] Received request cmpl-6859132b123d4533b1ea53a248daa3f8-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:48:24 [async_llm.py:261] Added request cmpl-6859132b123d4533b1ea53a248daa3f8-0.
INFO 03-02 00:48:25 [logger.py:42] Received request cmpl-4c21d22fa0974ac48e6e4e3aff959710-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:48:25 [async_llm.py:261] Added request cmpl-4c21d22fa0974ac48e6e4e3aff959710-0.
INFO 03-02 00:48:27 [logger.py:42] Received request cmpl-c52aff1c02cf4a6fb315c9d6d801b0de-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:48:27 [async_llm.py:261] Added request cmpl-c52aff1c02cf4a6fb315c9d6d801b0de-0.
INFO 03-02 00:48:28 [loggers.py:116] Engine 000: Avg prompt throughput: 24.8 tokens/s, Avg generation throughput: 4.0 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.6%
INFO 03-02 00:48:28 [logger.py:42] Received request cmpl-cbec298f899b433cbc11adaa67e2b174-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:48:28 [async_llm.py:261] Added request cmpl-cbec298f899b433cbc11adaa67e2b174-0.
INFO 03-02 00:48:29 [logger.py:42] Received request cmpl-242b32eda94c43bca5cd23892b2200b5-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:48:29 [async_llm.py:261] Added request cmpl-242b32eda94c43bca5cd23892b2200b5-0.
INFO 03-02 00:48:30 [logger.py:42] Received request cmpl-024e867656584d6db24ab7abac886fa9-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:48:30 [async_llm.py:261] Added request cmpl-024e867656584d6db24ab7abac886fa9-0.
INFO 03-02 00:48:31 [logger.py:42] Received request cmpl-1598550e29b04e3684ec37b17cb98334-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:48:31 [async_llm.py:261] Added request cmpl-1598550e29b04e3684ec37b17cb98334-0.
INFO 03-02 00:48:32 [logger.py:42] Received request cmpl-99dfc4e8bdc34032916d03a5fe6346f3-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:48:32 [async_llm.py:261] Added request cmpl-99dfc4e8bdc34032916d03a5fe6346f3-0.
INFO 03-02 00:48:34 [logger.py:42] Received request cmpl-ac74122b23b7400e8eeb3e0ec4a31160-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:48:34 [async_llm.py:261] Added request cmpl-ac74122b23b7400e8eeb3e0ec4a31160-0.
INFO 03-02 00:48:35 [logger.py:42] Received request cmpl-f9f6a4563e3f46e3adf894c9d32d5ba7-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:48:35 [async_llm.py:261] Added request cmpl-f9f6a4563e3f46e3adf894c9d32d5ba7-0.
INFO 03-02 00:48:36 [logger.py:42] Received request cmpl-495315aac1e54010b2139727bfb389b5-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:48:36 [async_llm.py:261] Added request cmpl-495315aac1e54010b2139727bfb389b5-0.
INFO 03-02 00:48:37 [logger.py:42] Received request cmpl-9cf1dafa5fb74c73b8534cfdae2b46e1-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:48:37 [async_llm.py:261] Added request cmpl-9cf1dafa5fb74c73b8534cfdae2b46e1-0.
INFO 03-02 00:48:38 [loggers.py:116] Engine 000: Avg prompt throughput: 27.9 tokens/s, Avg generation throughput: 4.5 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.6%
INFO 03-02 00:48:38 [logger.py:42] Received request cmpl-d494015baf524731bc819a087fd381c4-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:48:38 [async_llm.py:261] Added request cmpl-d494015baf524731bc819a087fd381c4-0.
INFO 03-02 00:48:39 [logger.py:42] Received request cmpl-253ad17174c64ce497cb54d14fe6cb1b-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:48:39 [async_llm.py:261] Added request cmpl-253ad17174c64ce497cb54d14fe6cb1b-0.
INFO 03-02 00:48:40 [logger.py:42] Received request cmpl-732d12421a394ccba2a10ec3939a98dd-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:48:40 [async_llm.py:261] Added request cmpl-732d12421a394ccba2a10ec3939a98dd-0.
INFO 03-02 00:48:42 [logger.py:42] Received request cmpl-a303f2fd8c2642cd829ea57d5ee53fd2-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:48:42 [async_llm.py:261] Added request cmpl-a303f2fd8c2642cd829ea57d5ee53fd2-0.
INFO 03-02 00:48:43 [logger.py:42] Received request cmpl-784631b6651841f1b3c0b6ca275dcb39-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:48:43 [async_llm.py:261] Added request cmpl-784631b6651841f1b3c0b6ca275dcb39-0.
INFO 03-02 00:48:44 [logger.py:42] Received request cmpl-d6b716440be5483bbe082e812f70f70a-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:48:44 [async_llm.py:261] Added request cmpl-d6b716440be5483bbe082e812f70f70a-0.
INFO 03-02 00:48:45 [logger.py:42] Received request cmpl-1005ac4a197442f4b8a34cce914b8cdf-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:48:45 [async_llm.py:261] Added request cmpl-1005ac4a197442f4b8a34cce914b8cdf-0.
INFO 03-02 00:48:46 [logger.py:42] Received request cmpl-b02fcf10ff1e46f69aef2b0c54438e01-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:48:46 [async_llm.py:261] Added request cmpl-b02fcf10ff1e46f69aef2b0c54438e01-0.
INFO 03-02 00:48:47 [logger.py:42] Received request cmpl-39953832422a4a36918551e62994ea3e-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:48:47 [async_llm.py:261] Added request cmpl-39953832422a4a36918551e62994ea3e-0.
INFO 03-02 00:48:48 [loggers.py:116] Engine 000: Avg prompt throughput: 27.9 tokens/s, Avg generation throughput: 4.5 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.6%
INFO 03-02 00:48:49 [logger.py:42] Received request cmpl-2bc0728501a2472ba8a5e6c46bca0ad7-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:48:49 [async_llm.py:261] Added request cmpl-2bc0728501a2472ba8a5e6c46bca0ad7-0.
INFO 03-02 00:48:50 [logger.py:42] Received request cmpl-eda2ddc0125b4a9bb359ca222625fbd7-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:48:50 [async_llm.py:261] Added request cmpl-eda2ddc0125b4a9bb359ca222625fbd7-0.
INFO 03-02 00:48:51 [logger.py:42] Received request cmpl-66455bb2bfd146d99c812ae20b38a483-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:48:51 [async_llm.py:261] Added request cmpl-66455bb2bfd146d99c812ae20b38a483-0.
INFO 03-02 00:48:52 [logger.py:42] Received request cmpl-8057a800050f4225b0340197c5da4582-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:48:52 [async_llm.py:261] Added request cmpl-8057a800050f4225b0340197c5da4582-0.
INFO 03-02 00:48:53 [logger.py:42] Received request cmpl-182c74ee1eeb4e0d8e9b1862ae2198f6-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:48:53 [async_llm.py:261] Added request cmpl-182c74ee1eeb4e0d8e9b1862ae2198f6-0.
INFO 03-02 00:48:54 [logger.py:42] Received request cmpl-d6fdcc33721a46c5b8c5ad3f21cc3aff-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:48:54 [async_llm.py:261] Added request cmpl-d6fdcc33721a46c5b8c5ad3f21cc3aff-0.
INFO 03-02 00:48:55 [logger.py:42] Received request cmpl-4c3fcc4a92d24a49862c22fea8f69c61-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:48:55 [async_llm.py:261] Added request cmpl-4c3fcc4a92d24a49862c22fea8f69c61-0.
INFO 03-02 00:48:57 [logger.py:42] Received request cmpl-9819f1ec103d494aa0d72d2f27cf69e3-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:48:57 [async_llm.py:261] Added request cmpl-9819f1ec103d494aa0d72d2f27cf69e3-0.
INFO 03-02 00:48:58 [loggers.py:116] Engine 000: Avg prompt throughput: 24.8 tokens/s, Avg generation throughput: 4.0 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.6%
INFO 03-02 00:48:58 [logger.py:42] Received request cmpl-52589d6c7c45465f925e1ac90904eafb-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:48:58 [async_llm.py:261] Added request cmpl-52589d6c7c45465f925e1ac90904eafb-0.
INFO 03-02 00:48:59 [logger.py:42] Received request cmpl-755352bb6e1d4caa9f7fe27acf3f314e-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:48:59 [async_llm.py:261] Added request cmpl-755352bb6e1d4caa9f7fe27acf3f314e-0.
INFO 03-02 00:49:00 [logger.py:42] Received request cmpl-4507d283595a42a69b772e9c353c9157-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:49:00 [async_llm.py:261] Added request cmpl-4507d283595a42a69b772e9c353c9157-0.
INFO 03-02 00:49:01 [logger.py:42] Received request cmpl-8646dc56f557432f99de81b1e8cc18e5-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:49:01 [async_llm.py:261] Added request cmpl-8646dc56f557432f99de81b1e8cc18e5-0.
INFO 03-02 00:49:02 [logger.py:42] Received request cmpl-0d486247d7f94cde9979d63e1857d254-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:49:02 [async_llm.py:261] Added request cmpl-0d486247d7f94cde9979d63e1857d254-0.
INFO 03-02 00:49:04 [logger.py:42] Received request cmpl-c693d9c491ac44e28d8aa6ceed09a00c-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:49:04 [async_llm.py:261] Added request cmpl-c693d9c491ac44e28d8aa6ceed09a00c-0.
INFO 03-02 00:49:05 [logger.py:42] Received request cmpl-4148fb6aba76436daeb02373c9ece698-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:49:05 [async_llm.py:261] Added request cmpl-4148fb6aba76436daeb02373c9ece698-0.
INFO 03-02 00:49:06 [logger.py:42] Received request cmpl-86177555c5c740fc95f96343cb263504-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:49:06 [async_llm.py:261] Added request cmpl-86177555c5c740fc95f96343cb263504-0.
INFO 03-02 00:49:07 [logger.py:42] Received request cmpl-1e7e5b18cf4e417292e1942a538dd463-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:49:07 [async_llm.py:261] Added request cmpl-1e7e5b18cf4e417292e1942a538dd463-0.
INFO 03-02 00:49:08 [loggers.py:116] Engine 000: Avg prompt throughput: 27.9 tokens/s, Avg generation throughput: 4.5 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.6%
INFO 03-02 00:49:08 [logger.py:42] Received request cmpl-73356f535abb4da290455939be6baeff-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:49:08 [async_llm.py:261] Added request cmpl-73356f535abb4da290455939be6baeff-0.
INFO 03-02 00:49:09 [logger.py:42] Received request cmpl-b0794e83a23546e5884eb5966b7bb4e8-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:49:09 [async_llm.py:261] Added request cmpl-b0794e83a23546e5884eb5966b7bb4e8-0.
INFO 03-02 00:49:10 [logger.py:42] Received request cmpl-b75594c8043e4198b826018fae5091c3-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:49:10 [async_llm.py:261] Added request cmpl-b75594c8043e4198b826018fae5091c3-0.
INFO 03-02 00:49:12 [logger.py:42] Received request cmpl-4569c508cf4c470eaa379ee566553142-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:49:12 [async_llm.py:261] Added request cmpl-4569c508cf4c470eaa379ee566553142-0.
INFO 03-02 00:49:13 [logger.py:42] Received request cmpl-ae4ca5f40b484ea99b733e104f742ecf-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:49:13 [async_llm.py:261] Added request cmpl-ae4ca5f40b484ea99b733e104f742ecf-0.
INFO 03-02 00:49:14 [logger.py:42] Received request cmpl-8a466918fae842b68dc347c4a238a753-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:49:14 [async_llm.py:261] Added request cmpl-8a466918fae842b68dc347c4a238a753-0.
INFO 03-02 00:49:15 [logger.py:42] Received request cmpl-d8318e5e8df64c90971b722af126aa3c-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:49:15 [async_llm.py:261] Added request cmpl-d8318e5e8df64c90971b722af126aa3c-0.
INFO 03-02 00:49:16 [logger.py:42] Received request cmpl-51e35749135446d996e6e8aeff5d9cf6-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:49:16 [async_llm.py:261] Added request cmpl-51e35749135446d996e6e8aeff5d9cf6-0.
INFO 03-02 00:49:17 [logger.py:42] Received request cmpl-8f66501800e549dc85260bc5cc35ca14-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:49:17 [async_llm.py:261] Added request cmpl-8f66501800e549dc85260bc5cc35ca14-0.
INFO 03-02 00:49:18 [loggers.py:116] Engine 000: Avg prompt throughput: 27.9 tokens/s, Avg generation throughput: 4.5 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.6%
INFO 03-02 00:49:19 [logger.py:42] Received request cmpl-d84d3b2a89064d53bfed8ed02beefe3c-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:49:19 [async_llm.py:261] Added request cmpl-d84d3b2a89064d53bfed8ed02beefe3c-0.
INFO 03-02 00:49:20 [logger.py:42] Received request cmpl-350b3adaae3a4dd48097c3d7baca1ab8-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:49:20 [async_llm.py:261] Added request cmpl-350b3adaae3a4dd48097c3d7baca1ab8-0.
INFO 03-02 00:49:21 [logger.py:42] Received request cmpl-2545025c5ad347babd38743266dd53fa-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:49:21 [async_llm.py:261] Added request cmpl-2545025c5ad347babd38743266dd53fa-0.
INFO 03-02 00:49:22 [logger.py:42] Received request cmpl-3b60938b84c24e6787280a11343c2790-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:49:22 [async_llm.py:261] Added request cmpl-3b60938b84c24e6787280a11343c2790-0.
INFO 03-02 00:49:23 [logger.py:42] Received request cmpl-a458d2c20b3949cfa342dbe1b314bd2d-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:49:23 [async_llm.py:261] Added request cmpl-a458d2c20b3949cfa342dbe1b314bd2d-0.
INFO 03-02 00:49:24 [logger.py:42] Received request cmpl-66044c693d06407d9bcfadaf52384308-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:49:24 [async_llm.py:261] Added request cmpl-66044c693d06407d9bcfadaf52384308-0.
INFO 03-02 00:49:25 [logger.py:42] Received request cmpl-3903bbdab92a4889b3bde42300ec2273-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:49:25 [async_llm.py:261] Added request cmpl-3903bbdab92a4889b3bde42300ec2273-0.
INFO 03-02 00:49:27 [logger.py:42] Received request cmpl-1ac8db3fe64c4db395e5e9bf77eb931e-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:49:27 [async_llm.py:261] Added request cmpl-1ac8db3fe64c4db395e5e9bf77eb931e-0.
INFO 03-02 00:49:28 [loggers.py:116] Engine 000: Avg prompt throughput: 24.8 tokens/s, Avg generation throughput: 4.0 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.6%
INFO 03-02 00:49:28 [logger.py:42] Received request cmpl-2def8405e31c4c06a5ade5c63c5cf6ec-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:49:28 [async_llm.py:261] Added request cmpl-2def8405e31c4c06a5ade5c63c5cf6ec-0.
INFO 03-02 00:49:29 [logger.py:42] Received request cmpl-d3ca8d276dde4f8eb3045df6d1b052a2-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:49:29 [async_llm.py:261] Added request cmpl-d3ca8d276dde4f8eb3045df6d1b052a2-0.
INFO 03-02 00:49:30 [logger.py:42] Received request cmpl-e098a2c11ce44ee08c48905563604091-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:49:30 [async_llm.py:261] Added request cmpl-e098a2c11ce44ee08c48905563604091-0.
INFO 03-02 00:49:31 [logger.py:42] Received request cmpl-d96e605ec1ce4a38b6c0fc4c7a29a1c1-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:49:31 [async_llm.py:261] Added request cmpl-d96e605ec1ce4a38b6c0fc4c7a29a1c1-0.
INFO 03-02 00:49:32 [logger.py:42] Received request cmpl-9d8aac03382644adbabd41ba72d19f9b-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:49:32 [async_llm.py:261] Added request cmpl-9d8aac03382644adbabd41ba72d19f9b-0.
INFO 03-02 00:49:34 [logger.py:42] Received request cmpl-bcd11f4a84c446cfa1879aaaf3649ddf-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:49:34 [async_llm.py:261] Added request cmpl-bcd11f4a84c446cfa1879aaaf3649ddf-0.
INFO 03-02 00:49:35 [logger.py:42] Received request cmpl-2979682ad997468cac38f7b4cb0226f0-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:49:35 [async_llm.py:261] Added request cmpl-2979682ad997468cac38f7b4cb0226f0-0.
INFO 03-02 00:49:36 [logger.py:42] Received request cmpl-c877442162624c8ba4bc0bfee2db3a2f-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:49:36 [async_llm.py:261] Added request cmpl-c877442162624c8ba4bc0bfee2db3a2f-0.
INFO 03-02 00:49:37 [logger.py:42] Received request cmpl-79490aff5ecb4a73aec8090551c26911-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:49:37 [async_llm.py:261] Added request cmpl-79490aff5ecb4a73aec8090551c26911-0.
INFO 03-02 00:49:38 [loggers.py:116] Engine 000: Avg prompt throughput: 27.9 tokens/s, Avg generation throughput: 4.5 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.6%
INFO 03-02 00:49:38 [logger.py:42] Received request cmpl-c06346ed15c3488984a17e35342aa276-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:49:38 [async_llm.py:261] Added request cmpl-c06346ed15c3488984a17e35342aa276-0.
INFO 03-02 00:49:39 [logger.py:42] Received request cmpl-e238cd6bac184288b00567332b35b6b8-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:49:39 [async_llm.py:261] Added request cmpl-e238cd6bac184288b00567332b35b6b8-0.
INFO 03-02 00:49:40 [logger.py:42] Received request cmpl-d78e29c22e644c088a8321c626f15b02-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:49:40 [async_llm.py:261] Added request cmpl-d78e29c22e644c088a8321c626f15b02-0.
INFO 03-02 00:49:42 [logger.py:42] Received request cmpl-71006cc287ec4b8790d73bc2e15fa916-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:49:42 [async_llm.py:261] Added request cmpl-71006cc287ec4b8790d73bc2e15fa916-0.
INFO 03-02 00:49:43 [logger.py:42] Received request cmpl-2110e252f97246f3972c9c1a20dc5bda-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:49:43 [async_llm.py:261] Added request cmpl-2110e252f97246f3972c9c1a20dc5bda-0.
INFO 03-02 00:49:44 [logger.py:42] Received request cmpl-5ec5c9ac5ecc42f28fbad0b0f468325c-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:49:44 [async_llm.py:261] Added request cmpl-5ec5c9ac5ecc42f28fbad0b0f468325c-0.
INFO 03-02 00:49:45 [logger.py:42] Received request cmpl-af4c2d1b57914ef2afb0e5648f0bcd14-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:49:45 [async_llm.py:261] Added request cmpl-af4c2d1b57914ef2afb0e5648f0bcd14-0.
INFO 03-02 00:49:46 [logger.py:42] Received request cmpl-d44f0fa612d14a88aa5e4c9dd18dccff-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:49:46 [async_llm.py:261] Added request cmpl-d44f0fa612d14a88aa5e4c9dd18dccff-0.
INFO 03-02 00:49:47 [logger.py:42] Received request cmpl-43fbfc20fc91436fb861a2fcc53f85c9-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:49:47 [async_llm.py:261] Added request cmpl-43fbfc20fc91436fb861a2fcc53f85c9-0.
INFO 03-02 00:49:48 [loggers.py:116] Engine 000: Avg prompt throughput: 27.9 tokens/s, Avg generation throughput: 4.5 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.6%
INFO 03-02 00:49:49 [logger.py:42] Received request cmpl-cd27640debfd40c0984b650900191264-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:49:49 [async_llm.py:261] Added request cmpl-cd27640debfd40c0984b650900191264-0.
INFO 03-02 00:49:50 [logger.py:42] Received request cmpl-69d3bc1cefd84ca5a7afe28e3966b9e3-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:49:50 [async_llm.py:261] Added request cmpl-69d3bc1cefd84ca5a7afe28e3966b9e3-0.
INFO 03-02 00:49:51 [logger.py:42] Received request cmpl-d71675d1d4af41f091a79d5a23b95f63-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:49:51 [async_llm.py:261] Added request cmpl-d71675d1d4af41f091a79d5a23b95f63-0.
INFO 03-02 00:49:52 [logger.py:42] Received request cmpl-76f4dab772bd4b01abf5b5d929122cce-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:49:52 [async_llm.py:261] Added request cmpl-76f4dab772bd4b01abf5b5d929122cce-0.
INFO 03-02 00:49:53 [logger.py:42] Received request cmpl-2ca52668368f42c7a222250b44b9ec75-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:49:53 [async_llm.py:261] Added request cmpl-2ca52668368f42c7a222250b44b9ec75-0.
INFO 03-02 00:49:54 [logger.py:42] Received request cmpl-4d20907fe006435cafc3f06162e832c3-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:49:54 [async_llm.py:261] Added request cmpl-4d20907fe006435cafc3f06162e832c3-0.
INFO 03-02 00:49:55 [logger.py:42] Received request cmpl-2823bcabf41d4cbc939486d7ff7fa5fc-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:49:55 [async_llm.py:261] Added request cmpl-2823bcabf41d4cbc939486d7ff7fa5fc-0.
INFO 03-02 00:49:57 [logger.py:42] Received request cmpl-2390d5801f2647a8899d8ce25b259309-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:49:57 [async_llm.py:261] Added request cmpl-2390d5801f2647a8899d8ce25b259309-0.
INFO 03-02 00:49:58 [loggers.py:116] Engine 000: Avg prompt throughput: 24.8 tokens/s, Avg generation throughput: 4.0 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.6%
INFO 03-02 00:49:58 [logger.py:42] Received request cmpl-de6cfc2eba004cb485fbb0ad42694940-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:49:58 [async_llm.py:261] Added request cmpl-de6cfc2eba004cb485fbb0ad42694940-0.
INFO 03-02 00:49:59 [logger.py:42] Received request cmpl-a0ca7308fc6b4a9084b382f2a427bcfa-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:49:59 [async_llm.py:261] Added request cmpl-a0ca7308fc6b4a9084b382f2a427bcfa-0.
INFO 03-02 00:50:00 [logger.py:42] Received request cmpl-eaa78138fe6f4ae199ea73e6e1bdd6f5-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:50:00 [async_llm.py:261] Added request cmpl-eaa78138fe6f4ae199ea73e6e1bdd6f5-0.
INFO 03-02 00:50:01 [logger.py:42] Received request cmpl-4fa73c64931e4c50981731ee3d640f07-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:50:01 [async_llm.py:261] Added request cmpl-4fa73c64931e4c50981731ee3d640f07-0.
INFO 03-02 00:50:02 [logger.py:42] Received request cmpl-c8e39d968579473ba6e1f9a72b4924a1-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:50:02 [async_llm.py:261] Added request cmpl-c8e39d968579473ba6e1f9a72b4924a1-0.
INFO 03-02 00:50:03 [logger.py:42] Received request cmpl-8fde2dd463f64d47b11fe4e9330ec539-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:50:03 [async_llm.py:261] Added request cmpl-8fde2dd463f64d47b11fe4e9330ec539-0.
INFO 03-02 00:50:05 [logger.py:42] Received request cmpl-4cc1e8e8c3de4fe5a648c1c5218b7dc6-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:50:05 [async_llm.py:261] Added request cmpl-4cc1e8e8c3de4fe5a648c1c5218b7dc6-0.
INFO 03-02 00:50:06 [logger.py:42] Received request cmpl-a0e91e9af90143f797429ffd16993986-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:50:06 [async_llm.py:261] Added request cmpl-a0e91e9af90143f797429ffd16993986-0.
INFO 03-02 00:50:07 [logger.py:42] Received request cmpl-a02768b070434948b021d3bdc5418c35-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:50:07 [async_llm.py:261] Added request cmpl-a02768b070434948b021d3bdc5418c35-0.
INFO 03-02 00:50:08 [loggers.py:116] Engine 000: Avg prompt throughput: 27.9 tokens/s, Avg generation throughput: 4.5 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.6%
INFO 03-02 00:50:08 [logger.py:42] Received request cmpl-dc3e29a65149426fa81e15c8b27af078-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:50:08 [async_llm.py:261] Added request cmpl-dc3e29a65149426fa81e15c8b27af078-0.
INFO 03-02 00:50:09 [logger.py:42] Received request cmpl-bcfbc5bb0fb54f3db68db076e4c7b354-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:50:09 [async_llm.py:261] Added request cmpl-bcfbc5bb0fb54f3db68db076e4c7b354-0.
INFO 03-02 00:50:10 [logger.py:42] Received request cmpl-70e69af24e07435ab261e833e8e0d397-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:50:10 [async_llm.py:261] Added request cmpl-70e69af24e07435ab261e833e8e0d397-0.
INFO 03-02 00:50:12 [logger.py:42] Received request cmpl-c2bc1e85ee9e4b86b7dc0873b19477e3-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:50:12 [async_llm.py:261] Added request cmpl-c2bc1e85ee9e4b86b7dc0873b19477e3-0.
INFO 03-02 00:50:13 [logger.py:42] Received request cmpl-892ab3f0240d4f9189c8c855d9a5a56a-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:50:13 [async_llm.py:261] Added request cmpl-892ab3f0240d4f9189c8c855d9a5a56a-0.
INFO 03-02 00:50:14 [logger.py:42] Received request cmpl-67f717356f6449cba30eff84c98ec420-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:50:14 [async_llm.py:261] Added request cmpl-67f717356f6449cba30eff84c98ec420-0.
INFO 03-02 00:50:15 [logger.py:42] Received request cmpl-fd6a2b3183764d4e856e01e0d8fd68d0-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:50:15 [async_llm.py:261] Added request cmpl-fd6a2b3183764d4e856e01e0d8fd68d0-0.
INFO 03-02 00:50:16 [logger.py:42] Received request cmpl-751b4447a6d246bb94cc1e6493a6bb5a-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:50:16 [async_llm.py:261] Added request cmpl-751b4447a6d246bb94cc1e6493a6bb5a-0.
INFO 03-02 00:50:17 [logger.py:42] Received request cmpl-3ebc60276b44463994890cb404a9371c-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:50:17 [async_llm.py:261] Added request cmpl-3ebc60276b44463994890cb404a9371c-0.
INFO 03-02 00:50:18 [loggers.py:116] Engine 000: Avg prompt throughput: 27.9 tokens/s, Avg generation throughput: 4.5 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.6%
INFO 03-02 00:50:18 [logger.py:42] Received request cmpl-1ed80aff774144de80c202506a44474c-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:50:18 [async_llm.py:261] Added request cmpl-1ed80aff774144de80c202506a44474c-0.
INFO 03-02 00:50:20 [logger.py:42] Received request cmpl-95fa41b14088442586221939ccf2a949-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:50:20 [async_llm.py:261] Added request cmpl-95fa41b14088442586221939ccf2a949-0.
INFO 03-02 00:50:21 [logger.py:42] Received request cmpl-b05707926c8e407380881a1456bcfc3a-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:50:21 [async_llm.py:261] Added request cmpl-b05707926c8e407380881a1456bcfc3a-0.
INFO 03-02 00:50:22 [logger.py:42] Received request cmpl-21ad5a4c4b91480fba86adafc552fdeb-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:50:22 [async_llm.py:261] Added request cmpl-21ad5a4c4b91480fba86adafc552fdeb-0.
INFO 03-02 00:50:23 [logger.py:42] Received request cmpl-247d7d0b1d664a3abab8a42610a1ecfb-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:50:23 [async_llm.py:261] Added request cmpl-247d7d0b1d664a3abab8a42610a1ecfb-0.
INFO 03-02 00:50:24 [logger.py:42] Received request cmpl-d2d1df9303524b39b353859504254cff-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:50:24 [async_llm.py:261] Added request cmpl-d2d1df9303524b39b353859504254cff-0.
INFO 03-02 00:50:25 [logger.py:42] Received request cmpl-bd6ac7eba5974eb38aacae27e56b3905-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:50:25 [async_llm.py:261] Added request cmpl-bd6ac7eba5974eb38aacae27e56b3905-0.
INFO 03-02 00:50:27 [logger.py:42] Received request cmpl-211d6f62a12f4b4aa834b8199f94a1b5-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:50:27 [async_llm.py:261] Added request cmpl-211d6f62a12f4b4aa834b8199f94a1b5-0.
INFO 03-02 00:50:28 [loggers.py:116] Engine 000: Avg prompt throughput: 24.8 tokens/s, Avg generation throughput: 4.0 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.6%
INFO 03-02 00:50:28 [logger.py:42] Received request cmpl-b4247dbde49b4f6789ac5b1d469f9442-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:50:28 [async_llm.py:261] Added request cmpl-b4247dbde49b4f6789ac5b1d469f9442-0.
INFO 03-02 00:50:29 [logger.py:42] Received request cmpl-2295d428144e465db8bd58768f0f4132-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:50:29 [async_llm.py:261] Added request cmpl-2295d428144e465db8bd58768f0f4132-0.
INFO 03-02 00:50:30 [logger.py:42] Received request cmpl-82a34bc57d1d4ff8a5ac9b4671374046-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:50:30 [async_llm.py:261] Added request cmpl-82a34bc57d1d4ff8a5ac9b4671374046-0.
INFO 03-02 00:50:31 [logger.py:42] Received request cmpl-02bf3b447b1b4d14bb8104d8f5131505-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:50:31 [async_llm.py:261] Added request cmpl-02bf3b447b1b4d14bb8104d8f5131505-0.
INFO 03-02 00:50:32 [logger.py:42] Received request cmpl-dca233004f2d499c8a741deda9510a8e-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:50:32 [async_llm.py:261] Added request cmpl-dca233004f2d499c8a741deda9510a8e-0.
INFO 03-02 00:50:33 [logger.py:42] Received request cmpl-0c0198fa199d4e45b65169772f1b735d-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:50:33 [async_llm.py:261] Added request cmpl-0c0198fa199d4e45b65169772f1b735d-0.
INFO 03-02 00:50:35 [logger.py:42] Received request cmpl-39b35f77d4784582ad84270e733ef853-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:50:35 [async_llm.py:261] Added request cmpl-39b35f77d4784582ad84270e733ef853-0.
INFO 03-02 00:50:36 [logger.py:42] Received request cmpl-ccb1bfd14e124caaaefb52d2d18451e2-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:50:36 [async_llm.py:261] Added request cmpl-ccb1bfd14e124caaaefb52d2d18451e2-0.
INFO 03-02 00:50:37 [logger.py:42] Received request cmpl-1f812da5df3b48ffa0936915c6cf5900-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:50:37 [async_llm.py:261] Added request cmpl-1f812da5df3b48ffa0936915c6cf5900-0.
INFO 03-02 00:50:38 [loggers.py:116] Engine 000: Avg prompt throughput: 27.9 tokens/s, Avg generation throughput: 4.5 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.6%
INFO 03-02 00:50:38 [logger.py:42] Received request cmpl-f8a66ce01b7245319c1186d89b4235a6-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:50:38 [async_llm.py:261] Added request cmpl-f8a66ce01b7245319c1186d89b4235a6-0.
INFO 03-02 00:50:39 [logger.py:42] Received request cmpl-5735f1fd2cb446a79a9abb16533f32ec-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:50:39 [async_llm.py:261] Added request cmpl-5735f1fd2cb446a79a9abb16533f32ec-0.
INFO 03-02 00:50:40 [logger.py:42] Received request cmpl-b3f37b0efd194856b3b7a1d5d142081d-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:50:40 [async_llm.py:261] Added request cmpl-b3f37b0efd194856b3b7a1d5d142081d-0.
INFO 03-02 00:50:42 [logger.py:42] Received request cmpl-a4d1c57e39ba4b14b82aeeccc2a2e0f1-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:50:42 [async_llm.py:261] Added request cmpl-a4d1c57e39ba4b14b82aeeccc2a2e0f1-0.
INFO 03-02 00:50:43 [logger.py:42] Received request cmpl-8ad966c8f12046dcab72acf0cc717aa2-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:50:43 [async_llm.py:261] Added request cmpl-8ad966c8f12046dcab72acf0cc717aa2-0.
INFO 03-02 00:50:44 [logger.py:42] Received request cmpl-3edbb13c5c594f5a80787259dec29084-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:50:44 [async_llm.py:261] Added request cmpl-3edbb13c5c594f5a80787259dec29084-0.
INFO 03-02 00:50:45 [logger.py:42] Received request cmpl-91eb163c700a4ac1a4745a1fe4a4d44f-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:50:45 [async_llm.py:261] Added request cmpl-91eb163c700a4ac1a4745a1fe4a4d44f-0.
INFO 03-02 00:50:46 [logger.py:42] Received request cmpl-5ffb78c4fa274804a40ee04d575d0d07-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:50:46 [async_llm.py:261] Added request cmpl-5ffb78c4fa274804a40ee04d575d0d07-0.
INFO 03-02 00:50:47 [logger.py:42] Received request cmpl-2f3b6622611e4a04b3bc53534ae4a8d1-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:50:47 [async_llm.py:261] Added request cmpl-2f3b6622611e4a04b3bc53534ae4a8d1-0.
INFO 03-02 00:50:48 [loggers.py:116] Engine 000: Avg prompt throughput: 27.9 tokens/s, Avg generation throughput: 4.5 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.6%
INFO 03-02 00:50:48 [logger.py:42] Received request cmpl-be601a4385264ed195948b53bbdd97b8-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:50:48 [async_llm.py:261] Added request cmpl-be601a4385264ed195948b53bbdd97b8-0.
INFO 03-02 00:50:50 [logger.py:42] Received request cmpl-93d72f778d70420097bd6f31440f9cd3-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:50:50 [async_llm.py:261] Added request cmpl-93d72f778d70420097bd6f31440f9cd3-0.
INFO 03-02 00:50:51 [logger.py:42] Received request cmpl-26b88a77e4d342ac85b3a3575df2a196-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:50:51 [async_llm.py:261] Added request cmpl-26b88a77e4d342ac85b3a3575df2a196-0.
INFO 03-02 00:50:52 [logger.py:42] Received request cmpl-2740ee7a74b24240a55f18782c224072-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:50:52 [async_llm.py:261] Added request cmpl-2740ee7a74b24240a55f18782c224072-0.
INFO 03-02 00:50:53 [logger.py:42] Received request cmpl-a56c6b83316d4ae2b253832374f8ff23-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:50:53 [async_llm.py:261] Added request cmpl-a56c6b83316d4ae2b253832374f8ff23-0.
INFO 03-02 00:50:54 [logger.py:42] Received request cmpl-a53327347f8644b281f6cd68bce3b57a-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:50:54 [async_llm.py:261] Added request cmpl-a53327347f8644b281f6cd68bce3b57a-0.
INFO 03-02 00:50:55 [logger.py:42] Received request cmpl-48233e495c594e5ba8a08a6a2c49815f-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:50:55 [async_llm.py:261] Added request cmpl-48233e495c594e5ba8a08a6a2c49815f-0.
INFO 03-02 00:50:57 [logger.py:42] Received request cmpl-822905b52bd0488d9fa573d0812b4088-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:50:57 [async_llm.py:261] Added request cmpl-822905b52bd0488d9fa573d0812b4088-0.
INFO 03-02 00:50:58 [loggers.py:116] Engine 000: Avg prompt throughput: 24.8 tokens/s, Avg generation throughput: 4.0 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.6%
INFO 03-02 00:50:58 [logger.py:42] Received request cmpl-0c88c8f4683548709353874bc8118b28-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:50:58 [async_llm.py:261] Added request cmpl-0c88c8f4683548709353874bc8118b28-0.
INFO 03-02 00:50:59 [logger.py:42] Received request cmpl-70075b2a222d47cea4303b50d3f44518-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:50:59 [async_llm.py:261] Added request cmpl-70075b2a222d47cea4303b50d3f44518-0.
INFO 03-02 00:51:00 [logger.py:42] Received request cmpl-6523a966bcd84412a0e9043435773891-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:51:00 [async_llm.py:261] Added request cmpl-6523a966bcd84412a0e9043435773891-0.
INFO 03-02 00:51:01 [logger.py:42] Received request cmpl-f4656752f0224fd9a3af8a737aca5ee5-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:51:01 [async_llm.py:261] Added request cmpl-f4656752f0224fd9a3af8a737aca5ee5-0.
INFO 03-02 00:51:02 [logger.py:42] Received request cmpl-f6b1b58fa0dd47d788d885ec6b0c4cb6-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:51:02 [async_llm.py:261] Added request cmpl-f6b1b58fa0dd47d788d885ec6b0c4cb6-0.
INFO 03-02 00:51:03 [logger.py:42] Received request cmpl-f21782286db64569b949d0388e41ffd8-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:51:03 [async_llm.py:261] Added request cmpl-f21782286db64569b949d0388e41ffd8-0.
INFO 03-02 00:51:05 [logger.py:42] Received request cmpl-1df39d0e7ee94938befbf5c3d9e9e51b-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:51:05 [async_llm.py:261] Added request cmpl-1df39d0e7ee94938befbf5c3d9e9e51b-0.
INFO 03-02 00:51:06 [logger.py:42] Received request cmpl-2b81850116f74ade963f9c196da71dc4-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:51:06 [async_llm.py:261] Added request cmpl-2b81850116f74ade963f9c196da71dc4-0.
INFO 03-02 00:51:07 [logger.py:42] Received request cmpl-e6ddfde488c046b5844b6ebb97aac144-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:51:07 [async_llm.py:261] Added request cmpl-e6ddfde488c046b5844b6ebb97aac144-0.
INFO 03-02 00:51:08 [loggers.py:116] Engine 000: Avg prompt throughput: 27.9 tokens/s, Avg generation throughput: 4.5 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.6%
INFO 03-02 00:51:08 [logger.py:42] Received request cmpl-97670dcae1d946fda32fc4caa864dac6-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:51:08 [async_llm.py:261] Added request cmpl-97670dcae1d946fda32fc4caa864dac6-0.
INFO 03-02 00:51:09 [logger.py:42] Received request cmpl-fff63a24751a4e1d95bf8c7101eead26-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:51:09 [async_llm.py:261] Added request cmpl-fff63a24751a4e1d95bf8c7101eead26-0.
INFO 03-02 00:51:10 [logger.py:42] Received request cmpl-313c54ea754346b6b0853baaa22fec19-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:51:10 [async_llm.py:261] Added request cmpl-313c54ea754346b6b0853baaa22fec19-0.
INFO 03-02 00:51:12 [logger.py:42] Received request cmpl-380ccd5ae4dc4265b9105fde34ba8caa-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:51:12 [async_llm.py:261] Added request cmpl-380ccd5ae4dc4265b9105fde34ba8caa-0.
INFO 03-02 00:51:13 [logger.py:42] Received request cmpl-6fe55dc426d248b88558916f1dc91603-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:51:13 [async_llm.py:261] Added request cmpl-6fe55dc426d248b88558916f1dc91603-0.
INFO 03-02 00:51:14 [logger.py:42] Received request cmpl-54d1dd5e82234f5da3b60f7a6c4bb6ea-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:51:14 [async_llm.py:261] Added request cmpl-54d1dd5e82234f5da3b60f7a6c4bb6ea-0.
INFO 03-02 00:51:15 [logger.py:42] Received request cmpl-fa2a4d6b687446b3942fdff2a427b592-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:51:15 [async_llm.py:261] Added request cmpl-fa2a4d6b687446b3942fdff2a427b592-0.
INFO 03-02 00:51:16 [logger.py:42] Received request cmpl-18294102028647cf94dc2b1963cfce1c-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:51:16 [async_llm.py:261] Added request cmpl-18294102028647cf94dc2b1963cfce1c-0.
INFO 03-02 00:51:17 [logger.py:42] Received request cmpl-a8f965a447bd4d74b82c3ce5ac78ee5f-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:51:17 [async_llm.py:261] Added request cmpl-a8f965a447bd4d74b82c3ce5ac78ee5f-0.
INFO 03-02 00:51:18 [loggers.py:116] Engine 000: Avg prompt throughput: 27.9 tokens/s, Avg generation throughput: 4.5 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.6%
INFO 03-02 00:51:18 [logger.py:42] Received request cmpl-4dbe8c1e432f413d8d4e14bfdef749cf-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:51:18 [async_llm.py:261] Added request cmpl-4dbe8c1e432f413d8d4e14bfdef749cf-0.
INFO 03-02 00:51:20 [logger.py:42] Received request cmpl-4069b8e0531b4730be5821f3ff5626c8-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:51:20 [async_llm.py:261] Added request cmpl-4069b8e0531b4730be5821f3ff5626c8-0.
INFO 03-02 00:51:21 [logger.py:42] Received request cmpl-6ae0856e15de4869913d5339bffea041-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:51:21 [async_llm.py:261] Added request cmpl-6ae0856e15de4869913d5339bffea041-0.
INFO 03-02 00:51:22 [logger.py:42] Received request cmpl-090035d590574783817ca2fff67a1aee-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:51:22 [async_llm.py:261] Added request cmpl-090035d590574783817ca2fff67a1aee-0.
INFO 03-02 00:51:23 [logger.py:42] Received request cmpl-b4db8e1fff9244f4803b3978fa1edbaa-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:51:23 [async_llm.py:261] Added request cmpl-b4db8e1fff9244f4803b3978fa1edbaa-0.
INFO 03-02 00:51:24 [logger.py:42] Received request cmpl-5f8f9ab47b7d40f3ab3313fb0b35d13d-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:51:24 [async_llm.py:261] Added request cmpl-5f8f9ab47b7d40f3ab3313fb0b35d13d-0.
INFO 03-02 00:51:25 [logger.py:42] Received request cmpl-3ac4dc8abd9f4eefacca03e548f065ec-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:51:25 [async_llm.py:261] Added request cmpl-3ac4dc8abd9f4eefacca03e548f065ec-0.
INFO 03-02 00:51:27 [logger.py:42] Received request cmpl-5962fe8634134b96b6c1311f949f056e-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:51:27 [async_llm.py:261] Added request cmpl-5962fe8634134b96b6c1311f949f056e-0.
INFO 03-02 00:51:28 [loggers.py:116] Engine 000: Avg prompt throughput: 24.8 tokens/s, Avg generation throughput: 4.0 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.6%
INFO 03-02 00:51:28 [logger.py:42] Received request cmpl-22e09d9fa5b642adb76bbda5dd548035-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:51:28 [async_llm.py:261] Added request cmpl-22e09d9fa5b642adb76bbda5dd548035-0.
INFO 03-02 00:51:29 [logger.py:42] Received request cmpl-5614e9a9231a4e3eba173cda9702e6c9-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:51:29 [async_llm.py:261] Added request cmpl-5614e9a9231a4e3eba173cda9702e6c9-0.
INFO 03-02 00:51:30 [logger.py:42] Received request cmpl-938d951a083d4f4e98ad221f54c3afac-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:51:30 [async_llm.py:261] Added request cmpl-938d951a083d4f4e98ad221f54c3afac-0.
INFO 03-02 00:51:31 [logger.py:42] Received request cmpl-b6cdd2ddd57c442eac947a0531532901-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:51:31 [async_llm.py:261] Added request cmpl-b6cdd2ddd57c442eac947a0531532901-0.
INFO 03-02 00:51:32 [logger.py:42] Received request cmpl-0f99e5e91340434aa1ae139130ccdab4-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:51:32 [async_llm.py:261] Added request cmpl-0f99e5e91340434aa1ae139130ccdab4-0.
INFO 03-02 00:51:33 [logger.py:42] Received request cmpl-7949477270f64001938fc437ff660825-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:51:33 [async_llm.py:261] Added request cmpl-7949477270f64001938fc437ff660825-0.
INFO 03-02 00:51:35 [logger.py:42] Received request cmpl-83042060f07445e5b773b920fdf0ab5a-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:51:35 [async_llm.py:261] Added request cmpl-83042060f07445e5b773b920fdf0ab5a-0.
INFO 03-02 00:51:36 [logger.py:42] Received request cmpl-b815a156f7514d5399be851d0cfdcb11-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:51:36 [async_llm.py:261] Added request cmpl-b815a156f7514d5399be851d0cfdcb11-0.
INFO 03-02 00:51:37 [logger.py:42] Received request cmpl-2918907c1fd74b4b945540c37c55ede4-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:51:37 [async_llm.py:261] Added request cmpl-2918907c1fd74b4b945540c37c55ede4-0.
INFO 03-02 00:51:38 [loggers.py:116] Engine 000: Avg prompt throughput: 27.9 tokens/s, Avg generation throughput: 4.5 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.6%
INFO 03-02 00:51:38 [logger.py:42] Received request cmpl-0ed3960acb3d45ffb50358d48dbff47c-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:51:38 [async_llm.py:261] Added request cmpl-0ed3960acb3d45ffb50358d48dbff47c-0.
INFO 03-02 00:51:39 [logger.py:42] Received request cmpl-c1c8eaa68a334692bcf0493e6bc241b9-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:51:39 [async_llm.py:261] Added request cmpl-c1c8eaa68a334692bcf0493e6bc241b9-0.
INFO 03-02 00:51:40 [logger.py:42] Received request cmpl-664dd36cffef419f88489833cb3e65e4-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:51:40 [async_llm.py:261] Added request cmpl-664dd36cffef419f88489833cb3e65e4-0.
INFO 03-02 00:51:42 [logger.py:42] Received request cmpl-de2478b8c8b2417ebdb1a9e3d19992a8-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:51:42 [async_llm.py:261] Added request cmpl-de2478b8c8b2417ebdb1a9e3d19992a8-0.
INFO 03-02 00:51:43 [logger.py:42] Received request cmpl-83918577acb84e4f93470173f353f97d-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:51:43 [async_llm.py:261] Added request cmpl-83918577acb84e4f93470173f353f97d-0.
INFO 03-02 00:51:44 [logger.py:42] Received request cmpl-b70879b2170045ce8fa5b5a757c65d2b-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:51:44 [async_llm.py:261] Added request cmpl-b70879b2170045ce8fa5b5a757c65d2b-0.
INFO 03-02 00:51:45 [logger.py:42] Received request cmpl-1034e207bc4e4c48b1da4b2b383b2936-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:51:45 [async_llm.py:261] Added request cmpl-1034e207bc4e4c48b1da4b2b383b2936-0.
INFO 03-02 00:51:46 [logger.py:42] Received request cmpl-70b680009e4445f599901f4660ba444e-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:51:46 [async_llm.py:261] Added request cmpl-70b680009e4445f599901f4660ba444e-0.
INFO 03-02 00:51:47 [logger.py:42] Received request cmpl-10a169efb9d8423fbb98cb465f857883-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:51:47 [async_llm.py:261] Added request cmpl-10a169efb9d8423fbb98cb465f857883-0.
INFO 03-02 00:51:48 [loggers.py:116] Engine 000: Avg prompt throughput: 27.9 tokens/s, Avg generation throughput: 4.5 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.6%
INFO 03-02 00:51:48 [logger.py:42] Received request cmpl-3f5500fae36043fe90392ba41484aee1-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:51:48 [async_llm.py:261] Added request cmpl-3f5500fae36043fe90392ba41484aee1-0.
INFO 03-02 00:51:50 [logger.py:42] Received request cmpl-f6aced62286146598cb8f0e2923eb60a-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:51:50 [async_llm.py:261] Added request cmpl-f6aced62286146598cb8f0e2923eb60a-0.
INFO 03-02 00:51:51 [logger.py:42] Received request cmpl-72801c85bef94c908acbeed83a33d9b5-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:51:51 [async_llm.py:261] Added request cmpl-72801c85bef94c908acbeed83a33d9b5-0.
INFO 03-02 00:51:52 [logger.py:42] Received request cmpl-33ebfc302a194ce5a106e2b9654e5514-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:51:52 [async_llm.py:261] Added request cmpl-33ebfc302a194ce5a106e2b9654e5514-0.
INFO 03-02 00:51:53 [logger.py:42] Received request cmpl-f1985d3a54d1425d990361051c08157f-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:51:53 [async_llm.py:261] Added request cmpl-f1985d3a54d1425d990361051c08157f-0.
INFO 03-02 00:51:54 [logger.py:42] Received request cmpl-16c515ae8b6e4dd4940b65f0ec04ceb3-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:51:54 [async_llm.py:261] Added request cmpl-16c515ae8b6e4dd4940b65f0ec04ceb3-0.
INFO 03-02 00:51:55 [logger.py:42] Received request cmpl-e665845620ea43568089b51aff4940b4-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:51:55 [async_llm.py:261] Added request cmpl-e665845620ea43568089b51aff4940b4-0.
INFO 03-02 00:51:57 [logger.py:42] Received request cmpl-7e3f9c81f18a4f3a8f5a6a187bbb598f-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:51:57 [async_llm.py:261] Added request cmpl-7e3f9c81f18a4f3a8f5a6a187bbb598f-0.
INFO 03-02 00:51:58 [loggers.py:116] Engine 000: Avg prompt throughput: 24.8 tokens/s, Avg generation throughput: 4.0 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.6%
INFO 03-02 00:51:58 [logger.py:42] Received request cmpl-17585951d7b141b282739139ac68cd74-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:51:58 [async_llm.py:261] Added request cmpl-17585951d7b141b282739139ac68cd74-0.
INFO 03-02 00:51:59 [logger.py:42] Received request cmpl-ee3afbef50bb4b01ba843751f9d8c5e8-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:51:59 [async_llm.py:261] Added request cmpl-ee3afbef50bb4b01ba843751f9d8c5e8-0.
INFO 03-02 00:52:00 [logger.py:42] Received request cmpl-90275117a2bc44db875a9667f9948110-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:52:00 [async_llm.py:261] Added request cmpl-90275117a2bc44db875a9667f9948110-0.
INFO 03-02 00:52:01 [logger.py:42] Received request cmpl-d0a8b7e860594c16869b23e19bdd7be1-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:52:01 [async_llm.py:261] Added request cmpl-d0a8b7e860594c16869b23e19bdd7be1-0.
INFO 03-02 00:52:02 [logger.py:42] Received request cmpl-3e9d3b9f47104a77b49a756cd8a2cab0-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:52:02 [async_llm.py:261] Added request cmpl-3e9d3b9f47104a77b49a756cd8a2cab0-0.
INFO 03-02 00:52:03 [logger.py:42] Received request cmpl-20b212a60ccf4eb79138086502950baa-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:52:03 [async_llm.py:261] Added request cmpl-20b212a60ccf4eb79138086502950baa-0.
INFO 03-02 00:52:05 [logger.py:42] Received request cmpl-2c7074e282b14f7ea9de4fd29fb4c41d-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:52:05 [async_llm.py:261] Added request cmpl-2c7074e282b14f7ea9de4fd29fb4c41d-0.
INFO 03-02 00:52:06 [logger.py:42] Received request cmpl-e8793ade7dcb43c098502645f4b1746b-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:52:06 [async_llm.py:261] Added request cmpl-e8793ade7dcb43c098502645f4b1746b-0.
INFO 03-02 00:52:07 [logger.py:42] Received request cmpl-6e310119cb004a6d948db12736f8e742-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:52:07 [async_llm.py:261] Added request cmpl-6e310119cb004a6d948db12736f8e742-0.
INFO 03-02 00:52:08 [loggers.py:116] Engine 000: Avg prompt throughput: 27.9 tokens/s, Avg generation throughput: 4.5 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.6%
INFO 03-02 00:52:08 [logger.py:42] Received request cmpl-a418c896625a4a83b1557fe60d4c4a76-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:52:08 [async_llm.py:261] Added request cmpl-a418c896625a4a83b1557fe60d4c4a76-0.
INFO 03-02 00:52:09 [logger.py:42] Received request cmpl-67f33a248ff94c25becc34334eb0d3ab-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:52:09 [async_llm.py:261] Added request cmpl-67f33a248ff94c25becc34334eb0d3ab-0.
INFO 03-02 00:52:10 [logger.py:42] Received request cmpl-7b9854d3b4b649d6a8ac07ffe8566c75-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:52:10 [async_llm.py:261] Added request cmpl-7b9854d3b4b649d6a8ac07ffe8566c75-0.
INFO 03-02 00:52:11 [logger.py:42] Received request cmpl-076b1fdc80b5464a9e4c0703eac4627f-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:52:12 [async_llm.py:261] Added request cmpl-076b1fdc80b5464a9e4c0703eac4627f-0.
INFO 03-02 00:52:13 [logger.py:42] Received request cmpl-62cc86ab046d4da6aba4582af27caef3-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:52:13 [async_llm.py:261] Added request cmpl-62cc86ab046d4da6aba4582af27caef3-0.
INFO 03-02 00:52:14 [logger.py:42] Received request cmpl-f544f0eafa1641abba7c0b06e1db2230-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:52:14 [async_llm.py:261] Added request cmpl-f544f0eafa1641abba7c0b06e1db2230-0.
INFO 03-02 00:52:15 [logger.py:42] Received request cmpl-46490826b512479ea1c8524747bc670c-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:52:15 [async_llm.py:261] Added request cmpl-46490826b512479ea1c8524747bc670c-0.
INFO 03-02 00:52:16 [logger.py:42] Received request cmpl-e610c973c3b14eab998ecf00da3d9987-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:52:16 [async_llm.py:261] Added request cmpl-e610c973c3b14eab998ecf00da3d9987-0.
INFO 03-02 00:52:17 [logger.py:42] Received request cmpl-9970ff1e9652490d8c130780ec97c7e7-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:52:17 [async_llm.py:261] Added request cmpl-9970ff1e9652490d8c130780ec97c7e7-0.
INFO 03-02 00:52:18 [loggers.py:116] Engine 000: Avg prompt throughput: 27.9 tokens/s, Avg generation throughput: 4.5 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.6%
INFO 03-02 00:52:18 [logger.py:42] Received request cmpl-89b81bdb9648495ca22ea2cc18c827af-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:52:18 [async_llm.py:261] Added request cmpl-89b81bdb9648495ca22ea2cc18c827af-0.
INFO 03-02 00:52:20 [logger.py:42] Received request cmpl-e34f671103ad44099381d5782eb3cd06-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:52:20 [async_llm.py:261] Added request cmpl-e34f671103ad44099381d5782eb3cd06-0.
INFO 03-02 00:52:21 [logger.py:42] Received request cmpl-f63f9196dc4744ffa7790d82e88cca1c-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:52:21 [async_llm.py:261] Added request cmpl-f63f9196dc4744ffa7790d82e88cca1c-0.
INFO 03-02 00:52:22 [logger.py:42] Received request cmpl-8b8ddec45e52423fb8a23e932021392e-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:52:22 [async_llm.py:261] Added request cmpl-8b8ddec45e52423fb8a23e932021392e-0.
INFO 03-02 00:52:23 [logger.py:42] Received request cmpl-a0785d4734e8422b855fc649004710e3-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:52:23 [async_llm.py:261] Added request cmpl-a0785d4734e8422b855fc649004710e3-0.
INFO 03-02 00:52:24 [logger.py:42] Received request cmpl-b98dce461a5b4c67afc3d426116b0ab6-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:52:24 [async_llm.py:261] Added request cmpl-b98dce461a5b4c67afc3d426116b0ab6-0.
INFO 03-02 00:52:25 [logger.py:42] Received request cmpl-d4c716e7dd9f4a7aae524473b27641c6-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:52:25 [async_llm.py:261] Added request cmpl-d4c716e7dd9f4a7aae524473b27641c6-0.
INFO 03-02 00:52:26 [logger.py:42] Received request cmpl-c63f087d32b1465dbfff7c8c3c1450ec-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:52:26 [async_llm.py:261] Added request cmpl-c63f087d32b1465dbfff7c8c3c1450ec-0.
INFO 03-02 00:52:28 [loggers.py:116] Engine 000: Avg prompt throughput: 24.8 tokens/s, Avg generation throughput: 4.0 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.6%
INFO 03-02 00:52:28 [logger.py:42] Received request cmpl-23ed5339e9884ff6be4ffc68735102f7-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:52:28 [async_llm.py:261] Added request cmpl-23ed5339e9884ff6be4ffc68735102f7-0.
INFO 03-02 00:52:29 [logger.py:42] Received request cmpl-b05f315e7e2541fe8ae8f0280d2f6be9-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:52:29 [async_llm.py:261] Added request cmpl-b05f315e7e2541fe8ae8f0280d2f6be9-0.
INFO 03-02 00:52:30 [logger.py:42] Received request cmpl-0e66f0043a9e487380c362c1691d5dad-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:52:30 [async_llm.py:261] Added request cmpl-0e66f0043a9e487380c362c1691d5dad-0.
INFO 03-02 00:52:31 [logger.py:42] Received request cmpl-c85a8691342a4ef1a68025ff9520b47b-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:52:31 [async_llm.py:261] Added request cmpl-c85a8691342a4ef1a68025ff9520b47b-0.
INFO 03-02 00:52:32 [logger.py:42] Received request cmpl-b3dc2ca9b01b4570b435875aa54aabba-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:52:32 [async_llm.py:261] Added request cmpl-b3dc2ca9b01b4570b435875aa54aabba-0.
INFO 03-02 00:52:33 [logger.py:42] Received request cmpl-e95b208bdb3049e0915ab1042594efce-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:52:33 [async_llm.py:261] Added request cmpl-e95b208bdb3049e0915ab1042594efce-0.
INFO 03-02 00:52:35 [logger.py:42] Received request cmpl-5389918b8a4846aeb859cc83a26dc330-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:52:35 [async_llm.py:261] Added request cmpl-5389918b8a4846aeb859cc83a26dc330-0.
INFO 03-02 00:52:36 [logger.py:42] Received request cmpl-60f59090da174facad08cd454e8123da-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:52:36 [async_llm.py:261] Added request cmpl-60f59090da174facad08cd454e8123da-0.
INFO 03-02 00:52:37 [logger.py:42] Received request cmpl-bd56bfec9dda4aa28d2f7ed80e77fc02-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:52:37 [async_llm.py:261] Added request cmpl-bd56bfec9dda4aa28d2f7ed80e77fc02-0.
INFO 03-02 00:52:38 [loggers.py:116] Engine 000: Avg prompt throughput: 27.9 tokens/s, Avg generation throughput: 4.5 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.6%
INFO 03-02 00:52:38 [logger.py:42] Received request cmpl-02309724d5b24964bbff2763945cedbc-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:52:38 [async_llm.py:261] Added request cmpl-02309724d5b24964bbff2763945cedbc-0.
INFO 03-02 00:52:39 [logger.py:42] Received request cmpl-9ee0096439fc43b5a6792e9930db80c5-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:52:39 [async_llm.py:261] Added request cmpl-9ee0096439fc43b5a6792e9930db80c5-0.
INFO 03-02 00:52:40 [logger.py:42] Received request cmpl-92118f7d179b4ad6aa6d359843c9a9de-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:52:40 [async_llm.py:261] Added request cmpl-92118f7d179b4ad6aa6d359843c9a9de-0.
INFO 03-02 00:52:41 [logger.py:42] Received request cmpl-23d2ebe9f7a949d3936cca61bb9d2d0f-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:52:41 [async_llm.py:261] Added request cmpl-23d2ebe9f7a949d3936cca61bb9d2d0f-0.
INFO 03-02 00:52:43 [logger.py:42] Received request cmpl-6314689b57bc4c0096a91fb039159160-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:52:43 [async_llm.py:261] Added request cmpl-6314689b57bc4c0096a91fb039159160-0.
INFO 03-02 00:52:44 [logger.py:42] Received request cmpl-7d6e372bcfd84e49ab5289398fa69956-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:52:44 [async_llm.py:261] Added request cmpl-7d6e372bcfd84e49ab5289398fa69956-0.
INFO 03-02 00:52:45 [logger.py:42] Received request cmpl-0262a3aa02874930803a3fc63dee3216-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:52:45 [async_llm.py:261] Added request cmpl-0262a3aa02874930803a3fc63dee3216-0.
INFO 03-02 00:52:46 [logger.py:42] Received request cmpl-641906136ace45a7a07c329599c46df0-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:52:46 [async_llm.py:261] Added request cmpl-641906136ace45a7a07c329599c46df0-0.
INFO 03-02 00:52:47 [logger.py:42] Received request cmpl-578abe2b4952445797f11e1df071f39a-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:52:47 [async_llm.py:261] Added request cmpl-578abe2b4952445797f11e1df071f39a-0.
INFO 03-02 00:52:48 [loggers.py:116] Engine 000: Avg prompt throughput: 27.9 tokens/s, Avg generation throughput: 4.5 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.6%
INFO 03-02 00:52:48 [logger.py:42] Received request cmpl-7722319c219e495c9a48847c3e54dec3-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:52:48 [async_llm.py:261] Added request cmpl-7722319c219e495c9a48847c3e54dec3-0.
INFO 03-02 00:52:50 [logger.py:42] Received request cmpl-3813c7b83d254fa1881cbb639c1a68ef-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:52:50 [async_llm.py:261] Added request cmpl-3813c7b83d254fa1881cbb639c1a68ef-0.
INFO 03-02 00:52:51 [logger.py:42] Received request cmpl-6a0226fc53b64107a8adb201da94dd4d-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:52:51 [async_llm.py:261] Added request cmpl-6a0226fc53b64107a8adb201da94dd4d-0.
INFO 03-02 00:52:52 [logger.py:42] Received request cmpl-fb2f0db8f1c84b07a6b0e3a131135e3e-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:52:52 [async_llm.py:261] Added request cmpl-fb2f0db8f1c84b07a6b0e3a131135e3e-0.
INFO 03-02 00:52:53 [logger.py:42] Received request cmpl-4917218ca4e049b4a5b5a9f904c703e1-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:52:53 [async_llm.py:261] Added request cmpl-4917218ca4e049b4a5b5a9f904c703e1-0.
INFO 03-02 00:52:54 [logger.py:42] Received request cmpl-43fab4c73c3d4419aa1a82a02f661532-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:52:54 [async_llm.py:261] Added request cmpl-43fab4c73c3d4419aa1a82a02f661532-0.
INFO 03-02 00:52:55 [logger.py:42] Received request cmpl-46fb223b7d3a47aaa39c4cb1bab5dd5a-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:52:55 [async_llm.py:261] Added request cmpl-46fb223b7d3a47aaa39c4cb1bab5dd5a-0.
INFO 03-02 00:52:56 [logger.py:42] Received request cmpl-848b58670c4c40aeae410a80575a525d-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:52:56 [async_llm.py:261] Added request cmpl-848b58670c4c40aeae410a80575a525d-0.
INFO 03-02 00:52:58 [loggers.py:116] Engine 000: Avg prompt throughput: 24.8 tokens/s, Avg generation throughput: 4.0 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.6%
INFO 03-02 00:52:58 [logger.py:42] Received request cmpl-ad6cdea95ed9436ebb534140f560be60-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:52:58 [async_llm.py:261] Added request cmpl-ad6cdea95ed9436ebb534140f560be60-0.
INFO 03-02 00:52:59 [logger.py:42] Received request cmpl-9cf6e70514544211b35d83073e85fd31-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:52:59 [async_llm.py:261] Added request cmpl-9cf6e70514544211b35d83073e85fd31-0.
INFO 03-02 00:53:00 [logger.py:42] Received request cmpl-9a1f47f7738e4d6eab9a133107b4cc09-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:53:00 [async_llm.py:261] Added request cmpl-9a1f47f7738e4d6eab9a133107b4cc09-0.
INFO 03-02 00:53:01 [logger.py:42] Received request cmpl-c8f33bebd1074d0ab567c8f016bd7058-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:53:01 [async_llm.py:261] Added request cmpl-c8f33bebd1074d0ab567c8f016bd7058-0.
INFO 03-02 00:53:02 [logger.py:42] Received request cmpl-0c612d3a6fef401e9283aa22d42249b6-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:53:02 [async_llm.py:261] Added request cmpl-0c612d3a6fef401e9283aa22d42249b6-0.
INFO 03-02 00:53:03 [logger.py:42] Received request cmpl-70f6ac6005ac4a5694ada429cd93a412-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:53:03 [async_llm.py:261] Added request cmpl-70f6ac6005ac4a5694ada429cd93a412-0.
INFO 03-02 00:53:05 [logger.py:42] Received request cmpl-1160fae33ccc4381aa22ea7cbd728110-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:53:05 [async_llm.py:261] Added request cmpl-1160fae33ccc4381aa22ea7cbd728110-0.
INFO 03-02 00:53:06 [logger.py:42] Received request cmpl-9210dfc67c064b059c406acf7b975ac6-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:53:06 [async_llm.py:261] Added request cmpl-9210dfc67c064b059c406acf7b975ac6-0.
INFO 03-02 00:53:07 [logger.py:42] Received request cmpl-a7145480172343f98e84b2b11787dc15-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:53:07 [async_llm.py:261] Added request cmpl-a7145480172343f98e84b2b11787dc15-0.
INFO 03-02 00:53:08 [loggers.py:116] Engine 000: Avg prompt throughput: 27.9 tokens/s, Avg generation throughput: 4.5 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.6%
INFO 03-02 00:53:08 [logger.py:42] Received request cmpl-a2ae26db701641a68bce85cfb355ec0f-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:53:08 [async_llm.py:261] Added request cmpl-a2ae26db701641a68bce85cfb355ec0f-0.
INFO 03-02 00:53:09 [logger.py:42] Received request cmpl-bd1b71b288024993baaa43edd488c165-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:53:09 [async_llm.py:261] Added request cmpl-bd1b71b288024993baaa43edd488c165-0.
INFO 03-02 00:53:10 [logger.py:42] Received request cmpl-10d3517fcd364d8db4c3d1204abb0244-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:53:10 [async_llm.py:261] Added request cmpl-10d3517fcd364d8db4c3d1204abb0244-0.
INFO 03-02 00:53:11 [logger.py:42] Received request cmpl-ad1cc558abb14f9691f5434269aa0505-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:53:11 [async_llm.py:261] Added request cmpl-ad1cc558abb14f9691f5434269aa0505-0.
INFO 03-02 00:53:13 [logger.py:42] Received request cmpl-8640aa71f2134a48ae251ded6855f060-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:53:13 [async_llm.py:261] Added request cmpl-8640aa71f2134a48ae251ded6855f060-0.
INFO 03-02 00:53:14 [logger.py:42] Received request cmpl-06f1416628be45f191b1eb6a6858dae1-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:53:14 [async_llm.py:261] Added request cmpl-06f1416628be45f191b1eb6a6858dae1-0.
INFO 03-02 00:53:15 [logger.py:42] Received request cmpl-5afc01ed9ec24517a72387701ef96b21-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:53:15 [async_llm.py:261] Added request cmpl-5afc01ed9ec24517a72387701ef96b21-0.
INFO 03-02 00:53:16 [logger.py:42] Received request cmpl-9c0123512078458d8381f4c67818da4d-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:53:16 [async_llm.py:261] Added request cmpl-9c0123512078458d8381f4c67818da4d-0.
INFO 03-02 00:53:17 [logger.py:42] Received request cmpl-861188d036764bbb8e501c3b7398d82d-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:53:17 [async_llm.py:261] Added request cmpl-861188d036764bbb8e501c3b7398d82d-0.
INFO 03-02 00:53:18 [loggers.py:116] Engine 000: Avg prompt throughput: 27.9 tokens/s, Avg generation throughput: 4.5 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.6%
INFO 03-02 00:53:18 [logger.py:42] Received request cmpl-a0afe86e1987479ebfa012499bf4068d-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:53:18 [async_llm.py:261] Added request cmpl-a0afe86e1987479ebfa012499bf4068d-0.
INFO 03-02 00:53:20 [logger.py:42] Received request cmpl-8ec1d586cb174669b5be33a43006a7b2-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:53:20 [async_llm.py:261] Added request cmpl-8ec1d586cb174669b5be33a43006a7b2-0.
INFO 03-02 00:53:21 [logger.py:42] Received request cmpl-6c12983e12184d21b9b9118759b45d16-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:53:21 [async_llm.py:261] Added request cmpl-6c12983e12184d21b9b9118759b45d16-0.
INFO 03-02 00:53:22 [logger.py:42] Received request cmpl-e76c4e9537104eb59ab242b7f3ef5888-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:53:22 [async_llm.py:261] Added request cmpl-e76c4e9537104eb59ab242b7f3ef5888-0.
INFO 03-02 00:53:23 [logger.py:42] Received request cmpl-396532f8fe2144258e01f68618e27b6c-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:53:23 [async_llm.py:261] Added request cmpl-396532f8fe2144258e01f68618e27b6c-0.
INFO 03-02 00:53:24 [logger.py:42] Received request cmpl-e97dfe11131d4fd6aa9b1cb667e41151-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:53:24 [async_llm.py:261] Added request cmpl-e97dfe11131d4fd6aa9b1cb667e41151-0.
INFO 03-02 00:53:25 [logger.py:42] Received request cmpl-16141e630e4c4e6f901fb165cc2c882a-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:53:25 [async_llm.py:261] Added request cmpl-16141e630e4c4e6f901fb165cc2c882a-0.
INFO 03-02 00:53:26 [logger.py:42] Received request cmpl-dc5fc39d20dc44fc9b48c16019d2910d-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:53:26 [async_llm.py:261] Added request cmpl-dc5fc39d20dc44fc9b48c16019d2910d-0.
INFO 03-02 00:53:28 [logger.py:42] Received request cmpl-c20f8f0d0c2d489d9aa563997d9514f5-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:53:28 [async_llm.py:261] Added request cmpl-c20f8f0d0c2d489d9aa563997d9514f5-0.
INFO 03-02 00:53:28 [loggers.py:116] Engine 000: Avg prompt throughput: 24.8 tokens/s, Avg generation throughput: 4.0 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.6%
INFO 03-02 00:53:29 [logger.py:42] Received request cmpl-fd7e0eb30dee45f48b5b16213bbdecb5-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:53:29 [async_llm.py:261] Added request cmpl-fd7e0eb30dee45f48b5b16213bbdecb5-0.
INFO 03-02 00:53:30 [logger.py:42] Received request cmpl-04ae4cfb707c4da483528079ee9880ca-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:53:30 [async_llm.py:261] Added request cmpl-04ae4cfb707c4da483528079ee9880ca-0.
INFO 03-02 00:53:31 [logger.py:42] Received request cmpl-b876bc58df7f4569bdc672cd49005b1a-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:53:31 [async_llm.py:261] Added request cmpl-b876bc58df7f4569bdc672cd49005b1a-0.
INFO 03-02 00:53:32 [logger.py:42] Received request cmpl-2ea0749d8f954b4f8ffbcd97f3a0e08e-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:53:32 [async_llm.py:261] Added request cmpl-2ea0749d8f954b4f8ffbcd97f3a0e08e-0.
INFO 03-02 00:53:33 [logger.py:42] Received request cmpl-ac2054b6eab243d98def04dce235a430-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:53:33 [async_llm.py:261] Added request cmpl-ac2054b6eab243d98def04dce235a430-0.
INFO 03-02 00:53:35 [logger.py:42] Received request cmpl-1f52fa2bccd24bb287849f301c4a7273-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:53:35 [async_llm.py:261] Added request cmpl-1f52fa2bccd24bb287849f301c4a7273-0.
INFO 03-02 00:53:36 [logger.py:42] Received request cmpl-ac4d689e30664a988d1042640e4e2fdb-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:53:36 [async_llm.py:261] Added request cmpl-ac4d689e30664a988d1042640e4e2fdb-0.
INFO 03-02 00:53:37 [logger.py:42] Received request cmpl-6b4687daff684ebc9a9c269d9b154950-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:53:37 [async_llm.py:261] Added request cmpl-6b4687daff684ebc9a9c269d9b154950-0.
INFO 03-02 00:53:38 [loggers.py:116] Engine 000: Avg prompt throughput: 27.9 tokens/s, Avg generation throughput: 4.5 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.6%
INFO 03-02 00:53:38 [logger.py:42] Received request cmpl-db57da7be0104e218675a2167ce8710f-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:53:38 [async_llm.py:261] Added request cmpl-db57da7be0104e218675a2167ce8710f-0.
INFO 03-02 00:53:39 [logger.py:42] Received request cmpl-ed1f292fdc974bf7ac1daf48a6f91a78-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:53:39 [async_llm.py:261] Added request cmpl-ed1f292fdc974bf7ac1daf48a6f91a78-0.
INFO 03-02 00:53:40 [logger.py:42] Received request cmpl-3fc75cad92654817aca0313295830a5f-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:53:40 [async_llm.py:261] Added request cmpl-3fc75cad92654817aca0313295830a5f-0.
INFO 03-02 00:53:41 [logger.py:42] Received request cmpl-f713390cc8b54c028926c862f9b52bf7-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:53:41 [async_llm.py:261] Added request cmpl-f713390cc8b54c028926c862f9b52bf7-0.
INFO 03-02 00:53:43 [logger.py:42] Received request cmpl-fc572ca105424cdfa393549d93492b34-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:53:43 [async_llm.py:261] Added request cmpl-fc572ca105424cdfa393549d93492b34-0.
INFO 03-02 00:53:44 [logger.py:42] Received request cmpl-7adfab4636b24469815885755f27edf3-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:53:44 [async_llm.py:261] Added request cmpl-7adfab4636b24469815885755f27edf3-0.
INFO 03-02 00:53:45 [logger.py:42] Received request cmpl-3ee67e4d51624bb9a085218ee0ff4ce6-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:53:45 [async_llm.py:261] Added request cmpl-3ee67e4d51624bb9a085218ee0ff4ce6-0.
INFO 03-02 00:53:46 [logger.py:42] Received request cmpl-898f3b1b99684791980e2876c1dc5b22-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:53:46 [async_llm.py:261] Added request cmpl-898f3b1b99684791980e2876c1dc5b22-0.
INFO 03-02 00:53:47 [logger.py:42] Received request cmpl-47acbe58b9c548d7ad31c9c6a4dea17f-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:53:47 [async_llm.py:261] Added request cmpl-47acbe58b9c548d7ad31c9c6a4dea17f-0.
INFO 03-02 00:53:48 [loggers.py:116] Engine 000: Avg prompt throughput: 27.9 tokens/s, Avg generation throughput: 4.5 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.6%
INFO 03-02 00:53:48 [logger.py:42] Received request cmpl-21294114881c4a758d6003b26f035dd9-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:53:48 [async_llm.py:261] Added request cmpl-21294114881c4a758d6003b26f035dd9-0.
INFO 03-02 00:53:50 [logger.py:42] Received request cmpl-dd5dc79bd75249a991e9b22edf7e66ad-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:53:50 [async_llm.py:261] Added request cmpl-dd5dc79bd75249a991e9b22edf7e66ad-0.
INFO 03-02 00:53:51 [logger.py:42] Received request cmpl-8d53eca7903f4173ac9ffa16c49ef065-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:53:51 [async_llm.py:261] Added request cmpl-8d53eca7903f4173ac9ffa16c49ef065-0.
INFO 03-02 00:53:52 [logger.py:42] Received request cmpl-6be8ea112cd34513b7c32f795916c8ca-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:53:52 [async_llm.py:261] Added request cmpl-6be8ea112cd34513b7c32f795916c8ca-0.
INFO 03-02 00:53:53 [logger.py:42] Received request cmpl-7c6fde0ce2f448b187efd5eddfee4e70-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:53:53 [async_llm.py:261] Added request cmpl-7c6fde0ce2f448b187efd5eddfee4e70-0.
INFO 03-02 00:53:54 [logger.py:42] Received request cmpl-fb56896e8a2f4a25a10b4cac73ca9219-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:53:54 [async_llm.py:261] Added request cmpl-fb56896e8a2f4a25a10b4cac73ca9219-0.
INFO 03-02 00:53:55 [logger.py:42] Received request cmpl-dbf9bf64dde1418487776c73b5f9a871-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:53:55 [async_llm.py:261] Added request cmpl-dbf9bf64dde1418487776c73b5f9a871-0.
INFO 03-02 00:53:56 [logger.py:42] Received request cmpl-1fe183c02d9f4bebaa571f0d709395d2-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:53:56 [async_llm.py:261] Added request cmpl-1fe183c02d9f4bebaa571f0d709395d2-0.
INFO 03-02 00:53:58 [logger.py:42] Received request cmpl-7d65f1d1a2f04ea391a40806b85e9c61-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:53:58 [async_llm.py:261] Added request cmpl-7d65f1d1a2f04ea391a40806b85e9c61-0.
INFO 03-02 00:53:58 [loggers.py:116] Engine 000: Avg prompt throughput: 24.8 tokens/s, Avg generation throughput: 4.0 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.6%
INFO 03-02 00:53:59 [logger.py:42] Received request cmpl-132776f62d6040a19e2c60403678d121-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:53:59 [async_llm.py:261] Added request cmpl-132776f62d6040a19e2c60403678d121-0.
INFO 03-02 00:54:00 [logger.py:42] Received request cmpl-16b373b55dd24da9967419a40b7ec409-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:54:00 [async_llm.py:261] Added request cmpl-16b373b55dd24da9967419a40b7ec409-0.
INFO 03-02 00:54:01 [logger.py:42] Received request cmpl-9226a0517f904276b375a16e2f529f3a-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:54:01 [async_llm.py:261] Added request cmpl-9226a0517f904276b375a16e2f529f3a-0.
INFO 03-02 00:54:02 [logger.py:42] Received request cmpl-00ffa74faa6e40ff930dce4b57b43e0a-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:54:02 [async_llm.py:261] Added request cmpl-00ffa74faa6e40ff930dce4b57b43e0a-0.
INFO 03-02 00:54:03 [logger.py:42] Received request cmpl-369d6eef15464c89b280e35cfe4d610c-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:54:03 [async_llm.py:261] Added request cmpl-369d6eef15464c89b280e35cfe4d610c-0.
INFO 03-02 00:54:05 [logger.py:42] Received request cmpl-1a5326cab2c74d0687c4b9a7dcd7c3fc-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:54:05 [async_llm.py:261] Added request cmpl-1a5326cab2c74d0687c4b9a7dcd7c3fc-0.
INFO 03-02 00:54:06 [logger.py:42] Received request cmpl-f832ff9756024c4089a43136b25a1c4d-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:54:06 [async_llm.py:261] Added request cmpl-f832ff9756024c4089a43136b25a1c4d-0.
INFO 03-02 00:54:07 [logger.py:42] Received request cmpl-d35d4d80d7b44b08a61a201ec8cbbc36-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:54:07 [async_llm.py:261] Added request cmpl-d35d4d80d7b44b08a61a201ec8cbbc36-0.
INFO 03-02 00:54:08 [loggers.py:116] Engine 000: Avg prompt throughput: 27.9 tokens/s, Avg generation throughput: 4.5 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.6%
INFO 03-02 00:54:08 [logger.py:42] Received request cmpl-fb60cc491cf141749368021f20f6071c-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:54:08 [async_llm.py:261] Added request cmpl-fb60cc491cf141749368021f20f6071c-0.
INFO 03-02 00:54:09 [logger.py:42] Received request cmpl-136ce7f050e8464993b8c7aa917228fd-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:54:09 [async_llm.py:261] Added request cmpl-136ce7f050e8464993b8c7aa917228fd-0.
INFO 03-02 00:54:10 [logger.py:42] Received request cmpl-519a309cceda4cc8b9f17c3c4cdaf4ec-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:54:10 [async_llm.py:261] Added request cmpl-519a309cceda4cc8b9f17c3c4cdaf4ec-0.
INFO 03-02 00:54:11 [logger.py:42] Received request cmpl-5286f8a66f33406f839cc5831b589ba7-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:54:11 [async_llm.py:261] Added request cmpl-5286f8a66f33406f839cc5831b589ba7-0.
INFO 03-02 00:54:13 [logger.py:42] Received request cmpl-ab2e88d5c9674b25ac8b778c0c496bc9-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:54:13 [async_llm.py:261] Added request cmpl-ab2e88d5c9674b25ac8b778c0c496bc9-0.
INFO 03-02 00:54:14 [logger.py:42] Received request cmpl-219ceea4d7624bd7b7c30ef53ae12662-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:54:14 [async_llm.py:261] Added request cmpl-219ceea4d7624bd7b7c30ef53ae12662-0.
INFO 03-02 00:54:15 [logger.py:42] Received request cmpl-1fef0128f06a47fa97b76a51d4ce9f54-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:54:15 [async_llm.py:261] Added request cmpl-1fef0128f06a47fa97b76a51d4ce9f54-0.
INFO 03-02 00:54:16 [logger.py:42] Received request cmpl-ed8a736cde254fa9ae4f067ba2f5963a-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:54:16 [async_llm.py:261] Added request cmpl-ed8a736cde254fa9ae4f067ba2f5963a-0.
INFO 03-02 00:54:17 [logger.py:42] Received request cmpl-45d1053b157c479b80acb77d5121d61d-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:54:17 [async_llm.py:261] Added request cmpl-45d1053b157c479b80acb77d5121d61d-0.
INFO 03-02 00:54:18 [loggers.py:116] Engine 000: Avg prompt throughput: 27.9 tokens/s, Avg generation throughput: 4.5 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.6%
INFO 03-02 00:54:18 [logger.py:42] Received request cmpl-ac567e41c69245c7afc83aabc3f56434-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:54:18 [async_llm.py:261] Added request cmpl-ac567e41c69245c7afc83aabc3f56434-0.
INFO 03-02 00:54:20 [logger.py:42] Received request cmpl-2a94e72d1f664592ae85b96841b90db1-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:54:20 [async_llm.py:261] Added request cmpl-2a94e72d1f664592ae85b96841b90db1-0.
INFO 03-02 00:54:21 [logger.py:42] Received request cmpl-327250d43c1d4b2fa3d1bdd34bffbc34-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:54:21 [async_llm.py:261] Added request cmpl-327250d43c1d4b2fa3d1bdd34bffbc34-0.
INFO 03-02 00:54:22 [logger.py:42] Received request cmpl-5f163e7a37b7400db912b79dadd339e6-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:54:22 [async_llm.py:261] Added request cmpl-5f163e7a37b7400db912b79dadd339e6-0.
INFO 03-02 00:54:23 [logger.py:42] Received request cmpl-d8b009e82d184ba8b6737ea18f4a97cd-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:54:23 [async_llm.py:261] Added request cmpl-d8b009e82d184ba8b6737ea18f4a97cd-0.
INFO 03-02 00:54:24 [logger.py:42] Received request cmpl-897d44163bdb4e15b9f12a16b57f704f-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:54:24 [async_llm.py:261] Added request cmpl-897d44163bdb4e15b9f12a16b57f704f-0.
INFO 03-02 00:54:25 [logger.py:42] Received request cmpl-25348fb267af49b8a6b9bdb0690a12f9-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:54:25 [async_llm.py:261] Added request cmpl-25348fb267af49b8a6b9bdb0690a12f9-0.
INFO 03-02 00:54:26 [logger.py:42] Received request cmpl-362a5aed5cee4d21942bef0c266206eb-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:54:26 [async_llm.py:261] Added request cmpl-362a5aed5cee4d21942bef0c266206eb-0.
INFO 03-02 00:54:28 [logger.py:42] Received request cmpl-c4ff98a846354645b545ed60d4f8df4a-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:54:28 [async_llm.py:261] Added request cmpl-c4ff98a846354645b545ed60d4f8df4a-0.
INFO 03-02 00:54:28 [loggers.py:116] Engine 000: Avg prompt throughput: 27.9 tokens/s, Avg generation throughput: 4.1 tokens/s, Running: 1 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.3%, Prefix cache hit rate: 51.6%
INFO 03-02 00:54:29 [logger.py:42] Received request cmpl-f75fb7bf58c9423aa2e483a506695225-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:54:29 [async_llm.py:261] Added request cmpl-f75fb7bf58c9423aa2e483a506695225-0.
INFO 03-02 00:54:30 [logger.py:42] Received request cmpl-38118c7726b94ddca17a84ca848b7fdb-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:54:30 [async_llm.py:261] Added request cmpl-38118c7726b94ddca17a84ca848b7fdb-0.
INFO 03-02 00:54:31 [logger.py:42] Received request cmpl-34e9c0faaeca47eabc63a27187430b87-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:54:31 [async_llm.py:261] Added request cmpl-34e9c0faaeca47eabc63a27187430b87-0.
INFO 03-02 00:54:32 [logger.py:42] Received request cmpl-dbea715350754b28b410cf6c27708e93-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:54:32 [async_llm.py:261] Added request cmpl-dbea715350754b28b410cf6c27708e93-0.
INFO 03-02 00:54:33 [logger.py:42] Received request cmpl-3b3c0d8befb94233aeeaf76866275fff-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:54:33 [async_llm.py:261] Added request cmpl-3b3c0d8befb94233aeeaf76866275fff-0.
INFO 03-02 00:54:34 [logger.py:42] Received request cmpl-1a478902bb224444ab48dce35ec6b29c-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:54:34 [async_llm.py:261] Added request cmpl-1a478902bb224444ab48dce35ec6b29c-0.
INFO 03-02 00:54:36 [logger.py:42] Received request cmpl-2e64f893355944d3b1c8674abf0c37e9-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:54:36 [async_llm.py:261] Added request cmpl-2e64f893355944d3b1c8674abf0c37e9-0.
INFO 03-02 00:54:37 [logger.py:42] Received request cmpl-9410099efaf740668dfc191f71f68860-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:54:37 [async_llm.py:261] Added request cmpl-9410099efaf740668dfc191f71f68860-0.
INFO 03-02 00:54:38 [loggers.py:116] Engine 000: Avg prompt throughput: 24.8 tokens/s, Avg generation throughput: 4.4 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.6%
INFO 03-02 00:54:38 [logger.py:42] Received request cmpl-dd570be707f3466ebdb35b7b9fb179b4-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:54:38 [async_llm.py:261] Added request cmpl-dd570be707f3466ebdb35b7b9fb179b4-0.
INFO 03-02 00:54:39 [logger.py:42] Received request cmpl-13d2890037094b3a8be9eafc4a8f76b6-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:54:39 [async_llm.py:261] Added request cmpl-13d2890037094b3a8be9eafc4a8f76b6-0.
INFO 03-02 00:54:40 [logger.py:42] Received request cmpl-e302b81c97c3402d923a9e1d30c41d37-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:54:40 [async_llm.py:261] Added request cmpl-e302b81c97c3402d923a9e1d30c41d37-0.
INFO 03-02 00:54:41 [logger.py:42] Received request cmpl-6c711616836d4efab5397fa9a6284732-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:54:41 [async_llm.py:261] Added request cmpl-6c711616836d4efab5397fa9a6284732-0.
INFO 03-02 00:54:43 [logger.py:42] Received request cmpl-1d27545c46894711bc774b50c164e3e4-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:54:43 [async_llm.py:261] Added request cmpl-1d27545c46894711bc774b50c164e3e4-0.
INFO 03-02 00:54:44 [logger.py:42] Received request cmpl-b7aeea0fd3ec471eb8034cc0296cbf61-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:54:44 [async_llm.py:261] Added request cmpl-b7aeea0fd3ec471eb8034cc0296cbf61-0.
INFO 03-02 00:54:45 [logger.py:42] Received request cmpl-75989ce22d6b4f3ead6b6581426c9670-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:54:45 [async_llm.py:261] Added request cmpl-75989ce22d6b4f3ead6b6581426c9670-0.
INFO 03-02 00:54:46 [logger.py:42] Received request cmpl-3a95b14d15364f3e90d2eb5cd1a2b9e6-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:54:46 [async_llm.py:261] Added request cmpl-3a95b14d15364f3e90d2eb5cd1a2b9e6-0.
INFO 03-02 00:54:47 [logger.py:42] Received request cmpl-93c659e69fc84523816f3ac3aa6f7c0b-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:54:47 [async_llm.py:261] Added request cmpl-93c659e69fc84523816f3ac3aa6f7c0b-0.
INFO 03-02 00:54:48 [loggers.py:116] Engine 000: Avg prompt throughput: 27.9 tokens/s, Avg generation throughput: 4.5 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.6%
INFO 03-02 00:54:48 [logger.py:42] Received request cmpl-397c5f63f9f043a1a973de1bfcd54147-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:54:48 [async_llm.py:261] Added request cmpl-397c5f63f9f043a1a973de1bfcd54147-0.
INFO 03-02 00:54:50 [logger.py:42] Received request cmpl-c9e519ff6715482088178a99335ff7d2-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:54:50 [async_llm.py:261] Added request cmpl-c9e519ff6715482088178a99335ff7d2-0.
INFO 03-02 00:54:51 [logger.py:42] Received request cmpl-ed90ed23e07c45b09bcbce21c2e6b842-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:54:51 [async_llm.py:261] Added request cmpl-ed90ed23e07c45b09bcbce21c2e6b842-0.
INFO 03-02 00:54:52 [logger.py:42] Received request cmpl-ca74ed5f684f44eca03a5a138c3c6608-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:54:52 [async_llm.py:261] Added request cmpl-ca74ed5f684f44eca03a5a138c3c6608-0.
INFO 03-02 00:54:53 [logger.py:42] Received request cmpl-f290669fe49040318a7f041d50a0e4c4-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:54:53 [async_llm.py:261] Added request cmpl-f290669fe49040318a7f041d50a0e4c4-0.
INFO 03-02 00:54:54 [logger.py:42] Received request cmpl-d654da25ef6b46ceaed01bfe7dea3400-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:54:54 [async_llm.py:261] Added request cmpl-d654da25ef6b46ceaed01bfe7dea3400-0.
INFO 03-02 00:54:55 [logger.py:42] Received request cmpl-e9692f9a278f49ca96958d2f00fc19ff-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:54:55 [async_llm.py:261] Added request cmpl-e9692f9a278f49ca96958d2f00fc19ff-0.
INFO 03-02 00:54:56 [logger.py:42] Received request cmpl-d92885b27f824a189650927b78204f43-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:54:56 [async_llm.py:261] Added request cmpl-d92885b27f824a189650927b78204f43-0.
INFO 03-02 00:54:58 [logger.py:42] Received request cmpl-8054b8044eca43e49a320a7e091993d9-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:54:58 [async_llm.py:261] Added request cmpl-8054b8044eca43e49a320a7e091993d9-0.
INFO 03-02 00:54:58 [loggers.py:116] Engine 000: Avg prompt throughput: 27.9 tokens/s, Avg generation throughput: 4.1 tokens/s, Running: 1 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.3%, Prefix cache hit rate: 51.6%
INFO 03-02 00:54:59 [logger.py:42] Received request cmpl-05c42769c85f49fd92f7f4cdbe565fd1-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:54:59 [async_llm.py:261] Added request cmpl-05c42769c85f49fd92f7f4cdbe565fd1-0.
INFO 03-02 00:55:00 [logger.py:42] Received request cmpl-35855fb78b3a459faf361e8284edf0af-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:55:00 [async_llm.py:261] Added request cmpl-35855fb78b3a459faf361e8284edf0af-0.
INFO 03-02 00:55:01 [logger.py:42] Received request cmpl-89ad680efc4b40ef814b1af66032cadf-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:55:01 [async_llm.py:261] Added request cmpl-89ad680efc4b40ef814b1af66032cadf-0.
INFO 03-02 00:55:02 [logger.py:42] Received request cmpl-ed4496df5a9746cf82cb321a8bce24f0-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:55:02 [async_llm.py:261] Added request cmpl-ed4496df5a9746cf82cb321a8bce24f0-0.
INFO 03-02 00:55:03 [logger.py:42] Received request cmpl-7992408bc6fa44a59bd3a8a02f4f0fd9-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:55:03 [async_llm.py:261] Added request cmpl-7992408bc6fa44a59bd3a8a02f4f0fd9-0.
INFO 03-02 00:55:05 [logger.py:42] Received request cmpl-085d6e33eb0c4829ae5a284bf4dccc51-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:55:05 [async_llm.py:261] Added request cmpl-085d6e33eb0c4829ae5a284bf4dccc51-0.
INFO 03-02 00:55:06 [logger.py:42] Received request cmpl-c400c033b1834566bf57344b928a15dd-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:55:06 [async_llm.py:261] Added request cmpl-c400c033b1834566bf57344b928a15dd-0.
INFO 03-02 00:55:07 [logger.py:42] Received request cmpl-8dab66edfd3b48a1906d94e4396e2186-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:55:07 [async_llm.py:261] Added request cmpl-8dab66edfd3b48a1906d94e4396e2186-0.
INFO 03-02 00:55:08 [loggers.py:116] Engine 000: Avg prompt throughput: 24.8 tokens/s, Avg generation throughput: 4.4 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.6%
INFO 03-02 00:55:08 [logger.py:42] Received request cmpl-374c91fab5234e06a4e091530deb7dd4-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:55:08 [async_llm.py:261] Added request cmpl-374c91fab5234e06a4e091530deb7dd4-0.
INFO 03-02 00:55:09 [logger.py:42] Received request cmpl-9c074bbc9ef74a138d2a8bbf60c016d6-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:55:09 [async_llm.py:261] Added request cmpl-9c074bbc9ef74a138d2a8bbf60c016d6-0.
INFO 03-02 00:55:10 [logger.py:42] Received request cmpl-202bb9192ac94804ae3109af4f46eb65-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:55:10 [async_llm.py:261] Added request cmpl-202bb9192ac94804ae3109af4f46eb65-0.
INFO 03-02 00:55:11 [logger.py:42] Received request cmpl-3bc2ac83967846f98ec39fb3bac65000-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:55:11 [async_llm.py:261] Added request cmpl-3bc2ac83967846f98ec39fb3bac65000-0.
INFO 03-02 00:55:13 [logger.py:42] Received request cmpl-6f065a5456934bf984f5a5781f09f139-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:55:13 [async_llm.py:261] Added request cmpl-6f065a5456934bf984f5a5781f09f139-0.
INFO 03-02 00:55:14 [logger.py:42] Received request cmpl-dfad706f92bc46aba12de6b39e24afbf-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:55:14 [async_llm.py:261] Added request cmpl-dfad706f92bc46aba12de6b39e24afbf-0.
INFO 03-02 00:55:15 [logger.py:42] Received request cmpl-8ae4398f0acf4eb8930d1e1b3c9b0884-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:55:15 [async_llm.py:261] Added request cmpl-8ae4398f0acf4eb8930d1e1b3c9b0884-0.
INFO 03-02 00:55:16 [logger.py:42] Received request cmpl-083691f614f040408e1a23c7765d8edb-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:55:16 [async_llm.py:261] Added request cmpl-083691f614f040408e1a23c7765d8edb-0.
INFO 03-02 00:55:17 [logger.py:42] Received request cmpl-8a62267884e74c6e9668fb8348af43a2-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:55:17 [async_llm.py:261] Added request cmpl-8a62267884e74c6e9668fb8348af43a2-0.
INFO 03-02 00:55:18 [loggers.py:116] Engine 000: Avg prompt throughput: 27.9 tokens/s, Avg generation throughput: 4.5 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.6%
INFO 03-02 00:55:18 [logger.py:42] Received request cmpl-dc4261df86a84c44bcfc5a5e73f34b98-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:55:18 [async_llm.py:261] Added request cmpl-dc4261df86a84c44bcfc5a5e73f34b98-0.
INFO 03-02 00:55:20 [logger.py:42] Received request cmpl-1a89399182a84a17acf5544ee9b0e78b-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:55:20 [async_llm.py:261] Added request cmpl-1a89399182a84a17acf5544ee9b0e78b-0.
INFO 03-02 00:55:21 [logger.py:42] Received request cmpl-a0ef9380b47a47b8803909bf2f7009ba-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:55:21 [async_llm.py:261] Added request cmpl-a0ef9380b47a47b8803909bf2f7009ba-0.
INFO 03-02 00:55:22 [logger.py:42] Received request cmpl-9822f352114d41dc9336d1e1d8411d09-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:55:22 [async_llm.py:261] Added request cmpl-9822f352114d41dc9336d1e1d8411d09-0.
INFO 03-02 00:55:23 [logger.py:42] Received request cmpl-4d5048033348424f8332929194eb52b9-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:55:23 [async_llm.py:261] Added request cmpl-4d5048033348424f8332929194eb52b9-0.
INFO 03-02 00:55:24 [logger.py:42] Received request cmpl-677b40f3e96247c098efe2a9cb3415f1-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:55:24 [async_llm.py:261] Added request cmpl-677b40f3e96247c098efe2a9cb3415f1-0.
INFO 03-02 00:55:25 [logger.py:42] Received request cmpl-9b4c5fd550fe4f718caea4ed66a9eb4f-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:55:25 [async_llm.py:261] Added request cmpl-9b4c5fd550fe4f718caea4ed66a9eb4f-0.
INFO 03-02 00:55:26 [logger.py:42] Received request cmpl-fee819efc68042b7a173f2c139d64604-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:55:26 [async_llm.py:261] Added request cmpl-fee819efc68042b7a173f2c139d64604-0.
INFO 03-02 00:55:28 [logger.py:42] Received request cmpl-4adae8c5bac24f61b2b97cd416b6062f-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:55:28 [async_llm.py:261] Added request cmpl-4adae8c5bac24f61b2b97cd416b6062f-0.
INFO 03-02 00:55:28 [loggers.py:116] Engine 000: Avg prompt throughput: 24.8 tokens/s, Avg generation throughput: 4.0 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.6%
INFO 03-02 00:55:29 [logger.py:42] Received request cmpl-7d6fb97d6e674cecbf10d3c420ee1824-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:55:29 [async_llm.py:261] Added request cmpl-7d6fb97d6e674cecbf10d3c420ee1824-0.
INFO 03-02 00:55:30 [logger.py:42] Received request cmpl-24a0fd4fd241464db81276905c4ae98c-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:55:30 [async_llm.py:261] Added request cmpl-24a0fd4fd241464db81276905c4ae98c-0.
INFO 03-02 00:55:31 [logger.py:42] Received request cmpl-70b7a89a537443fd87c91f6fa5744695-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:55:31 [async_llm.py:261] Added request cmpl-70b7a89a537443fd87c91f6fa5744695-0.
INFO 03-02 00:55:32 [logger.py:42] Received request cmpl-805d8ee30ec74476bef43f5023a6a042-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:55:32 [async_llm.py:261] Added request cmpl-805d8ee30ec74476bef43f5023a6a042-0.
INFO 03-02 00:55:33 [logger.py:42] Received request cmpl-c0a12d70e9cb4f33ba052d5029d645a0-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:55:33 [async_llm.py:261] Added request cmpl-c0a12d70e9cb4f33ba052d5029d645a0-0.
INFO 03-02 00:55:35 [logger.py:42] Received request cmpl-5f5f2bd1078c4a91b0ad54cf082ce965-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:55:35 [async_llm.py:261] Added request cmpl-5f5f2bd1078c4a91b0ad54cf082ce965-0.
INFO 03-02 00:55:36 [logger.py:42] Received request cmpl-4b30bdd57a90425ba0f8bc8417984e07-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:55:36 [async_llm.py:261] Added request cmpl-4b30bdd57a90425ba0f8bc8417984e07-0.
INFO 03-02 00:55:37 [logger.py:42] Received request cmpl-bd137f7917734931b31a2dba0a118757-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:55:37 [async_llm.py:261] Added request cmpl-bd137f7917734931b31a2dba0a118757-0.
INFO 03-02 00:55:38 [loggers.py:116] Engine 000: Avg prompt throughput: 27.9 tokens/s, Avg generation throughput: 4.5 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.6%
INFO 03-02 00:55:38 [logger.py:42] Received request cmpl-4177676294534294b24c1dffed163f98-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:55:38 [async_llm.py:261] Added request cmpl-4177676294534294b24c1dffed163f98-0.
INFO 03-02 00:55:39 [logger.py:42] Received request cmpl-f538ba60b7c44cc8ab6f02ddcebe4429-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:55:39 [async_llm.py:261] Added request cmpl-f538ba60b7c44cc8ab6f02ddcebe4429-0.
INFO 03-02 00:55:40 [logger.py:42] Received request cmpl-7ebfb2cb7e68429c8c300ddde7e0cadc-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:55:40 [async_llm.py:261] Added request cmpl-7ebfb2cb7e68429c8c300ddde7e0cadc-0.
INFO 03-02 00:55:41 [logger.py:42] Received request cmpl-9d0fc146864a482197abf743f211841e-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:55:41 [async_llm.py:261] Added request cmpl-9d0fc146864a482197abf743f211841e-0.
INFO 03-02 00:55:43 [logger.py:42] Received request cmpl-30ec3fc8004747a5bce314f7c3e2cd81-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:55:43 [async_llm.py:261] Added request cmpl-30ec3fc8004747a5bce314f7c3e2cd81-0.
INFO 03-02 00:55:44 [logger.py:42] Received request cmpl-eec3e128e2c341dca79a80c9eb698b3b-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:55:44 [async_llm.py:261] Added request cmpl-eec3e128e2c341dca79a80c9eb698b3b-0.
INFO 03-02 00:55:45 [logger.py:42] Received request cmpl-878fba5208f345f8ba189f3dffd7184c-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:55:45 [async_llm.py:261] Added request cmpl-878fba5208f345f8ba189f3dffd7184c-0.
INFO 03-02 00:55:46 [logger.py:42] Received request cmpl-e764f1332d9749088fda641b5b5de173-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:55:46 [async_llm.py:261] Added request cmpl-e764f1332d9749088fda641b5b5de173-0.
INFO 03-02 00:55:47 [logger.py:42] Received request cmpl-21831696c82649d588d01eca56b1be5f-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:55:47 [async_llm.py:261] Added request cmpl-21831696c82649d588d01eca56b1be5f-0.
INFO 03-02 00:55:48 [loggers.py:116] Engine 000: Avg prompt throughput: 27.9 tokens/s, Avg generation throughput: 4.5 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.6%
INFO 03-02 00:55:48 [logger.py:42] Received request cmpl-941173127e244283bb5c6b8d5cc73f0b-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:55:48 [async_llm.py:261] Added request cmpl-941173127e244283bb5c6b8d5cc73f0b-0.
INFO 03-02 00:55:50 [logger.py:42] Received request cmpl-e984e6879b1a4d1a91fc483a1439cdaa-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:55:50 [async_llm.py:261] Added request cmpl-e984e6879b1a4d1a91fc483a1439cdaa-0.
INFO 03-02 00:55:51 [logger.py:42] Received request cmpl-10a2b1bc17124041aac407d939ef7a1b-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:55:51 [async_llm.py:261] Added request cmpl-10a2b1bc17124041aac407d939ef7a1b-0.
INFO 03-02 00:55:52 [logger.py:42] Received request cmpl-9474558dcd184999b674b2f51cae3efc-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:55:52 [async_llm.py:261] Added request cmpl-9474558dcd184999b674b2f51cae3efc-0.
INFO 03-02 00:55:53 [logger.py:42] Received request cmpl-be661612797e4c628e5e9f9bd0ad9894-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:55:53 [async_llm.py:261] Added request cmpl-be661612797e4c628e5e9f9bd0ad9894-0.
INFO 03-02 00:55:54 [logger.py:42] Received request cmpl-43f8262deb84409ca93b0ec121b4c981-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:55:54 [async_llm.py:261] Added request cmpl-43f8262deb84409ca93b0ec121b4c981-0.
INFO 03-02 00:55:55 [logger.py:42] Received request cmpl-829f9011b68b4b4b85afe9e9424b5221-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:55:55 [async_llm.py:261] Added request cmpl-829f9011b68b4b4b85afe9e9424b5221-0.
INFO 03-02 00:55:56 [logger.py:42] Received request cmpl-2e299d414b274415a2b06fc3953a23c2-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:55:56 [async_llm.py:261] Added request cmpl-2e299d414b274415a2b06fc3953a23c2-0.
INFO 03-02 00:55:58 [logger.py:42] Received request cmpl-2e608d93fc0942ab966ef3cee45502d2-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:55:58 [async_llm.py:261] Added request cmpl-2e608d93fc0942ab966ef3cee45502d2-0.
INFO 03-02 00:55:58 [loggers.py:116] Engine 000: Avg prompt throughput: 27.9 tokens/s, Avg generation throughput: 4.1 tokens/s, Running: 1 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.3%, Prefix cache hit rate: 51.6%
INFO 03-02 00:55:59 [logger.py:42] Received request cmpl-c83788f3d2bb483ebf6b1e59c1ee016c-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:55:59 [async_llm.py:261] Added request cmpl-c83788f3d2bb483ebf6b1e59c1ee016c-0.
INFO 03-02 00:56:00 [logger.py:42] Received request cmpl-5173bcbe663a4b8f90d562e712a90cfb-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:56:00 [async_llm.py:261] Added request cmpl-5173bcbe663a4b8f90d562e712a90cfb-0.
INFO 03-02 00:56:01 [logger.py:42] Received request cmpl-561a3da3232d4fc9bff565c5bb532264-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:56:01 [async_llm.py:261] Added request cmpl-561a3da3232d4fc9bff565c5bb532264-0.
INFO 03-02 00:56:02 [logger.py:42] Received request cmpl-dd79669497a54578b9934fbc24fd6a4e-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:56:02 [async_llm.py:261] Added request cmpl-dd79669497a54578b9934fbc24fd6a4e-0.
INFO 03-02 00:56:03 [logger.py:42] Received request cmpl-00fb67ab811645fa8913a8f3ad39d1ba-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:56:03 [async_llm.py:261] Added request cmpl-00fb67ab811645fa8913a8f3ad39d1ba-0.
INFO 03-02 00:56:05 [logger.py:42] Received request cmpl-9f5fdc50b21c4f548cf7834c7d00671c-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:56:05 [async_llm.py:261] Added request cmpl-9f5fdc50b21c4f548cf7834c7d00671c-0.
INFO 03-02 00:56:06 [logger.py:42] Received request cmpl-a52f0f7a78b140b88fcc14421ae15c73-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:56:06 [async_llm.py:261] Added request cmpl-a52f0f7a78b140b88fcc14421ae15c73-0.
INFO 03-02 00:56:07 [logger.py:42] Received request cmpl-419211354fd448249252fe5c45c10daa-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:56:07 [async_llm.py:261] Added request cmpl-419211354fd448249252fe5c45c10daa-0.
INFO 03-02 00:56:08 [loggers.py:116] Engine 000: Avg prompt throughput: 24.8 tokens/s, Avg generation throughput: 4.4 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.6%
INFO 03-02 00:56:08 [logger.py:42] Received request cmpl-0c862eb936704e5eb813556ee0552c3f-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:56:08 [async_llm.py:261] Added request cmpl-0c862eb936704e5eb813556ee0552c3f-0.
INFO 03-02 00:56:09 [logger.py:42] Received request cmpl-48fd835decfb4281ae3a7d481e3f2d25-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:56:09 [async_llm.py:261] Added request cmpl-48fd835decfb4281ae3a7d481e3f2d25-0.
INFO 03-02 00:56:10 [logger.py:42] Received request cmpl-ab9a581233294cbdb030861212152048-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:56:10 [async_llm.py:261] Added request cmpl-ab9a581233294cbdb030861212152048-0.
INFO 03-02 00:56:11 [logger.py:42] Received request cmpl-762afe78d89048b99739ad4d489c0fe7-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:56:11 [async_llm.py:261] Added request cmpl-762afe78d89048b99739ad4d489c0fe7-0.
INFO 03-02 00:56:13 [logger.py:42] Received request cmpl-8f722ea3572747ea9c3452a0d89b866b-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:56:13 [async_llm.py:261] Added request cmpl-8f722ea3572747ea9c3452a0d89b866b-0.
INFO 03-02 00:56:14 [logger.py:42] Received request cmpl-56078c982f0e4d96a763ffd6114761a6-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:56:14 [async_llm.py:261] Added request cmpl-56078c982f0e4d96a763ffd6114761a6-0.
INFO 03-02 00:56:15 [logger.py:42] Received request cmpl-096ff93af74442b7974ebbb16571013f-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:56:15 [async_llm.py:261] Added request cmpl-096ff93af74442b7974ebbb16571013f-0.
INFO 03-02 00:56:16 [logger.py:42] Received request cmpl-89f7ac7808d5412b86675d05d44ae96a-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:56:16 [async_llm.py:261] Added request cmpl-89f7ac7808d5412b86675d05d44ae96a-0.
INFO 03-02 00:56:17 [logger.py:42] Received request cmpl-50d2bb4ea6bd4abbb1f7f169057d66f0-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:56:17 [async_llm.py:261] Added request cmpl-50d2bb4ea6bd4abbb1f7f169057d66f0-0.
INFO 03-02 00:56:18 [loggers.py:116] Engine 000: Avg prompt throughput: 27.9 tokens/s, Avg generation throughput: 4.5 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.6%
INFO 03-02 00:56:18 [logger.py:42] Received request cmpl-1f0a3675472548d7914bd260c97ff2d4-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:56:18 [async_llm.py:261] Added request cmpl-1f0a3675472548d7914bd260c97ff2d4-0.
INFO 03-02 00:56:20 [logger.py:42] Received request cmpl-bc0c1270c0ba44c9a70b55b263fad307-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:56:20 [async_llm.py:261] Added request cmpl-bc0c1270c0ba44c9a70b55b263fad307-0.
INFO 03-02 00:56:21 [logger.py:42] Received request cmpl-049f614b78314455a5b57c29a303f744-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:56:21 [async_llm.py:261] Added request cmpl-049f614b78314455a5b57c29a303f744-0.
INFO 03-02 00:56:22 [logger.py:42] Received request cmpl-cb5281ab9b83448fb5b3455609ea1bab-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:56:22 [async_llm.py:261] Added request cmpl-cb5281ab9b83448fb5b3455609ea1bab-0.
INFO 03-02 00:56:23 [logger.py:42] Received request cmpl-58df8142b01c40eda130159d00eaa6a1-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:56:23 [async_llm.py:261] Added request cmpl-58df8142b01c40eda130159d00eaa6a1-0.
INFO 03-02 00:56:24 [logger.py:42] Received request cmpl-f1518c4efeb742f682020d34262d318c-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:56:24 [async_llm.py:261] Added request cmpl-f1518c4efeb742f682020d34262d318c-0.
INFO 03-02 00:56:25 [logger.py:42] Received request cmpl-bf52a5147475401588ad49962d1d757f-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:56:25 [async_llm.py:261] Added request cmpl-bf52a5147475401588ad49962d1d757f-0.
INFO 03-02 00:56:26 [logger.py:42] Received request cmpl-17ae850d33204066bacf44278a2600a5-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:56:26 [async_llm.py:261] Added request cmpl-17ae850d33204066bacf44278a2600a5-0.
INFO 03-02 00:56:28 [logger.py:42] Received request cmpl-95762153b365482f825ba37e5029b165-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:56:28 [async_llm.py:261] Added request cmpl-95762153b365482f825ba37e5029b165-0.
INFO 03-02 00:56:28 [loggers.py:116] Engine 000: Avg prompt throughput: 27.9 tokens/s, Avg generation throughput: 4.1 tokens/s, Running: 1 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.3%, Prefix cache hit rate: 51.6%
INFO 03-02 00:56:29 [logger.py:42] Received request cmpl-0f80dc254a1147598d0b43989992eefa-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:56:29 [async_llm.py:261] Added request cmpl-0f80dc254a1147598d0b43989992eefa-0.
INFO 03-02 00:56:30 [logger.py:42] Received request cmpl-83a4d4a1e27a4808809b8754daac4a48-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:56:30 [async_llm.py:261] Added request cmpl-83a4d4a1e27a4808809b8754daac4a48-0.
INFO 03-02 00:56:31 [logger.py:42] Received request cmpl-9e535944f53749a79b95651281d69813-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:56:31 [async_llm.py:261] Added request cmpl-9e535944f53749a79b95651281d69813-0.
INFO 03-02 00:56:32 [logger.py:42] Received request cmpl-750c1bf6a4e3443db9825c7080c9cab9-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:56:32 [async_llm.py:261] Added request cmpl-750c1bf6a4e3443db9825c7080c9cab9-0.
INFO 03-02 00:56:33 [logger.py:42] Received request cmpl-c7ccae2c78d744238f3a28531e3df356-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:56:33 [async_llm.py:261] Added request cmpl-c7ccae2c78d744238f3a28531e3df356-0.
INFO 03-02 00:56:34 [logger.py:42] Received request cmpl-6c0705c28e8145c6968a6b08477f1af0-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:56:34 [async_llm.py:261] Added request cmpl-6c0705c28e8145c6968a6b08477f1af0-0.
INFO 03-02 00:56:36 [logger.py:42] Received request cmpl-6f09614b7eaa43259cc06561e12fd50a-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:56:36 [async_llm.py:261] Added request cmpl-6f09614b7eaa43259cc06561e12fd50a-0.
INFO 03-02 00:56:37 [logger.py:42] Received request cmpl-ff1f6b6a1fe94355ab6a35680fdb13e9-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:56:37 [async_llm.py:261] Added request cmpl-ff1f6b6a1fe94355ab6a35680fdb13e9-0.
INFO 03-02 00:56:38 [loggers.py:116] Engine 000: Avg prompt throughput: 24.8 tokens/s, Avg generation throughput: 4.4 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.6%
INFO 03-02 00:56:38 [logger.py:42] Received request cmpl-a76b8138f9964ed88658922fa6c08da0-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:56:38 [async_llm.py:261] Added request cmpl-a76b8138f9964ed88658922fa6c08da0-0.
INFO 03-02 00:56:39 [logger.py:42] Received request cmpl-f0a1b9c6eccd43969aedc58540d77663-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:56:39 [async_llm.py:261] Added request cmpl-f0a1b9c6eccd43969aedc58540d77663-0.
INFO 03-02 00:56:40 [logger.py:42] Received request cmpl-5f9b0b9e7c8b42f896b9dbb77b7313bc-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:56:40 [async_llm.py:261] Added request cmpl-5f9b0b9e7c8b42f896b9dbb77b7313bc-0.
INFO 03-02 00:56:41 [logger.py:42] Received request cmpl-5400356af769403f8a0061ee9686a1c9-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:56:41 [async_llm.py:261] Added request cmpl-5400356af769403f8a0061ee9686a1c9-0.
INFO 03-02 00:56:43 [logger.py:42] Received request cmpl-67856ecec0cb47beae89bd8fb50cd094-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:56:43 [async_llm.py:261] Added request cmpl-67856ecec0cb47beae89bd8fb50cd094-0.
INFO 03-02 00:56:44 [logger.py:42] Received request cmpl-3efdd70233a5485193bde29e488df825-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:56:44 [async_llm.py:261] Added request cmpl-3efdd70233a5485193bde29e488df825-0.
INFO 03-02 00:56:45 [logger.py:42] Received request cmpl-fffd76ed20644e9a975369afcccfcfc6-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:56:45 [async_llm.py:261] Added request cmpl-fffd76ed20644e9a975369afcccfcfc6-0.
INFO 03-02 00:56:46 [logger.py:42] Received request cmpl-dd10afa879df40039fa2a6f3eed525e5-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:56:46 [async_llm.py:261] Added request cmpl-dd10afa879df40039fa2a6f3eed525e5-0.
INFO 03-02 00:56:47 [logger.py:42] Received request cmpl-ca8fd826701744399d12e6606ae728b5-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:56:47 [async_llm.py:261] Added request cmpl-ca8fd826701744399d12e6606ae728b5-0.
INFO 03-02 00:56:48 [loggers.py:116] Engine 000: Avg prompt throughput: 27.9 tokens/s, Avg generation throughput: 4.5 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.6%
INFO 03-02 00:56:48 [logger.py:42] Received request cmpl-69e347c262a84951929e58c6756a4f6a-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:56:48 [async_llm.py:261] Added request cmpl-69e347c262a84951929e58c6756a4f6a-0.
INFO 03-02 00:56:49 [logger.py:42] Received request cmpl-274982f97a0e4ab8b518349f766cb936-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:56:49 [async_llm.py:261] Added request cmpl-274982f97a0e4ab8b518349f766cb936-0.
INFO 03-02 00:56:51 [logger.py:42] Received request cmpl-77b2fd179f344d64889044d7d715dca0-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:56:51 [async_llm.py:261] Added request cmpl-77b2fd179f344d64889044d7d715dca0-0.
INFO 03-02 00:56:52 [logger.py:42] Received request cmpl-8e834d96c1ce4909aaeb248695e779c1-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:56:52 [async_llm.py:261] Added request cmpl-8e834d96c1ce4909aaeb248695e779c1-0.
INFO 03-02 00:56:53 [logger.py:42] Received request cmpl-3e39d309f2f846f6852e185ca47bb771-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:56:53 [async_llm.py:261] Added request cmpl-3e39d309f2f846f6852e185ca47bb771-0.
INFO 03-02 00:56:54 [logger.py:42] Received request cmpl-3025cd56d1854d08abc1158826db7877-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:56:54 [async_llm.py:261] Added request cmpl-3025cd56d1854d08abc1158826db7877-0.
INFO 03-02 00:56:55 [logger.py:42] Received request cmpl-cbea4f5eb4c64835b9f3d0035f64f26d-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:56:55 [async_llm.py:261] Added request cmpl-cbea4f5eb4c64835b9f3d0035f64f26d-0.
INFO 03-02 00:56:56 [logger.py:42] Received request cmpl-8ef86709862645b0833121f5951281fc-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:56:56 [async_llm.py:261] Added request cmpl-8ef86709862645b0833121f5951281fc-0.
INFO 03-02 00:56:58 [logger.py:42] Received request cmpl-f93a398528ab46cd810e87e91819f7f9-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:56:58 [async_llm.py:261] Added request cmpl-f93a398528ab46cd810e87e91819f7f9-0.
INFO 03-02 00:56:58 [loggers.py:116] Engine 000: Avg prompt throughput: 27.9 tokens/s, Avg generation throughput: 4.2 tokens/s, Running: 1 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.3%, Prefix cache hit rate: 51.6%
INFO 03-02 00:56:59 [logger.py:42] Received request cmpl-d76b4b0e7db94e789f45c3cc2d78eb31-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:56:59 [async_llm.py:261] Added request cmpl-d76b4b0e7db94e789f45c3cc2d78eb31-0.
INFO 03-02 00:57:00 [logger.py:42] Received request cmpl-7ccc37432535418585e30ab25e083bc7-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:57:00 [async_llm.py:261] Added request cmpl-7ccc37432535418585e30ab25e083bc7-0.
INFO 03-02 00:57:01 [logger.py:42] Received request cmpl-030f420d9acf43ae99021866f509fb72-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:57:01 [async_llm.py:261] Added request cmpl-030f420d9acf43ae99021866f509fb72-0.
INFO 03-02 00:57:02 [logger.py:42] Received request cmpl-b5a3cfd1b7ca486a82d020f2c54cb321-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:57:02 [async_llm.py:261] Added request cmpl-b5a3cfd1b7ca486a82d020f2c54cb321-0.
INFO 03-02 00:57:03 [logger.py:42] Received request cmpl-9e635ad6d22549c0a86544e25227690f-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:57:03 [async_llm.py:261] Added request cmpl-9e635ad6d22549c0a86544e25227690f-0.
INFO 03-02 00:57:04 [logger.py:42] Received request cmpl-253cfaa074b448d495928a3a206b0d28-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:57:04 [async_llm.py:261] Added request cmpl-253cfaa074b448d495928a3a206b0d28-0.
INFO 03-02 00:57:06 [logger.py:42] Received request cmpl-0f57eddef68f4d389a17a3cf98c35c67-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:57:06 [async_llm.py:261] Added request cmpl-0f57eddef68f4d389a17a3cf98c35c67-0.
INFO 03-02 00:57:07 [logger.py:42] Received request cmpl-0fb4717bd12943a38c32e9df299ef095-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:57:07 [async_llm.py:261] Added request cmpl-0fb4717bd12943a38c32e9df299ef095-0.
INFO 03-02 00:57:08 [loggers.py:116] Engine 000: Avg prompt throughput: 24.8 tokens/s, Avg generation throughput: 4.3 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.6%
INFO 03-02 00:57:08 [logger.py:42] Received request cmpl-36fc347c2ebd41ce8b3bee3bf5812212-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:57:08 [async_llm.py:261] Added request cmpl-36fc347c2ebd41ce8b3bee3bf5812212-0.
INFO 03-02 00:57:09 [logger.py:42] Received request cmpl-1ddfaa3ed0ba4bf28ab226b71f25642a-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:57:09 [async_llm.py:261] Added request cmpl-1ddfaa3ed0ba4bf28ab226b71f25642a-0.
INFO 03-02 00:57:10 [logger.py:42] Received request cmpl-1ba9ee0b423849178af3f64483b10b3b-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:57:10 [async_llm.py:261] Added request cmpl-1ba9ee0b423849178af3f64483b10b3b-0.
INFO 03-02 00:57:11 [logger.py:42] Received request cmpl-5420288abd3745ad93d4c8e6f78cfdbe-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:57:11 [async_llm.py:261] Added request cmpl-5420288abd3745ad93d4c8e6f78cfdbe-0.
INFO 03-02 00:57:13 [logger.py:42] Received request cmpl-14fdf7c8b47942a5af87bf6d3993499b-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:57:13 [async_llm.py:261] Added request cmpl-14fdf7c8b47942a5af87bf6d3993499b-0.
INFO 03-02 00:57:14 [logger.py:42] Received request cmpl-dfbeca860d634e85a1b2faf2d637563f-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:57:14 [async_llm.py:261] Added request cmpl-dfbeca860d634e85a1b2faf2d637563f-0.
INFO 03-02 00:57:15 [logger.py:42] Received request cmpl-81a7f11bc5d84a0490b2afb6f2130899-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:57:15 [async_llm.py:261] Added request cmpl-81a7f11bc5d84a0490b2afb6f2130899-0.
INFO 03-02 00:57:16 [logger.py:42] Received request cmpl-1e57a30a8e114f35a7b8fea0f622c12f-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:57:16 [async_llm.py:261] Added request cmpl-1e57a30a8e114f35a7b8fea0f622c12f-0.
INFO 03-02 00:57:17 [logger.py:42] Received request cmpl-b6baf88f3c434edbafa4611486ac7b45-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:57:17 [async_llm.py:261] Added request cmpl-b6baf88f3c434edbafa4611486ac7b45-0.
INFO 03-02 00:57:18 [loggers.py:116] Engine 000: Avg prompt throughput: 27.9 tokens/s, Avg generation throughput: 4.5 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.6%
INFO 03-02 00:57:18 [logger.py:42] Received request cmpl-a07bdd50710741e6b58bc0d9fd9c6e4d-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:57:18 [async_llm.py:261] Added request cmpl-a07bdd50710741e6b58bc0d9fd9c6e4d-0.
INFO 03-02 00:57:19 [logger.py:42] Received request cmpl-1476abf5ecd24c7dbe73dfa7dec38509-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:57:19 [async_llm.py:261] Added request cmpl-1476abf5ecd24c7dbe73dfa7dec38509-0.
INFO 03-02 00:57:21 [logger.py:42] Received request cmpl-84052ddd7a694be197c99d760a62ff6e-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:57:21 [async_llm.py:261] Added request cmpl-84052ddd7a694be197c99d760a62ff6e-0.
INFO 03-02 00:57:22 [logger.py:42] Received request cmpl-e6c9e762589946c8acf01f3f4436ff7c-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:57:22 [async_llm.py:261] Added request cmpl-e6c9e762589946c8acf01f3f4436ff7c-0.
INFO 03-02 00:57:23 [logger.py:42] Received request cmpl-fd21a3b0e90d4114926a6c87445b725c-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:57:23 [async_llm.py:261] Added request cmpl-fd21a3b0e90d4114926a6c87445b725c-0.
INFO 03-02 00:57:24 [logger.py:42] Received request cmpl-057e366c4c23423fbaf9c22346eb2245-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:57:24 [async_llm.py:261] Added request cmpl-057e366c4c23423fbaf9c22346eb2245-0.
INFO 03-02 00:57:25 [logger.py:42] Received request cmpl-4980500fd60d43d6b3f322d7a1d035a0-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:57:25 [async_llm.py:261] Added request cmpl-4980500fd60d43d6b3f322d7a1d035a0-0.
INFO 03-02 00:57:26 [logger.py:42] Received request cmpl-1735ba1796684359aebf93fa75302838-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:57:26 [async_llm.py:261] Added request cmpl-1735ba1796684359aebf93fa75302838-0.
INFO 03-02 00:57:28 [logger.py:42] Received request cmpl-82face1576a2461b80dc38f128492699-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:57:28 [async_llm.py:261] Added request cmpl-82face1576a2461b80dc38f128492699-0.
INFO 03-02 00:57:28 [loggers.py:116] Engine 000: Avg prompt throughput: 27.9 tokens/s, Avg generation throughput: 4.1 tokens/s, Running: 1 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.3%, Prefix cache hit rate: 51.6%
INFO 03-02 00:57:29 [logger.py:42] Received request cmpl-09b8ffdb4adc41e686a505ea486571b0-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:57:29 [async_llm.py:261] Added request cmpl-09b8ffdb4adc41e686a505ea486571b0-0.
INFO 03-02 00:57:30 [logger.py:42] Received request cmpl-269be1fcbbfc40e5810ecd75982a627f-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:57:30 [async_llm.py:261] Added request cmpl-269be1fcbbfc40e5810ecd75982a627f-0.
INFO 03-02 00:57:31 [logger.py:42] Received request cmpl-c3ba097c89cd4809969d95b28e58d726-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:57:31 [async_llm.py:261] Added request cmpl-c3ba097c89cd4809969d95b28e58d726-0.
INFO 03-02 00:57:32 [logger.py:42] Received request cmpl-654384d884c34dbfb56de3bd6e3ee4c4-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:57:32 [async_llm.py:261] Added request cmpl-654384d884c34dbfb56de3bd6e3ee4c4-0.
INFO 03-02 00:57:33 [logger.py:42] Received request cmpl-2b0bb811ff8542d2816fe107aedc6738-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:57:33 [async_llm.py:261] Added request cmpl-2b0bb811ff8542d2816fe107aedc6738-0.
INFO 03-02 00:57:35 [logger.py:42] Received request cmpl-38614570d94b44c684a51e614b323d6a-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:57:35 [async_llm.py:261] Added request cmpl-38614570d94b44c684a51e614b323d6a-0.
INFO 03-02 00:57:36 [logger.py:42] Received request cmpl-5388cd62da0e4f6ca6b705b0ad0e9182-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:57:36 [async_llm.py:261] Added request cmpl-5388cd62da0e4f6ca6b705b0ad0e9182-0.
INFO 03-02 00:57:37 [logger.py:42] Received request cmpl-08e3fc5333454c758c5b017652d1f56d-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:57:37 [async_llm.py:261] Added request cmpl-08e3fc5333454c758c5b017652d1f56d-0.
INFO 03-02 00:57:38 [loggers.py:116] Engine 000: Avg prompt throughput: 24.8 tokens/s, Avg generation throughput: 4.4 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.6%
INFO 03-02 00:57:38 [logger.py:42] Received request cmpl-a9ea5f9e7ecf44efab52624bd024ecbc-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:57:38 [async_llm.py:261] Added request cmpl-a9ea5f9e7ecf44efab52624bd024ecbc-0.
INFO 03-02 00:57:39 [logger.py:42] Received request cmpl-f89ff763aed546baa4d3c3a1f0dacb01-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:57:39 [async_llm.py:261] Added request cmpl-f89ff763aed546baa4d3c3a1f0dacb01-0.
INFO 03-02 00:57:40 [logger.py:42] Received request cmpl-3bcef2454086478993bbf565c931d22f-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:57:40 [async_llm.py:261] Added request cmpl-3bcef2454086478993bbf565c931d22f-0.
INFO 03-02 00:57:41 [logger.py:42] Received request cmpl-e7a76843980147938b8c7373a1e93e6a-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:57:41 [async_llm.py:261] Added request cmpl-e7a76843980147938b8c7373a1e93e6a-0.
INFO 03-02 00:57:43 [logger.py:42] Received request cmpl-5c92403f9e4647778a50fe38b5610b9c-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:57:43 [async_llm.py:261] Added request cmpl-5c92403f9e4647778a50fe38b5610b9c-0.
INFO 03-02 00:57:44 [logger.py:42] Received request cmpl-2bac6f4f5b2c45d7a775b209fab05f71-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:57:44 [async_llm.py:261] Added request cmpl-2bac6f4f5b2c45d7a775b209fab05f71-0.
INFO 03-02 00:57:45 [logger.py:42] Received request cmpl-0dc8b15d375b43f6a5bb27562f562d95-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:57:45 [async_llm.py:261] Added request cmpl-0dc8b15d375b43f6a5bb27562f562d95-0.
INFO 03-02 00:57:46 [logger.py:42] Received request cmpl-6770669a758a4d0ba4edd1bf2c129ac6-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:57:46 [async_llm.py:261] Added request cmpl-6770669a758a4d0ba4edd1bf2c129ac6-0.
INFO 03-02 00:57:47 [logger.py:42] Received request cmpl-14d64d13f5af40f78d6570a2316b9e38-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:57:47 [async_llm.py:261] Added request cmpl-14d64d13f5af40f78d6570a2316b9e38-0.
INFO 03-02 00:57:48 [loggers.py:116] Engine 000: Avg prompt throughput: 27.9 tokens/s, Avg generation throughput: 4.5 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.6%
INFO 03-02 00:57:48 [logger.py:42] Received request cmpl-7f3aff9b26bf4ba198da168a0b037803-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:57:48 [async_llm.py:261] Added request cmpl-7f3aff9b26bf4ba198da168a0b037803-0.
INFO 03-02 00:57:50 [logger.py:42] Received request cmpl-d5eebfdd44ee4e85b8d1f05a671b219e-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:57:50 [async_llm.py:261] Added request cmpl-d5eebfdd44ee4e85b8d1f05a671b219e-0.
INFO 03-02 00:57:51 [logger.py:42] Received request cmpl-1071678b9f2243fa8baf06419e5d3d34-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:57:51 [async_llm.py:261] Added request cmpl-1071678b9f2243fa8baf06419e5d3d34-0.
INFO 03-02 00:57:52 [logger.py:42] Received request cmpl-5f5fa6b35fc649a38f28fd29013f7ae0-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:57:52 [async_llm.py:261] Added request cmpl-5f5fa6b35fc649a38f28fd29013f7ae0-0.
INFO 03-02 00:57:53 [logger.py:42] Received request cmpl-347374adc2f542cc8f5606bde232679d-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:57:53 [async_llm.py:261] Added request cmpl-347374adc2f542cc8f5606bde232679d-0.
INFO 03-02 00:57:54 [logger.py:42] Received request cmpl-7a4047ee65954c5cafdce37b7648fcb0-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:57:54 [async_llm.py:261] Added request cmpl-7a4047ee65954c5cafdce37b7648fcb0-0.
INFO 03-02 00:57:55 [logger.py:42] Received request cmpl-fbc9997517e24c08ad3133bc20800844-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:57:55 [async_llm.py:261] Added request cmpl-fbc9997517e24c08ad3133bc20800844-0.
INFO 03-02 00:57:56 [logger.py:42] Received request cmpl-b43e05947a5f460fb5d16ec3c436f82b-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:57:56 [async_llm.py:261] Added request cmpl-b43e05947a5f460fb5d16ec3c436f82b-0.
INFO 03-02 00:57:58 [logger.py:42] Received request cmpl-29ad79004b874f71bb2391fa7c1d4556-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:57:58 [async_llm.py:261] Added request cmpl-29ad79004b874f71bb2391fa7c1d4556-0.
INFO 03-02 00:57:58 [loggers.py:116] Engine 000: Avg prompt throughput: 27.9 tokens/s, Avg generation throughput: 4.1 tokens/s, Running: 1 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.3%, Prefix cache hit rate: 51.6%
INFO 03-02 00:57:59 [logger.py:42] Received request cmpl-d265fa8f79b24d9db8c8a38c56966733-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:57:59 [async_llm.py:261] Added request cmpl-d265fa8f79b24d9db8c8a38c56966733-0.
INFO 03-02 00:58:00 [logger.py:42] Received request cmpl-de1569c42a7f4474a0e3b8e7ae12b4c4-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:58:00 [async_llm.py:261] Added request cmpl-de1569c42a7f4474a0e3b8e7ae12b4c4-0.
INFO 03-02 00:58:01 [logger.py:42] Received request cmpl-756d7d04524e43e1971af4fc2417c1c0-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:58:01 [async_llm.py:261] Added request cmpl-756d7d04524e43e1971af4fc2417c1c0-0.
INFO 03-02 00:58:02 [logger.py:42] Received request cmpl-9344a1850040406883cd5b8d260c5093-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:58:02 [async_llm.py:261] Added request cmpl-9344a1850040406883cd5b8d260c5093-0.
INFO 03-02 00:58:03 [logger.py:42] Received request cmpl-dc8e31315e184567abaa0496942ad209-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:58:03 [async_llm.py:261] Added request cmpl-dc8e31315e184567abaa0496942ad209-0.
INFO 03-02 00:58:05 [logger.py:42] Received request cmpl-88f0e64b94d84755b5a502f267d01b19-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:58:05 [async_llm.py:261] Added request cmpl-88f0e64b94d84755b5a502f267d01b19-0.
INFO 03-02 00:58:06 [logger.py:42] Received request cmpl-af739241b7e9415ca1a762d66579b762-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:58:06 [async_llm.py:261] Added request cmpl-af739241b7e9415ca1a762d66579b762-0.
INFO 03-02 00:58:07 [logger.py:42] Received request cmpl-b41ee9f6fa894921bc1cf632b3d7e0a6-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:58:07 [async_llm.py:261] Added request cmpl-b41ee9f6fa894921bc1cf632b3d7e0a6-0.
INFO 03-02 00:58:08 [loggers.py:116] Engine 000: Avg prompt throughput: 24.8 tokens/s, Avg generation throughput: 4.4 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.6%
INFO 03-02 00:58:08 [logger.py:42] Received request cmpl-3ed5cd698bae42848c6961a68547d576-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:58:08 [async_llm.py:261] Added request cmpl-3ed5cd698bae42848c6961a68547d576-0.
INFO 03-02 00:58:09 [logger.py:42] Received request cmpl-d953e29e09124c3ab0c8afec89e2ce53-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:58:09 [async_llm.py:261] Added request cmpl-d953e29e09124c3ab0c8afec89e2ce53-0.
INFO 03-02 00:58:10 [logger.py:42] Received request cmpl-98e101843f134615811df715d7d7d961-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:58:10 [async_llm.py:261] Added request cmpl-98e101843f134615811df715d7d7d961-0.
INFO 03-02 00:58:11 [logger.py:42] Received request cmpl-1c1c0b291b634d818d631431e748a32c-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:58:11 [async_llm.py:261] Added request cmpl-1c1c0b291b634d818d631431e748a32c-0.
INFO 03-02 00:58:13 [logger.py:42] Received request cmpl-5ccd2022189a42ecaef1311927c51ca2-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:58:13 [async_llm.py:261] Added request cmpl-5ccd2022189a42ecaef1311927c51ca2-0.
INFO 03-02 00:58:14 [logger.py:42] Received request cmpl-8ac27ffa994646d3963bb68765c0a869-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:58:14 [async_llm.py:261] Added request cmpl-8ac27ffa994646d3963bb68765c0a869-0.
INFO 03-02 00:58:15 [logger.py:42] Received request cmpl-5ab37b68868e4682b16ca8d05847f7d0-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:58:15 [async_llm.py:261] Added request cmpl-5ab37b68868e4682b16ca8d05847f7d0-0.
INFO 03-02 00:58:16 [logger.py:42] Received request cmpl-c484eb1f79434291b945ca5f14fd13a1-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:58:16 [async_llm.py:261] Added request cmpl-c484eb1f79434291b945ca5f14fd13a1-0.
INFO 03-02 00:58:17 [logger.py:42] Received request cmpl-e5e88951c769401fbb1d5d69f5891762-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:58:17 [async_llm.py:261] Added request cmpl-e5e88951c769401fbb1d5d69f5891762-0.
INFO 03-02 00:58:18 [loggers.py:116] Engine 000: Avg prompt throughput: 27.9 tokens/s, Avg generation throughput: 4.5 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.6%
INFO 03-02 00:58:18 [logger.py:42] Received request cmpl-e0fad546b4a3473389b02a1cdb673b17-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:58:18 [async_llm.py:261] Added request cmpl-e0fad546b4a3473389b02a1cdb673b17-0.
INFO 03-02 00:58:20 [logger.py:42] Received request cmpl-3ba7f183bc4d452297fdb0fa544ad01c-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:58:20 [async_llm.py:261] Added request cmpl-3ba7f183bc4d452297fdb0fa544ad01c-0.
INFO 03-02 00:58:21 [logger.py:42] Received request cmpl-c99eed91efb449e784dae6b6f2585bf7-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:58:21 [async_llm.py:261] Added request cmpl-c99eed91efb449e784dae6b6f2585bf7-0.
INFO 03-02 00:58:22 [logger.py:42] Received request cmpl-f8892d8348ef42a3b9042a2a3e02ac4b-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:58:22 [async_llm.py:261] Added request cmpl-f8892d8348ef42a3b9042a2a3e02ac4b-0.
INFO 03-02 00:58:23 [logger.py:42] Received request cmpl-1a89b1c3c1ee46e79c2011799ee38d47-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:58:23 [async_llm.py:261] Added request cmpl-1a89b1c3c1ee46e79c2011799ee38d47-0.
INFO 03-02 00:58:24 [logger.py:42] Received request cmpl-a5308ceadad74acaac5b585224296b12-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:58:24 [async_llm.py:261] Added request cmpl-a5308ceadad74acaac5b585224296b12-0.
INFO 03-02 00:58:25 [logger.py:42] Received request cmpl-644ec96e96df47e79b461710290ac1d4-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:58:25 [async_llm.py:261] Added request cmpl-644ec96e96df47e79b461710290ac1d4-0.
INFO 03-02 00:58:26 [logger.py:42] Received request cmpl-be00129337734f3897c0287e20266e02-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:58:26 [async_llm.py:261] Added request cmpl-be00129337734f3897c0287e20266e02-0.
INFO 03-02 00:58:28 [logger.py:42] Received request cmpl-d171116e4f3c4de2a70624f6bcfb895d-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:58:28 [async_llm.py:261] Added request cmpl-d171116e4f3c4de2a70624f6bcfb895d-0.
INFO 03-02 00:58:28 [loggers.py:116] Engine 000: Avg prompt throughput: 27.9 tokens/s, Avg generation throughput: 4.1 tokens/s, Running: 1 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.3%, Prefix cache hit rate: 51.6%
INFO 03-02 00:58:29 [logger.py:42] Received request cmpl-b51696d050e44df28e1d43e45fe26d9f-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:58:29 [async_llm.py:261] Added request cmpl-b51696d050e44df28e1d43e45fe26d9f-0.
INFO 03-02 00:58:30 [logger.py:42] Received request cmpl-ca4b7bf3b4a9435cb107140c9057ee58-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:58:30 [async_llm.py:261] Added request cmpl-ca4b7bf3b4a9435cb107140c9057ee58-0.
INFO 03-02 00:58:31 [logger.py:42] Received request cmpl-7240e17671e1402d8dcc487dcfb9bb1f-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:58:31 [async_llm.py:261] Added request cmpl-7240e17671e1402d8dcc487dcfb9bb1f-0.
INFO 03-02 00:58:32 [logger.py:42] Received request cmpl-2f9e0bc15e3b4c38a368c54067e4553e-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:58:32 [async_llm.py:261] Added request cmpl-2f9e0bc15e3b4c38a368c54067e4553e-0.
INFO 03-02 00:58:33 [logger.py:42] Received request cmpl-e2ccf60395eb4a2db9c01626a7b1acee-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:58:33 [async_llm.py:261] Added request cmpl-e2ccf60395eb4a2db9c01626a7b1acee-0.
INFO 03-02 00:58:35 [logger.py:42] Received request cmpl-6c3553da542847d786a628cdc21c2906-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:58:35 [async_llm.py:261] Added request cmpl-6c3553da542847d786a628cdc21c2906-0.
INFO 03-02 00:58:36 [logger.py:42] Received request cmpl-34b4fcd48971491283f0131ee3a4a5a2-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:58:36 [async_llm.py:261] Added request cmpl-34b4fcd48971491283f0131ee3a4a5a2-0.
INFO 03-02 00:58:37 [logger.py:42] Received request cmpl-918ddee8c918490b919ac4b34d1ec981-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:58:37 [async_llm.py:261] Added request cmpl-918ddee8c918490b919ac4b34d1ec981-0.
INFO 03-02 00:58:38 [loggers.py:116] Engine 000: Avg prompt throughput: 24.8 tokens/s, Avg generation throughput: 4.4 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.6%
INFO 03-02 00:58:38 [logger.py:42] Received request cmpl-237675995c8245c7a0d0fdf4d783fc90-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:58:38 [async_llm.py:261] Added request cmpl-237675995c8245c7a0d0fdf4d783fc90-0.
INFO 03-02 00:58:39 [logger.py:42] Received request cmpl-54c9840931144909af8c0817bdab4ab8-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:58:39 [async_llm.py:261] Added request cmpl-54c9840931144909af8c0817bdab4ab8-0.
INFO 03-02 00:58:40 [logger.py:42] Received request cmpl-0bbeaf62b60b4445ab1754adf2d69246-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:58:40 [async_llm.py:261] Added request cmpl-0bbeaf62b60b4445ab1754adf2d69246-0.
INFO 03-02 00:58:41 [logger.py:42] Received request cmpl-306967aa244f48f9a0fa7b8a63ea6af9-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:58:41 [async_llm.py:261] Added request cmpl-306967aa244f48f9a0fa7b8a63ea6af9-0.
INFO 03-02 00:58:43 [logger.py:42] Received request cmpl-014f4ce65efe4bd38b8b0cb7acdf2222-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:58:43 [async_llm.py:261] Added request cmpl-014f4ce65efe4bd38b8b0cb7acdf2222-0.
INFO 03-02 00:58:44 [logger.py:42] Received request cmpl-9f784313e269451db87b022e0e4d31c2-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:58:44 [async_llm.py:261] Added request cmpl-9f784313e269451db87b022e0e4d31c2-0.
INFO 03-02 00:58:45 [logger.py:42] Received request cmpl-c1c5cf046a3b43ea92b10b1e7cd64cd2-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:58:45 [async_llm.py:261] Added request cmpl-c1c5cf046a3b43ea92b10b1e7cd64cd2-0.
INFO 03-02 00:58:46 [logger.py:42] Received request cmpl-79823c331cf346959ad41a6c523113ab-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:58:46 [async_llm.py:261] Added request cmpl-79823c331cf346959ad41a6c523113ab-0.
INFO 03-02 00:58:47 [logger.py:42] Received request cmpl-7bcf77ff3cc64cc28728f88597f6965a-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:58:47 [async_llm.py:261] Added request cmpl-7bcf77ff3cc64cc28728f88597f6965a-0.
INFO 03-02 00:58:48 [loggers.py:116] Engine 000: Avg prompt throughput: 27.9 tokens/s, Avg generation throughput: 4.5 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.6%
INFO 03-02 00:58:48 [logger.py:42] Received request cmpl-219b0b8c29394a07b867801fadf19117-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:58:48 [async_llm.py:261] Added request cmpl-219b0b8c29394a07b867801fadf19117-0.
INFO 03-02 00:58:50 [logger.py:42] Received request cmpl-0ed65129d2a44b8f9845b48a96a61083-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:58:50 [async_llm.py:261] Added request cmpl-0ed65129d2a44b8f9845b48a96a61083-0.
INFO 03-02 00:58:51 [logger.py:42] Received request cmpl-1bfc657023684985ba180b11e0706f7e-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:58:51 [async_llm.py:261] Added request cmpl-1bfc657023684985ba180b11e0706f7e-0.
INFO 03-02 00:58:52 [logger.py:42] Received request cmpl-57131cd3d1e74811b42aa6531e0d7dc0-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:58:52 [async_llm.py:261] Added request cmpl-57131cd3d1e74811b42aa6531e0d7dc0-0.
INFO 03-02 00:58:53 [logger.py:42] Received request cmpl-8dea185e5e654893a52c8f6ad400769c-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:58:53 [async_llm.py:261] Added request cmpl-8dea185e5e654893a52c8f6ad400769c-0.
INFO 03-02 00:58:54 [logger.py:42] Received request cmpl-f1d95d6afa7144d19f92bcaa9b4ae1db-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:58:54 [async_llm.py:261] Added request cmpl-f1d95d6afa7144d19f92bcaa9b4ae1db-0.
INFO 03-02 00:58:55 [logger.py:42] Received request cmpl-87159dcdde9d4574b39ef33f4d42c4d1-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:58:55 [async_llm.py:261] Added request cmpl-87159dcdde9d4574b39ef33f4d42c4d1-0.
INFO 03-02 00:58:56 [logger.py:42] Received request cmpl-e5d55bf475414447a9385b0019dcbe97-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:58:56 [async_llm.py:261] Added request cmpl-e5d55bf475414447a9385b0019dcbe97-0.
INFO 03-02 00:58:58 [logger.py:42] Received request cmpl-e4297571fa9842a8af88030c81f932ae-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:58:58 [async_llm.py:261] Added request cmpl-e4297571fa9842a8af88030c81f932ae-0.
INFO 03-02 00:58:58 [loggers.py:116] Engine 000: Avg prompt throughput: 24.8 tokens/s, Avg generation throughput: 4.0 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.6%
INFO 03-02 00:58:59 [logger.py:42] Received request cmpl-7b29927d62dd41e2bc2bbe99595ef634-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:58:59 [async_llm.py:261] Added request cmpl-7b29927d62dd41e2bc2bbe99595ef634-0.
INFO 03-02 00:59:00 [logger.py:42] Received request cmpl-6e947b53358444a499cedb31bd67b746-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:59:00 [async_llm.py:261] Added request cmpl-6e947b53358444a499cedb31bd67b746-0.
INFO 03-02 00:59:01 [logger.py:42] Received request cmpl-59007664576042a288fae53efe4d414a-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:59:01 [async_llm.py:261] Added request cmpl-59007664576042a288fae53efe4d414a-0.
INFO 03-02 00:59:02 [logger.py:42] Received request cmpl-63f5f92bb4874ca088f7ca6ae0e77e96-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:59:02 [async_llm.py:261] Added request cmpl-63f5f92bb4874ca088f7ca6ae0e77e96-0.
INFO 03-02 00:59:03 [logger.py:42] Received request cmpl-4beada1507614606b4d30c3c468e2f6b-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:59:03 [async_llm.py:261] Added request cmpl-4beada1507614606b4d30c3c468e2f6b-0.
INFO 03-02 00:59:05 [logger.py:42] Received request cmpl-beb10a3a711a47bb8eb512d0bf5784d7-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:59:05 [async_llm.py:261] Added request cmpl-beb10a3a711a47bb8eb512d0bf5784d7-0.
INFO 03-02 00:59:06 [logger.py:42] Received request cmpl-f005ca31164e40dba759d713a32733e9-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:59:06 [async_llm.py:261] Added request cmpl-f005ca31164e40dba759d713a32733e9-0.
INFO 03-02 00:59:07 [logger.py:42] Received request cmpl-9a05e4825bd045dbafd5330f0a046af4-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:59:07 [async_llm.py:261] Added request cmpl-9a05e4825bd045dbafd5330f0a046af4-0.
INFO 03-02 00:59:08 [loggers.py:116] Engine 000: Avg prompt throughput: 27.9 tokens/s, Avg generation throughput: 4.5 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.6%
INFO 03-02 00:59:08 [logger.py:42] Received request cmpl-145d5ab48500438cacc43932686db80d-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:59:08 [async_llm.py:261] Added request cmpl-145d5ab48500438cacc43932686db80d-0.
INFO 03-02 00:59:09 [logger.py:42] Received request cmpl-938445ee60ff4c38a7ca60f59b5adfb0-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:59:09 [async_llm.py:261] Added request cmpl-938445ee60ff4c38a7ca60f59b5adfb0-0.
INFO 03-02 00:59:10 [logger.py:42] Received request cmpl-ba841c76e5de434593375e596b28ce59-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:59:10 [async_llm.py:261] Added request cmpl-ba841c76e5de434593375e596b28ce59-0.
INFO 03-02 00:59:11 [logger.py:42] Received request cmpl-2a02a1d0848743dcb546003cfce44fa1-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:59:11 [async_llm.py:261] Added request cmpl-2a02a1d0848743dcb546003cfce44fa1-0.
INFO 03-02 00:59:13 [logger.py:42] Received request cmpl-f389ba30b64c443e8ae8f4f644473aea-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:59:13 [async_llm.py:261] Added request cmpl-f389ba30b64c443e8ae8f4f644473aea-0.
INFO 03-02 00:59:14 [logger.py:42] Received request cmpl-301da7338d914bafafdc7bb6584a51ae-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:59:14 [async_llm.py:261] Added request cmpl-301da7338d914bafafdc7bb6584a51ae-0.
INFO 03-02 00:59:15 [logger.py:42] Received request cmpl-a42eb967c70d4308a447d942c0cb33cc-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:59:15 [async_llm.py:261] Added request cmpl-a42eb967c70d4308a447d942c0cb33cc-0.
INFO 03-02 00:59:16 [logger.py:42] Received request cmpl-d4906ca96a1d4605a5ad00a3d8e5e0dc-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:59:16 [async_llm.py:261] Added request cmpl-d4906ca96a1d4605a5ad00a3d8e5e0dc-0.
INFO 03-02 00:59:17 [logger.py:42] Received request cmpl-306824c6cce747ea94dfe37ab186a78b-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:59:17 [async_llm.py:261] Added request cmpl-306824c6cce747ea94dfe37ab186a78b-0.
INFO 03-02 00:59:18 [loggers.py:116] Engine 000: Avg prompt throughput: 27.9 tokens/s, Avg generation throughput: 4.5 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.6%
INFO 03-02 00:59:18 [logger.py:42] Received request cmpl-1427d5715d9b46829da204b55b7bbf55-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:59:18 [async_llm.py:261] Added request cmpl-1427d5715d9b46829da204b55b7bbf55-0.
INFO 03-02 00:59:20 [logger.py:42] Received request cmpl-0c257cfa78674c4fbbc1dc67ce71ed39-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:59:20 [async_llm.py:261] Added request cmpl-0c257cfa78674c4fbbc1dc67ce71ed39-0.
INFO 03-02 00:59:21 [logger.py:42] Received request cmpl-38b70dccc9ac42c99d2b32203a8ff714-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:59:21 [async_llm.py:261] Added request cmpl-38b70dccc9ac42c99d2b32203a8ff714-0.
INFO 03-02 00:59:22 [logger.py:42] Received request cmpl-1db8cb6e43074fb2b6c77749833adec7-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:59:22 [async_llm.py:261] Added request cmpl-1db8cb6e43074fb2b6c77749833adec7-0.
INFO 03-02 00:59:23 [logger.py:42] Received request cmpl-f5f320c3f724426fb23e92c0c2dbda72-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:59:23 [async_llm.py:261] Added request cmpl-f5f320c3f724426fb23e92c0c2dbda72-0.
INFO 03-02 00:59:24 [logger.py:42] Received request cmpl-c2f6b25a868841d4ae787607e074b209-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:59:24 [async_llm.py:261] Added request cmpl-c2f6b25a868841d4ae787607e074b209-0.
INFO 03-02 00:59:25 [logger.py:42] Received request cmpl-f0dd5a37131b48bc85c8cc734606d1f0-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:59:25 [async_llm.py:261] Added request cmpl-f0dd5a37131b48bc85c8cc734606d1f0-0.
INFO 03-02 00:59:26 [logger.py:42] Received request cmpl-15858464557741fdbacab21d1dbc1c67-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:59:26 [async_llm.py:261] Added request cmpl-15858464557741fdbacab21d1dbc1c67-0.
INFO 03-02 00:59:28 [logger.py:42] Received request cmpl-99de8fc583374d06847280d57ad0968c-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:59:28 [async_llm.py:261] Added request cmpl-99de8fc583374d06847280d57ad0968c-0.
INFO 03-02 00:59:28 [loggers.py:116] Engine 000: Avg prompt throughput: 24.8 tokens/s, Avg generation throughput: 4.0 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.6%
INFO 03-02 00:59:29 [logger.py:42] Received request cmpl-f4e8115685a0472cb8638d51d17e3a05-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:59:29 [async_llm.py:261] Added request cmpl-f4e8115685a0472cb8638d51d17e3a05-0.
INFO 03-02 00:59:30 [logger.py:42] Received request cmpl-9795d5a8c51c4ed58f676f929ddc0f4f-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:59:30 [async_llm.py:261] Added request cmpl-9795d5a8c51c4ed58f676f929ddc0f4f-0.
INFO 03-02 00:59:31 [logger.py:42] Received request cmpl-01765c952cbb4c7097f7e0c4500618fd-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:59:31 [async_llm.py:261] Added request cmpl-01765c952cbb4c7097f7e0c4500618fd-0.
INFO 03-02 00:59:32 [logger.py:42] Received request cmpl-8868aed9fa444ee0bbfa1c5fec2fa0da-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:59:32 [async_llm.py:261] Added request cmpl-8868aed9fa444ee0bbfa1c5fec2fa0da-0.
INFO 03-02 00:59:33 [logger.py:42] Received request cmpl-a8d4ed94cad146c1b9a8132bb804547a-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:59:33 [async_llm.py:261] Added request cmpl-a8d4ed94cad146c1b9a8132bb804547a-0.
INFO 03-02 00:59:35 [logger.py:42] Received request cmpl-c9fd80a9783d482ea941b0f87afec4f2-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:59:35 [async_llm.py:261] Added request cmpl-c9fd80a9783d482ea941b0f87afec4f2-0.
INFO 03-02 00:59:36 [logger.py:42] Received request cmpl-5cdc83eeaa954058a2f427ffb236d55f-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:59:36 [async_llm.py:261] Added request cmpl-5cdc83eeaa954058a2f427ffb236d55f-0.
INFO 03-02 00:59:37 [logger.py:42] Received request cmpl-a787e99808644f6cbed8040b5609aa2c-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:59:37 [async_llm.py:261] Added request cmpl-a787e99808644f6cbed8040b5609aa2c-0.
INFO 03-02 00:59:38 [loggers.py:116] Engine 000: Avg prompt throughput: 27.9 tokens/s, Avg generation throughput: 4.5 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.6%
INFO 03-02 00:59:38 [logger.py:42] Received request cmpl-59bf318b09fb46baa126711ec79a8304-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:59:38 [async_llm.py:261] Added request cmpl-59bf318b09fb46baa126711ec79a8304-0.
INFO 03-02 00:59:39 [logger.py:42] Received request cmpl-6b09966ce5554b1f820c05e35e3e386e-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:59:39 [async_llm.py:261] Added request cmpl-6b09966ce5554b1f820c05e35e3e386e-0.
INFO 03-02 00:59:40 [logger.py:42] Received request cmpl-1d39ff1588d84009aaa52ef7508e8da9-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:59:40 [async_llm.py:261] Added request cmpl-1d39ff1588d84009aaa52ef7508e8da9-0.
INFO 03-02 00:59:41 [logger.py:42] Received request cmpl-4b519d22b18e45e38f20184f801c2744-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:59:41 [async_llm.py:261] Added request cmpl-4b519d22b18e45e38f20184f801c2744-0.
INFO 03-02 00:59:43 [logger.py:42] Received request cmpl-a50636ff0c1b4a439696f7a164f7aaff-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:59:43 [async_llm.py:261] Added request cmpl-a50636ff0c1b4a439696f7a164f7aaff-0.
INFO 03-02 00:59:44 [logger.py:42] Received request cmpl-72d28ec554214798a1452034edcfbbbf-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:59:44 [async_llm.py:261] Added request cmpl-72d28ec554214798a1452034edcfbbbf-0.
INFO 03-02 00:59:45 [logger.py:42] Received request cmpl-7eb82940713946bfbf3001cdbf574ce0-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:59:45 [async_llm.py:261] Added request cmpl-7eb82940713946bfbf3001cdbf574ce0-0.
INFO 03-02 00:59:46 [logger.py:42] Received request cmpl-10defc81fe15421abfddd9de5b4591bc-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:59:46 [async_llm.py:261] Added request cmpl-10defc81fe15421abfddd9de5b4591bc-0.
INFO 03-02 00:59:47 [logger.py:42] Received request cmpl-61247b3648784f01996dd7ae43f427b1-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:59:47 [async_llm.py:261] Added request cmpl-61247b3648784f01996dd7ae43f427b1-0.
INFO 03-02 00:59:48 [loggers.py:116] Engine 000: Avg prompt throughput: 27.9 tokens/s, Avg generation throughput: 4.5 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.6%
INFO 03-02 00:59:48 [logger.py:42] Received request cmpl-fa9b4b6f70734824aed5c249ab845dc7-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:59:48 [async_llm.py:261] Added request cmpl-fa9b4b6f70734824aed5c249ab845dc7-0.
INFO 03-02 00:59:50 [logger.py:42] Received request cmpl-680ec2ffe9c640c1801187458e011f2f-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:59:50 [async_llm.py:261] Added request cmpl-680ec2ffe9c640c1801187458e011f2f-0.
INFO 03-02 00:59:51 [logger.py:42] Received request cmpl-562fc1141a9d4ba3bce061590a398e7f-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:59:51 [async_llm.py:261] Added request cmpl-562fc1141a9d4ba3bce061590a398e7f-0.
INFO 03-02 00:59:52 [logger.py:42] Received request cmpl-7ed5a7ac84054918bc0032adf2d41034-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:59:52 [async_llm.py:261] Added request cmpl-7ed5a7ac84054918bc0032adf2d41034-0.
INFO 03-02 00:59:53 [logger.py:42] Received request cmpl-4227210f75d84d268d86bbf163b687d3-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:59:53 [async_llm.py:261] Added request cmpl-4227210f75d84d268d86bbf163b687d3-0.
INFO 03-02 00:59:54 [logger.py:42] Received request cmpl-8e5d7c45004f4a32995ef85a5d31f281-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:59:54 [async_llm.py:261] Added request cmpl-8e5d7c45004f4a32995ef85a5d31f281-0.
INFO 03-02 00:59:55 [logger.py:42] Received request cmpl-3348e203ce774280a7e0eee7a023f4dc-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:59:55 [async_llm.py:261] Added request cmpl-3348e203ce774280a7e0eee7a023f4dc-0.
INFO 03-02 00:59:56 [logger.py:42] Received request cmpl-6495248caec443779da44c8725aaee15-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:59:56 [async_llm.py:261] Added request cmpl-6495248caec443779da44c8725aaee15-0.
INFO 03-02 00:59:58 [logger.py:42] Received request cmpl-0d6b7f27c7ff4745aeff7fe2bab72693-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:59:58 [async_llm.py:261] Added request cmpl-0d6b7f27c7ff4745aeff7fe2bab72693-0.
INFO 03-02 00:59:58 [loggers.py:116] Engine 000: Avg prompt throughput: 24.8 tokens/s, Avg generation throughput: 4.0 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.6%
INFO 03-02 00:59:59 [logger.py:42] Received request cmpl-3e33eab5396840f0add17ef14df58193-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 00:59:59 [async_llm.py:261] Added request cmpl-3e33eab5396840f0add17ef14df58193-0.
INFO 03-02 01:00:00 [logger.py:42] Received request cmpl-3b61744538b64fe994b8fe9f0c607548-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:00:00 [async_llm.py:261] Added request cmpl-3b61744538b64fe994b8fe9f0c607548-0.
INFO 03-02 01:00:01 [logger.py:42] Received request cmpl-93a7a6515e704262a0ca7d35231fb0f1-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:00:01 [async_llm.py:261] Added request cmpl-93a7a6515e704262a0ca7d35231fb0f1-0.
INFO 03-02 01:00:02 [logger.py:42] Received request cmpl-1b672c1a562146edb8913fbed31c1e5c-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:00:02 [async_llm.py:261] Added request cmpl-1b672c1a562146edb8913fbed31c1e5c-0.
INFO 03-02 01:00:03 [logger.py:42] Received request cmpl-7ed25827f66a42daa48323a6e2507b5c-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:00:03 [async_llm.py:261] Added request cmpl-7ed25827f66a42daa48323a6e2507b5c-0.
INFO 03-02 01:00:05 [logger.py:42] Received request cmpl-b2060df17f344c139a7ae7ba2d7b8673-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:00:05 [async_llm.py:261] Added request cmpl-b2060df17f344c139a7ae7ba2d7b8673-0.
INFO 03-02 01:00:06 [logger.py:42] Received request cmpl-5994629d97014cbc93caab1604fefb10-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:00:06 [async_llm.py:261] Added request cmpl-5994629d97014cbc93caab1604fefb10-0.
INFO 03-02 01:00:07 [logger.py:42] Received request cmpl-6c9f363c58ee4e24b3e46c382c64657e-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:00:07 [async_llm.py:261] Added request cmpl-6c9f363c58ee4e24b3e46c382c64657e-0.
INFO 03-02 01:00:08 [loggers.py:116] Engine 000: Avg prompt throughput: 27.9 tokens/s, Avg generation throughput: 4.5 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.6%
INFO 03-02 01:00:08 [logger.py:42] Received request cmpl-7f3e1158bf304425b93b8ac34dd13613-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:00:08 [async_llm.py:261] Added request cmpl-7f3e1158bf304425b93b8ac34dd13613-0.
INFO 03-02 01:00:09 [logger.py:42] Received request cmpl-e18fe3221e474b86bb8367b3d7b643ff-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:00:09 [async_llm.py:261] Added request cmpl-e18fe3221e474b86bb8367b3d7b643ff-0.
INFO 03-02 01:00:10 [logger.py:42] Received request cmpl-11690c7411bf43f08aa6945850e01a35-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:00:10 [async_llm.py:261] Added request cmpl-11690c7411bf43f08aa6945850e01a35-0.
INFO 03-02 01:00:11 [logger.py:42] Received request cmpl-42262a4737e84f639e13733c65b22a2d-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:00:11 [async_llm.py:261] Added request cmpl-42262a4737e84f639e13733c65b22a2d-0.
INFO 03-02 01:00:13 [logger.py:42] Received request cmpl-e84fb2872ca34e4caa2ed54afa8712e6-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:00:13 [async_llm.py:261] Added request cmpl-e84fb2872ca34e4caa2ed54afa8712e6-0.
INFO 03-02 01:00:14 [logger.py:42] Received request cmpl-e0506b0375dd4be2a641084b6bf3ca32-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:00:14 [async_llm.py:261] Added request cmpl-e0506b0375dd4be2a641084b6bf3ca32-0.
INFO 03-02 01:00:15 [logger.py:42] Received request cmpl-175b2140e48c4a8c9121d5d2c1cef280-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:00:15 [async_llm.py:261] Added request cmpl-175b2140e48c4a8c9121d5d2c1cef280-0.
INFO 03-02 01:00:16 [logger.py:42] Received request cmpl-8f86ee12160b4e6c8db47525a26e27d3-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:00:16 [async_llm.py:261] Added request cmpl-8f86ee12160b4e6c8db47525a26e27d3-0.
INFO 03-02 01:00:17 [logger.py:42] Received request cmpl-1480fff3577543a59a250fc7247f3db7-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:00:17 [async_llm.py:261] Added request cmpl-1480fff3577543a59a250fc7247f3db7-0.
INFO 03-02 01:00:18 [loggers.py:116] Engine 000: Avg prompt throughput: 27.9 tokens/s, Avg generation throughput: 4.5 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.6%
INFO 03-02 01:00:18 [logger.py:42] Received request cmpl-ff82886a6282444ca8bfcc3821a68e58-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:00:18 [async_llm.py:261] Added request cmpl-ff82886a6282444ca8bfcc3821a68e58-0.
INFO 03-02 01:00:20 [logger.py:42] Received request cmpl-b75f55ede803435cb6f87e22bae6a556-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:00:20 [async_llm.py:261] Added request cmpl-b75f55ede803435cb6f87e22bae6a556-0.
INFO 03-02 01:00:21 [logger.py:42] Received request cmpl-7e070213b66d43ffa53b4f3f6476e86c-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:00:21 [async_llm.py:261] Added request cmpl-7e070213b66d43ffa53b4f3f6476e86c-0.
INFO 03-02 01:00:22 [logger.py:42] Received request cmpl-ddd0aa1fae864de1a8e9165406808158-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:00:22 [async_llm.py:261] Added request cmpl-ddd0aa1fae864de1a8e9165406808158-0.
INFO 03-02 01:00:23 [logger.py:42] Received request cmpl-2f34570b599c45fca7b07ea3a8326715-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:00:23 [async_llm.py:261] Added request cmpl-2f34570b599c45fca7b07ea3a8326715-0.
INFO 03-02 01:00:24 [logger.py:42] Received request cmpl-1fef129d47c3456ba4009e9b62eabbae-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:00:24 [async_llm.py:261] Added request cmpl-1fef129d47c3456ba4009e9b62eabbae-0.
INFO 03-02 01:00:25 [logger.py:42] Received request cmpl-504703e259eb437a943b3c3140401677-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:00:25 [async_llm.py:261] Added request cmpl-504703e259eb437a943b3c3140401677-0.
INFO 03-02 01:00:27 [logger.py:42] Received request cmpl-3a73ab02d79d4e2e910176f7c468a304-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:00:27 [async_llm.py:261] Added request cmpl-3a73ab02d79d4e2e910176f7c468a304-0.
INFO 03-02 01:00:28 [loggers.py:116] Engine 000: Avg prompt throughput: 24.8 tokens/s, Avg generation throughput: 4.0 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.6%
INFO 03-02 01:00:28 [logger.py:42] Received request cmpl-0b489913789843f29a315198b80dcc73-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:00:28 [async_llm.py:261] Added request cmpl-0b489913789843f29a315198b80dcc73-0.
INFO 03-02 01:00:29 [logger.py:42] Received request cmpl-83f0de9dd72448e195f606d50d5f9d9c-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:00:29 [async_llm.py:261] Added request cmpl-83f0de9dd72448e195f606d50d5f9d9c-0.
INFO 03-02 01:00:30 [logger.py:42] Received request cmpl-e5e40a49f6fe47958eec41632df7bcfe-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:00:30 [async_llm.py:261] Added request cmpl-e5e40a49f6fe47958eec41632df7bcfe-0.
INFO 03-02 01:00:31 [logger.py:42] Received request cmpl-2955ff48e7bc4c179f431f1ce464aa1d-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:00:31 [async_llm.py:261] Added request cmpl-2955ff48e7bc4c179f431f1ce464aa1d-0.
INFO 03-02 01:00:32 [logger.py:42] Received request cmpl-7532a61855b744e5b2f486071dc25d53-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:00:32 [async_llm.py:261] Added request cmpl-7532a61855b744e5b2f486071dc25d53-0.
INFO 03-02 01:00:33 [logger.py:42] Received request cmpl-565946f03d974f7783e9e7bcddfc6b36-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:00:33 [async_llm.py:261] Added request cmpl-565946f03d974f7783e9e7bcddfc6b36-0.
INFO 03-02 01:00:35 [logger.py:42] Received request cmpl-e7fcc6e8115f4bdb97f62f3c1e2908dc-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:00:35 [async_llm.py:261] Added request cmpl-e7fcc6e8115f4bdb97f62f3c1e2908dc-0.
INFO 03-02 01:00:36 [logger.py:42] Received request cmpl-ed3f4fade152470aa62e368cd42896a0-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:00:36 [async_llm.py:261] Added request cmpl-ed3f4fade152470aa62e368cd42896a0-0.
INFO 03-02 01:00:37 [logger.py:42] Received request cmpl-e9d5f697ffaf45a9af533a07fa833a1e-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:00:37 [async_llm.py:261] Added request cmpl-e9d5f697ffaf45a9af533a07fa833a1e-0.
INFO 03-02 01:00:38 [loggers.py:116] Engine 000: Avg prompt throughput: 27.9 tokens/s, Avg generation throughput: 4.5 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.6%
INFO 03-02 01:00:38 [logger.py:42] Received request cmpl-2e1a1234a8d540b88b1d479fdd58ecc5-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:00:38 [async_llm.py:261] Added request cmpl-2e1a1234a8d540b88b1d479fdd58ecc5-0.
INFO 03-02 01:00:39 [logger.py:42] Received request cmpl-53899a5183ba43a3b0e7aa8d913615de-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:00:39 [async_llm.py:261] Added request cmpl-53899a5183ba43a3b0e7aa8d913615de-0.
INFO 03-02 01:00:40 [logger.py:42] Received request cmpl-398c589fcbdc44df8696d321d05d10f8-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:00:40 [async_llm.py:261] Added request cmpl-398c589fcbdc44df8696d321d05d10f8-0.
INFO 03-02 01:00:42 [logger.py:42] Received request cmpl-398fec6c5453472db3b9964b8d38d807-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:00:42 [async_llm.py:261] Added request cmpl-398fec6c5453472db3b9964b8d38d807-0.
INFO 03-02 01:00:43 [logger.py:42] Received request cmpl-660b2d2535e64fdc883348c09dc67b8b-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:00:43 [async_llm.py:261] Added request cmpl-660b2d2535e64fdc883348c09dc67b8b-0.
INFO 03-02 01:00:44 [logger.py:42] Received request cmpl-8148b4cb89de4c34aba06e4215b3f8ba-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:00:44 [async_llm.py:261] Added request cmpl-8148b4cb89de4c34aba06e4215b3f8ba-0.
INFO 03-02 01:00:45 [logger.py:42] Received request cmpl-5fd87fb0d92440da9f5f1d8ad31e615b-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:00:45 [async_llm.py:261] Added request cmpl-5fd87fb0d92440da9f5f1d8ad31e615b-0.
INFO 03-02 01:00:46 [logger.py:42] Received request cmpl-668641b2620b4374947850fe1eae57d0-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:00:46 [async_llm.py:261] Added request cmpl-668641b2620b4374947850fe1eae57d0-0.
INFO 03-02 01:00:47 [logger.py:42] Received request cmpl-321bda4430c846c1bcc6d5ccca3f3d47-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:00:47 [async_llm.py:261] Added request cmpl-321bda4430c846c1bcc6d5ccca3f3d47-0.
INFO 03-02 01:00:48 [loggers.py:116] Engine 000: Avg prompt throughput: 27.9 tokens/s, Avg generation throughput: 4.5 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.6%
INFO 03-02 01:00:48 [logger.py:42] Received request cmpl-ea3573e768bf4b198cdccd08974932b2-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:00:48 [async_llm.py:261] Added request cmpl-ea3573e768bf4b198cdccd08974932b2-0.
INFO 03-02 01:00:50 [logger.py:42] Received request cmpl-afe1c06145754053b847adfe35747edd-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:00:50 [async_llm.py:261] Added request cmpl-afe1c06145754053b847adfe35747edd-0.
INFO 03-02 01:00:51 [logger.py:42] Received request cmpl-64eba1d3d55f46a4b7eb2e5b255c59bf-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:00:51 [async_llm.py:261] Added request cmpl-64eba1d3d55f46a4b7eb2e5b255c59bf-0.
INFO 03-02 01:00:52 [logger.py:42] Received request cmpl-7ea6e0fd9a804091838a1baf870b747a-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:00:52 [async_llm.py:261] Added request cmpl-7ea6e0fd9a804091838a1baf870b747a-0.
INFO 03-02 01:00:53 [logger.py:42] Received request cmpl-214a32caa25a45988d1699d89cd6c22e-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:00:53 [async_llm.py:261] Added request cmpl-214a32caa25a45988d1699d89cd6c22e-0.
INFO 03-02 01:00:54 [logger.py:42] Received request cmpl-acac627e596945ef92fb31b9e87aacc5-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:00:54 [async_llm.py:261] Added request cmpl-acac627e596945ef92fb31b9e87aacc5-0.
INFO 03-02 01:00:55 [logger.py:42] Received request cmpl-cb1acbe9595a426d8f4c5dc2411ce22a-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:00:55 [async_llm.py:261] Added request cmpl-cb1acbe9595a426d8f4c5dc2411ce22a-0.
INFO 03-02 01:00:57 [logger.py:42] Received request cmpl-f46979c259b0439f8c8961c8a78c50f6-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:00:57 [async_llm.py:261] Added request cmpl-f46979c259b0439f8c8961c8a78c50f6-0.
INFO 03-02 01:00:58 [loggers.py:116] Engine 000: Avg prompt throughput: 24.8 tokens/s, Avg generation throughput: 4.0 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.6%
INFO 03-02 01:00:58 [logger.py:42] Received request cmpl-9c837345b43c4a4d893be21dc472c9d9-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:00:58 [async_llm.py:261] Added request cmpl-9c837345b43c4a4d893be21dc472c9d9-0.
INFO 03-02 01:00:59 [logger.py:42] Received request cmpl-09c17f9c343a413aa07ffeeacfe518c5-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:00:59 [async_llm.py:261] Added request cmpl-09c17f9c343a413aa07ffeeacfe518c5-0.
INFO 03-02 01:01:00 [logger.py:42] Received request cmpl-bf847ce6330c44e183a305c6398c9317-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:01:00 [async_llm.py:261] Added request cmpl-bf847ce6330c44e183a305c6398c9317-0.
INFO 03-02 01:01:01 [logger.py:42] Received request cmpl-c6797f20168f42c49b5c7e49aa5d0f81-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:01:01 [async_llm.py:261] Added request cmpl-c6797f20168f42c49b5c7e49aa5d0f81-0.
INFO 03-02 01:01:02 [logger.py:42] Received request cmpl-5194968ff0e44c26a22c556a3097c718-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:01:02 [async_llm.py:261] Added request cmpl-5194968ff0e44c26a22c556a3097c718-0.
INFO 03-02 01:01:03 [logger.py:42] Received request cmpl-a13a1e975b2d47baae31a2d5227bf511-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:01:03 [async_llm.py:261] Added request cmpl-a13a1e975b2d47baae31a2d5227bf511-0.
INFO 03-02 01:01:05 [logger.py:42] Received request cmpl-0877697ebd924e3aaf9b305a5d3f90cc-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:01:05 [async_llm.py:261] Added request cmpl-0877697ebd924e3aaf9b305a5d3f90cc-0.
INFO 03-02 01:01:06 [logger.py:42] Received request cmpl-1b94dc374e1545d58104df7015697422-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:01:06 [async_llm.py:261] Added request cmpl-1b94dc374e1545d58104df7015697422-0.
INFO 03-02 01:01:07 [logger.py:42] Received request cmpl-5b57d5ad64d946dc88c580886af763ec-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:01:07 [async_llm.py:261] Added request cmpl-5b57d5ad64d946dc88c580886af763ec-0.
INFO 03-02 01:01:08 [loggers.py:116] Engine 000: Avg prompt throughput: 27.9 tokens/s, Avg generation throughput: 4.5 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.6%
INFO 03-02 01:01:08 [logger.py:42] Received request cmpl-2e46c2bbacdf405185c7014fb69f158d-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:01:08 [async_llm.py:261] Added request cmpl-2e46c2bbacdf405185c7014fb69f158d-0.
INFO 03-02 01:01:09 [logger.py:42] Received request cmpl-53a8a2bafaff4bc0ac11ec9b73213649-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:01:09 [async_llm.py:261] Added request cmpl-53a8a2bafaff4bc0ac11ec9b73213649-0.
INFO 03-02 01:01:10 [logger.py:42] Received request cmpl-b5cd603de53e442798109beac8b73cdb-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:01:10 [async_llm.py:261] Added request cmpl-b5cd603de53e442798109beac8b73cdb-0.
INFO 03-02 01:01:12 [logger.py:42] Received request cmpl-d008bc66d44d40debd90cdbf1d01eaa4-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:01:12 [async_llm.py:261] Added request cmpl-d008bc66d44d40debd90cdbf1d01eaa4-0.
INFO 03-02 01:01:13 [logger.py:42] Received request cmpl-c36bc140d52241b68a1c0675a8745097-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:01:13 [async_llm.py:261] Added request cmpl-c36bc140d52241b68a1c0675a8745097-0.
INFO 03-02 01:01:14 [logger.py:42] Received request cmpl-fd86038743e74b60ba820d71f151f179-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:01:14 [async_llm.py:261] Added request cmpl-fd86038743e74b60ba820d71f151f179-0.
INFO 03-02 01:01:15 [logger.py:42] Received request cmpl-513b3a68ec2545af98124ede42875c62-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:01:15 [async_llm.py:261] Added request cmpl-513b3a68ec2545af98124ede42875c62-0.
INFO 03-02 01:01:16 [logger.py:42] Received request cmpl-e91fabd4d95a4c2aacc9b11750b35d8f-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:01:16 [async_llm.py:261] Added request cmpl-e91fabd4d95a4c2aacc9b11750b35d8f-0.
INFO 03-02 01:01:17 [logger.py:42] Received request cmpl-b0bd4bf25c9243d180e7c02849bec415-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:01:17 [async_llm.py:261] Added request cmpl-b0bd4bf25c9243d180e7c02849bec415-0.
INFO 03-02 01:01:18 [loggers.py:116] Engine 000: Avg prompt throughput: 27.9 tokens/s, Avg generation throughput: 4.5 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.6%
INFO 03-02 01:01:18 [logger.py:42] Received request cmpl-1522eb643e8543849b6dc06496162fd3-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:01:18 [async_llm.py:261] Added request cmpl-1522eb643e8543849b6dc06496162fd3-0.
INFO 03-02 01:01:20 [logger.py:42] Received request cmpl-c6d8f1d1e218482b84256c507f285c3e-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:01:20 [async_llm.py:261] Added request cmpl-c6d8f1d1e218482b84256c507f285c3e-0.
INFO 03-02 01:01:21 [logger.py:42] Received request cmpl-709cbe51723c40e2b7dc5a3295a690c2-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:01:21 [async_llm.py:261] Added request cmpl-709cbe51723c40e2b7dc5a3295a690c2-0.
INFO 03-02 01:01:22 [logger.py:42] Received request cmpl-4fc04956281e431fb53a5847181caa97-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:01:22 [async_llm.py:261] Added request cmpl-4fc04956281e431fb53a5847181caa97-0.
INFO 03-02 01:01:23 [logger.py:42] Received request cmpl-8095ae8478ed42c7ac21753a2db3f725-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:01:23 [async_llm.py:261] Added request cmpl-8095ae8478ed42c7ac21753a2db3f725-0.
INFO 03-02 01:01:24 [logger.py:42] Received request cmpl-d52b6720f81b4736ae0dc18bf3627655-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:01:24 [async_llm.py:261] Added request cmpl-d52b6720f81b4736ae0dc18bf3627655-0.
INFO 03-02 01:01:25 [logger.py:42] Received request cmpl-d83b64d8a4d74a4dad64c5020632793b-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:01:25 [async_llm.py:261] Added request cmpl-d83b64d8a4d74a4dad64c5020632793b-0.
INFO 03-02 01:01:27 [logger.py:42] Received request cmpl-5582897385d44d459c5d79170c4be3a2-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:01:27 [async_llm.py:261] Added request cmpl-5582897385d44d459c5d79170c4be3a2-0.
INFO 03-02 01:01:28 [loggers.py:116] Engine 000: Avg prompt throughput: 24.8 tokens/s, Avg generation throughput: 4.0 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.6%
INFO 03-02 01:01:28 [logger.py:42] Received request cmpl-030b0707fd3445d7bc490d38243261f1-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:01:28 [async_llm.py:261] Added request cmpl-030b0707fd3445d7bc490d38243261f1-0.
INFO 03-02 01:01:29 [logger.py:42] Received request cmpl-8b298ab6cb6343da9dc81409d117a29c-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:01:29 [async_llm.py:261] Added request cmpl-8b298ab6cb6343da9dc81409d117a29c-0.
INFO 03-02 01:01:30 [logger.py:42] Received request cmpl-a86350af1bd341ab8167ae009d18a737-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:01:30 [async_llm.py:261] Added request cmpl-a86350af1bd341ab8167ae009d18a737-0.
INFO 03-02 01:01:31 [logger.py:42] Received request cmpl-a1b8712a50184ad0a6fa77ed7a11b2d0-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:01:31 [async_llm.py:261] Added request cmpl-a1b8712a50184ad0a6fa77ed7a11b2d0-0.
INFO 03-02 01:01:32 [logger.py:42] Received request cmpl-8905580a097246b2bd1daeaf7f05bf8a-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:01:32 [async_llm.py:261] Added request cmpl-8905580a097246b2bd1daeaf7f05bf8a-0.
INFO 03-02 01:01:33 [logger.py:42] Received request cmpl-8632069bbca94cfcb755427be49ede26-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:01:33 [async_llm.py:261] Added request cmpl-8632069bbca94cfcb755427be49ede26-0.
INFO 03-02 01:01:35 [logger.py:42] Received request cmpl-eb39d731b6044491894fbb647a59124e-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:01:35 [async_llm.py:261] Added request cmpl-eb39d731b6044491894fbb647a59124e-0.
INFO 03-02 01:01:36 [logger.py:42] Received request cmpl-4646515f63ed4b6b8e4abdc9a22a077a-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:01:36 [async_llm.py:261] Added request cmpl-4646515f63ed4b6b8e4abdc9a22a077a-0.
INFO 03-02 01:01:37 [logger.py:42] Received request cmpl-7c9e1be49dc8431e80bfe5cd2f82a15a-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:01:37 [async_llm.py:261] Added request cmpl-7c9e1be49dc8431e80bfe5cd2f82a15a-0.
INFO 03-02 01:01:38 [loggers.py:116] Engine 000: Avg prompt throughput: 27.9 tokens/s, Avg generation throughput: 4.5 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.6%
INFO 03-02 01:01:38 [logger.py:42] Received request cmpl-a118e7b40e624031812c71c6f98619b8-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:01:38 [async_llm.py:261] Added request cmpl-a118e7b40e624031812c71c6f98619b8-0.
INFO 03-02 01:01:39 [logger.py:42] Received request cmpl-b97492bf2b3f4dee92d1039837a34a3c-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:01:39 [async_llm.py:261] Added request cmpl-b97492bf2b3f4dee92d1039837a34a3c-0.
INFO 03-02 01:01:40 [logger.py:42] Received request cmpl-599311de31e341d89a54c197847c31a3-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:01:40 [async_llm.py:261] Added request cmpl-599311de31e341d89a54c197847c31a3-0.
INFO 03-02 01:01:42 [logger.py:42] Received request cmpl-3f3ef228c7be4ff49852987da0fcd323-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:01:42 [async_llm.py:261] Added request cmpl-3f3ef228c7be4ff49852987da0fcd323-0.
INFO 03-02 01:01:43 [logger.py:42] Received request cmpl-3691a63ee824454da6a89726885d9c6e-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:01:43 [async_llm.py:261] Added request cmpl-3691a63ee824454da6a89726885d9c6e-0.
INFO 03-02 01:01:44 [logger.py:42] Received request cmpl-feb27cd946884e51ae6ec0ccc35702aa-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:01:44 [async_llm.py:261] Added request cmpl-feb27cd946884e51ae6ec0ccc35702aa-0.
INFO 03-02 01:01:45 [logger.py:42] Received request cmpl-f0f1dc6cbe7f4c6b8fb41cddedaa0f8a-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:01:45 [async_llm.py:261] Added request cmpl-f0f1dc6cbe7f4c6b8fb41cddedaa0f8a-0.
INFO 03-02 01:01:46 [logger.py:42] Received request cmpl-70e6c6782d7c42cc857411417139d718-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:01:46 [async_llm.py:261] Added request cmpl-70e6c6782d7c42cc857411417139d718-0.
INFO 03-02 01:01:47 [logger.py:42] Received request cmpl-764657c35efd4017948c79fd90c6dbcf-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:01:47 [async_llm.py:261] Added request cmpl-764657c35efd4017948c79fd90c6dbcf-0.
INFO 03-02 01:01:48 [loggers.py:116] Engine 000: Avg prompt throughput: 27.9 tokens/s, Avg generation throughput: 4.5 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.6%
INFO 03-02 01:01:48 [logger.py:42] Received request cmpl-df68464e5bd04ae8a11ae33ae208e216-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:01:48 [async_llm.py:261] Added request cmpl-df68464e5bd04ae8a11ae33ae208e216-0.
INFO 03-02 01:01:50 [logger.py:42] Received request cmpl-3f4f83638fcf42d7bf55e58389ee18b5-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:01:50 [async_llm.py:261] Added request cmpl-3f4f83638fcf42d7bf55e58389ee18b5-0.
INFO 03-02 01:01:51 [logger.py:42] Received request cmpl-e8846448291c431fb61ed64e8ec0f79c-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:01:51 [async_llm.py:261] Added request cmpl-e8846448291c431fb61ed64e8ec0f79c-0.
INFO 03-02 01:01:52 [logger.py:42] Received request cmpl-9f456761f81d4ff38ef496cad6d5caea-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:01:52 [async_llm.py:261] Added request cmpl-9f456761f81d4ff38ef496cad6d5caea-0.
INFO 03-02 01:01:53 [logger.py:42] Received request cmpl-5e6d7272726c489883b41c07d96566cb-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:01:53 [async_llm.py:261] Added request cmpl-5e6d7272726c489883b41c07d96566cb-0.
INFO 03-02 01:01:54 [logger.py:42] Received request cmpl-0ef99dd82c07479f99528d3d17cd0026-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:01:54 [async_llm.py:261] Added request cmpl-0ef99dd82c07479f99528d3d17cd0026-0.
INFO 03-02 01:01:55 [logger.py:42] Received request cmpl-af84071c0493470ebc42d97cdea75063-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:01:55 [async_llm.py:261] Added request cmpl-af84071c0493470ebc42d97cdea75063-0.
INFO 03-02 01:01:57 [logger.py:42] Received request cmpl-01c9f08e70ab443cb9e2b0f5dd1b726e-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:01:57 [async_llm.py:261] Added request cmpl-01c9f08e70ab443cb9e2b0f5dd1b726e-0.
INFO 03-02 01:01:58 [loggers.py:116] Engine 000: Avg prompt throughput: 24.8 tokens/s, Avg generation throughput: 4.0 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.6%
INFO 03-02 01:01:58 [logger.py:42] Received request cmpl-06b0d75248f241b5bf1b5638b455da52-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:01:58 [async_llm.py:261] Added request cmpl-06b0d75248f241b5bf1b5638b455da52-0.
INFO 03-02 01:01:59 [logger.py:42] Received request cmpl-d38fdb0c34a04c64a0ad143742791495-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:01:59 [async_llm.py:261] Added request cmpl-d38fdb0c34a04c64a0ad143742791495-0.
INFO 03-02 01:02:00 [logger.py:42] Received request cmpl-28789565b5b840188578f2bf319276a5-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:02:00 [async_llm.py:261] Added request cmpl-28789565b5b840188578f2bf319276a5-0.
INFO 03-02 01:02:01 [logger.py:42] Received request cmpl-c77f8b219b6749f6998324ee084931f1-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:02:01 [async_llm.py:261] Added request cmpl-c77f8b219b6749f6998324ee084931f1-0.
INFO 03-02 01:02:02 [logger.py:42] Received request cmpl-1e752f5d1df2477c8f6c476a75e585db-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:02:02 [async_llm.py:261] Added request cmpl-1e752f5d1df2477c8f6c476a75e585db-0.
INFO 03-02 01:02:03 [logger.py:42] Received request cmpl-468c0ab82ac2411583d290305e16b007-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:02:03 [async_llm.py:261] Added request cmpl-468c0ab82ac2411583d290305e16b007-0.
INFO 03-02 01:02:05 [logger.py:42] Received request cmpl-206bbab82313458386072b869b86e8bd-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:02:05 [async_llm.py:261] Added request cmpl-206bbab82313458386072b869b86e8bd-0.
INFO 03-02 01:02:06 [logger.py:42] Received request cmpl-14102bbc8ab04c8695bddebec07f0d53-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:02:06 [async_llm.py:261] Added request cmpl-14102bbc8ab04c8695bddebec07f0d53-0.
INFO 03-02 01:02:07 [logger.py:42] Received request cmpl-354a2d24779141ba8eac4bff5dbddbff-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:02:07 [async_llm.py:261] Added request cmpl-354a2d24779141ba8eac4bff5dbddbff-0.
INFO 03-02 01:02:08 [loggers.py:116] Engine 000: Avg prompt throughput: 27.9 tokens/s, Avg generation throughput: 4.5 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.6%
INFO 03-02 01:02:08 [logger.py:42] Received request cmpl-20b2e6d725d54ed895a422a61354055b-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:02:08 [async_llm.py:261] Added request cmpl-20b2e6d725d54ed895a422a61354055b-0.
INFO 03-02 01:02:09 [logger.py:42] Received request cmpl-bf17390396e54fc98a6407146f6689ec-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:02:09 [async_llm.py:261] Added request cmpl-bf17390396e54fc98a6407146f6689ec-0.
INFO 03-02 01:02:10 [logger.py:42] Received request cmpl-937d6f740c11441ba04449778e4a819d-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:02:10 [async_llm.py:261] Added request cmpl-937d6f740c11441ba04449778e4a819d-0.
INFO 03-02 01:02:12 [logger.py:42] Received request cmpl-861375fcf3e140dfaf282c9bd7e26975-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:02:12 [async_llm.py:261] Added request cmpl-861375fcf3e140dfaf282c9bd7e26975-0.
INFO 03-02 01:02:13 [logger.py:42] Received request cmpl-23067f382e3d4ca9a1e64b31cf435097-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:02:13 [async_llm.py:261] Added request cmpl-23067f382e3d4ca9a1e64b31cf435097-0.
INFO 03-02 01:02:14 [logger.py:42] Received request cmpl-29e9cb63e6d9416788a0743bf17dd533-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:02:14 [async_llm.py:261] Added request cmpl-29e9cb63e6d9416788a0743bf17dd533-0.
INFO 03-02 01:02:15 [logger.py:42] Received request cmpl-2286381665a74480889fb0e4d287b22c-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:02:15 [async_llm.py:261] Added request cmpl-2286381665a74480889fb0e4d287b22c-0.
INFO 03-02 01:02:16 [logger.py:42] Received request cmpl-2331ee74f9ac4b3da5b3f16e5fa7986f-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:02:16 [async_llm.py:261] Added request cmpl-2331ee74f9ac4b3da5b3f16e5fa7986f-0.
INFO 03-02 01:02:17 [logger.py:42] Received request cmpl-710de962a8474e2f8a960dcdb0eb1234-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:02:17 [async_llm.py:261] Added request cmpl-710de962a8474e2f8a960dcdb0eb1234-0.
INFO 03-02 01:02:18 [loggers.py:116] Engine 000: Avg prompt throughput: 27.9 tokens/s, Avg generation throughput: 4.5 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.6%
INFO 03-02 01:02:18 [logger.py:42] Received request cmpl-2835649c380c48689b2dbc77479285f7-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:02:18 [async_llm.py:261] Added request cmpl-2835649c380c48689b2dbc77479285f7-0.
INFO 03-02 01:02:20 [logger.py:42] Received request cmpl-80f0549c4ad84285a602f3ead80fddf0-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:02:20 [async_llm.py:261] Added request cmpl-80f0549c4ad84285a602f3ead80fddf0-0.
INFO 03-02 01:02:21 [logger.py:42] Received request cmpl-8e5c4fb7f1e04570a7fb12f1993c7d2f-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:02:21 [async_llm.py:261] Added request cmpl-8e5c4fb7f1e04570a7fb12f1993c7d2f-0.
INFO 03-02 01:02:22 [logger.py:42] Received request cmpl-dd335306617a412d872c9260ff759955-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:02:22 [async_llm.py:261] Added request cmpl-dd335306617a412d872c9260ff759955-0.
INFO 03-02 01:02:23 [logger.py:42] Received request cmpl-ec91975a63524696a02e7ad305d2daab-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:02:23 [async_llm.py:261] Added request cmpl-ec91975a63524696a02e7ad305d2daab-0.
INFO 03-02 01:02:24 [logger.py:42] Received request cmpl-28bcf7a2c78e426cbc4d9576faa1ab10-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:02:24 [async_llm.py:261] Added request cmpl-28bcf7a2c78e426cbc4d9576faa1ab10-0.
INFO 03-02 01:02:25 [logger.py:42] Received request cmpl-7779637d12a5481284d5c4a134348e93-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:02:25 [async_llm.py:261] Added request cmpl-7779637d12a5481284d5c4a134348e93-0.
INFO 03-02 01:02:27 [logger.py:42] Received request cmpl-c0a9e870515840229d7108f4cd06e551-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:02:27 [async_llm.py:261] Added request cmpl-c0a9e870515840229d7108f4cd06e551-0.
INFO 03-02 01:02:28 [loggers.py:116] Engine 000: Avg prompt throughput: 24.8 tokens/s, Avg generation throughput: 4.0 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.6%
INFO 03-02 01:02:28 [logger.py:42] Received request cmpl-5f06b7dca23947fc993dd5fe2761530c-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:02:28 [async_llm.py:261] Added request cmpl-5f06b7dca23947fc993dd5fe2761530c-0.
INFO 03-02 01:02:29 [logger.py:42] Received request cmpl-cbdd9b4304134999bef3e5882a0db0c1-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:02:29 [async_llm.py:261] Added request cmpl-cbdd9b4304134999bef3e5882a0db0c1-0.
INFO 03-02 01:02:30 [logger.py:42] Received request cmpl-6cb60286d362471888847be189c973ec-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:02:30 [async_llm.py:261] Added request cmpl-6cb60286d362471888847be189c973ec-0.
INFO 03-02 01:02:31 [logger.py:42] Received request cmpl-7379fa3926c9413da9ac905814f1be52-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:02:31 [async_llm.py:261] Added request cmpl-7379fa3926c9413da9ac905814f1be52-0.
INFO 03-02 01:02:32 [logger.py:42] Received request cmpl-5225c60a67a64f9b86440b200b247744-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:02:32 [async_llm.py:261] Added request cmpl-5225c60a67a64f9b86440b200b247744-0.
INFO 03-02 01:02:33 [logger.py:42] Received request cmpl-906995b184d14e00ae05c0ea50237df9-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:02:33 [async_llm.py:261] Added request cmpl-906995b184d14e00ae05c0ea50237df9-0.
INFO 03-02 01:02:35 [logger.py:42] Received request cmpl-d900b300c6c042389ca4f845a671aa38-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:02:35 [async_llm.py:261] Added request cmpl-d900b300c6c042389ca4f845a671aa38-0.
INFO 03-02 01:02:36 [logger.py:42] Received request cmpl-c8598b7ccd60409e8dd772c8c26f8c2d-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:02:36 [async_llm.py:261] Added request cmpl-c8598b7ccd60409e8dd772c8c26f8c2d-0.
INFO 03-02 01:02:37 [logger.py:42] Received request cmpl-d9518190ebbb4647acc4e540e404114c-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:02:37 [async_llm.py:261] Added request cmpl-d9518190ebbb4647acc4e540e404114c-0.
INFO 03-02 01:02:38 [loggers.py:116] Engine 000: Avg prompt throughput: 27.9 tokens/s, Avg generation throughput: 4.5 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.6%
INFO 03-02 01:02:38 [logger.py:42] Received request cmpl-d045da21d5d442909db339d3e1fc7279-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:02:38 [async_llm.py:261] Added request cmpl-d045da21d5d442909db339d3e1fc7279-0.
INFO 03-02 01:02:39 [logger.py:42] Received request cmpl-d9c68977a2fd45ed9ea2a5976888acf2-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:02:39 [async_llm.py:261] Added request cmpl-d9c68977a2fd45ed9ea2a5976888acf2-0.
INFO 03-02 01:02:40 [logger.py:42] Received request cmpl-7050bd1e7df143a3ab758ae4cc853a59-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:02:40 [async_llm.py:261] Added request cmpl-7050bd1e7df143a3ab758ae4cc853a59-0.
INFO 03-02 01:02:42 [logger.py:42] Received request cmpl-7febfff0cd7a4320b6bb23455cddec9b-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:02:42 [async_llm.py:261] Added request cmpl-7febfff0cd7a4320b6bb23455cddec9b-0.
INFO 03-02 01:02:43 [logger.py:42] Received request cmpl-c5fa956feb31495ab85bfcf72f8cfb9a-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:02:43 [async_llm.py:261] Added request cmpl-c5fa956feb31495ab85bfcf72f8cfb9a-0.
INFO 03-02 01:02:44 [logger.py:42] Received request cmpl-2309d50c1f6a4467a5f24ec019432fda-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:02:44 [async_llm.py:261] Added request cmpl-2309d50c1f6a4467a5f24ec019432fda-0.
INFO 03-02 01:02:45 [logger.py:42] Received request cmpl-2cf59b6ae40c440fb53ffec86adc6aba-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:02:45 [async_llm.py:261] Added request cmpl-2cf59b6ae40c440fb53ffec86adc6aba-0.
INFO 03-02 01:02:46 [logger.py:42] Received request cmpl-a37462c691a442e99cbecd639362b314-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:02:46 [async_llm.py:261] Added request cmpl-a37462c691a442e99cbecd639362b314-0.
INFO 03-02 01:02:47 [logger.py:42] Received request cmpl-e17ef94a65764396b4504a41d61fd251-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:02:47 [async_llm.py:261] Added request cmpl-e17ef94a65764396b4504a41d61fd251-0.
INFO 03-02 01:02:48 [loggers.py:116] Engine 000: Avg prompt throughput: 27.9 tokens/s, Avg generation throughput: 4.5 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.6%
INFO 03-02 01:02:48 [logger.py:42] Received request cmpl-2cfbc079b1584289ba28ae25e4a9791a-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:02:48 [async_llm.py:261] Added request cmpl-2cfbc079b1584289ba28ae25e4a9791a-0.
INFO 03-02 01:02:50 [logger.py:42] Received request cmpl-a0a460c55215457181e3fc87e07b396e-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:02:50 [async_llm.py:261] Added request cmpl-a0a460c55215457181e3fc87e07b396e-0.
INFO 03-02 01:02:51 [logger.py:42] Received request cmpl-664ec8b3b759416ebb9c792987948d0d-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:02:51 [async_llm.py:261] Added request cmpl-664ec8b3b759416ebb9c792987948d0d-0.
INFO 03-02 01:02:52 [logger.py:42] Received request cmpl-8a6a085e23074459b82969e0f6696130-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:02:52 [async_llm.py:261] Added request cmpl-8a6a085e23074459b82969e0f6696130-0.
INFO 03-02 01:02:53 [logger.py:42] Received request cmpl-507a268948e341de848017e4440ae556-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:02:53 [async_llm.py:261] Added request cmpl-507a268948e341de848017e4440ae556-0.
INFO 03-02 01:02:54 [logger.py:42] Received request cmpl-7e77a04c12534a249bce4b1e3d23e159-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:02:54 [async_llm.py:261] Added request cmpl-7e77a04c12534a249bce4b1e3d23e159-0.
INFO 03-02 01:02:55 [logger.py:42] Received request cmpl-66621adbf3c64ba7ae0da5e323a48f51-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:02:55 [async_llm.py:261] Added request cmpl-66621adbf3c64ba7ae0da5e323a48f51-0.
INFO 03-02 01:02:57 [logger.py:42] Received request cmpl-a185e70a519344d889ccf3eceb87748d-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:02:57 [async_llm.py:261] Added request cmpl-a185e70a519344d889ccf3eceb87748d-0.
INFO 03-02 01:02:58 [loggers.py:116] Engine 000: Avg prompt throughput: 24.8 tokens/s, Avg generation throughput: 4.0 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.6%
INFO 03-02 01:02:58 [logger.py:42] Received request cmpl-d4151025f4204b188ef6df1d137a1e8d-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:02:58 [async_llm.py:261] Added request cmpl-d4151025f4204b188ef6df1d137a1e8d-0.
INFO 03-02 01:02:59 [logger.py:42] Received request cmpl-5bc7f7d6dc7a434fbdcd5b9c7c209a88-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:02:59 [async_llm.py:261] Added request cmpl-5bc7f7d6dc7a434fbdcd5b9c7c209a88-0.
INFO 03-02 01:03:00 [logger.py:42] Received request cmpl-915ad9975090498193f4e5b538588a82-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:03:00 [async_llm.py:261] Added request cmpl-915ad9975090498193f4e5b538588a82-0.
INFO 03-02 01:03:01 [logger.py:42] Received request cmpl-3c1bb401f2574cd6ae003d9b784e246e-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:03:01 [async_llm.py:261] Added request cmpl-3c1bb401f2574cd6ae003d9b784e246e-0.
INFO 03-02 01:03:02 [logger.py:42] Received request cmpl-604c8f4bfaf84369b49a27d64b9cce56-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:03:02 [async_llm.py:261] Added request cmpl-604c8f4bfaf84369b49a27d64b9cce56-0.
INFO 03-02 01:03:03 [logger.py:42] Received request cmpl-23222ff1670847df8da8b34e997d5a18-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:03:03 [async_llm.py:261] Added request cmpl-23222ff1670847df8da8b34e997d5a18-0.
INFO 03-02 01:03:05 [logger.py:42] Received request cmpl-164dfe7a95b146138f30fc1ac5d33211-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:03:05 [async_llm.py:261] Added request cmpl-164dfe7a95b146138f30fc1ac5d33211-0.
INFO 03-02 01:03:06 [logger.py:42] Received request cmpl-28083ddca7a944c98ac0356ef085b86b-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:03:06 [async_llm.py:261] Added request cmpl-28083ddca7a944c98ac0356ef085b86b-0.
INFO 03-02 01:03:07 [logger.py:42] Received request cmpl-0326cc003d8c4372a492356df6645983-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:03:07 [async_llm.py:261] Added request cmpl-0326cc003d8c4372a492356df6645983-0.
INFO 03-02 01:03:08 [loggers.py:116] Engine 000: Avg prompt throughput: 27.9 tokens/s, Avg generation throughput: 4.5 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.6%
INFO 03-02 01:03:08 [logger.py:42] Received request cmpl-ba1eac98aa2e4eaea3dfa4481f1372d9-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:03:08 [async_llm.py:261] Added request cmpl-ba1eac98aa2e4eaea3dfa4481f1372d9-0.
INFO 03-02 01:03:09 [logger.py:42] Received request cmpl-fbe3d68c3952408eabfcf626a5018645-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:03:09 [async_llm.py:261] Added request cmpl-fbe3d68c3952408eabfcf626a5018645-0.
INFO 03-02 01:03:10 [logger.py:42] Received request cmpl-e416c83a8c5a4db2ab98634a59bff0e6-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:03:10 [async_llm.py:261] Added request cmpl-e416c83a8c5a4db2ab98634a59bff0e6-0.
INFO 03-02 01:03:12 [logger.py:42] Received request cmpl-718c60fa3f3e4341832c1ed501385c61-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:03:12 [async_llm.py:261] Added request cmpl-718c60fa3f3e4341832c1ed501385c61-0.
INFO 03-02 01:03:13 [logger.py:42] Received request cmpl-f68ce46d21314a888b0931d3eb9a2987-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:03:13 [async_llm.py:261] Added request cmpl-f68ce46d21314a888b0931d3eb9a2987-0.
INFO 03-02 01:03:14 [logger.py:42] Received request cmpl-4df2b8321d3a4a4f880fafcc58d4c7df-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:03:14 [async_llm.py:261] Added request cmpl-4df2b8321d3a4a4f880fafcc58d4c7df-0.
INFO 03-02 01:03:15 [logger.py:42] Received request cmpl-93cde8d77dca4d95a537332a063b53a5-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:03:15 [async_llm.py:261] Added request cmpl-93cde8d77dca4d95a537332a063b53a5-0.
INFO 03-02 01:03:16 [logger.py:42] Received request cmpl-4eedaad5897049a096e4edb7f1fb2b5b-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:03:16 [async_llm.py:261] Added request cmpl-4eedaad5897049a096e4edb7f1fb2b5b-0.
INFO 03-02 01:03:17 [logger.py:42] Received request cmpl-c8c8d9991d504270a502d02d38ff2e5d-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:03:17 [async_llm.py:261] Added request cmpl-c8c8d9991d504270a502d02d38ff2e5d-0.
INFO 03-02 01:03:18 [loggers.py:116] Engine 000: Avg prompt throughput: 27.9 tokens/s, Avg generation throughput: 4.5 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.6%
INFO 03-02 01:03:18 [logger.py:42] Received request cmpl-bc501a4e95dc424280aae40b6f80a03c-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:03:18 [async_llm.py:261] Added request cmpl-bc501a4e95dc424280aae40b6f80a03c-0.
INFO 03-02 01:03:20 [logger.py:42] Received request cmpl-cfb4cb494a0e40e6b63b4676c0354c42-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:03:20 [async_llm.py:261] Added request cmpl-cfb4cb494a0e40e6b63b4676c0354c42-0.
INFO 03-02 01:03:21 [logger.py:42] Received request cmpl-9e9b08fe9ffa49dda2714c55b0241d8c-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:03:21 [async_llm.py:261] Added request cmpl-9e9b08fe9ffa49dda2714c55b0241d8c-0.
INFO 03-02 01:03:22 [logger.py:42] Received request cmpl-7f2d3ee78ad94793925fdd6af45be289-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:03:22 [async_llm.py:261] Added request cmpl-7f2d3ee78ad94793925fdd6af45be289-0.
INFO 03-02 01:03:23 [logger.py:42] Received request cmpl-8b227632cfbe458ca500096b8c232de4-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:03:23 [async_llm.py:261] Added request cmpl-8b227632cfbe458ca500096b8c232de4-0.
INFO 03-02 01:03:24 [logger.py:42] Received request cmpl-6162fd5de15c4a92a1e27b9a25dfe5f9-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:03:24 [async_llm.py:261] Added request cmpl-6162fd5de15c4a92a1e27b9a25dfe5f9-0.
INFO 03-02 01:03:25 [logger.py:42] Received request cmpl-4633f1ba86cf4f0c80b06f4a751fcd48-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:03:25 [async_llm.py:261] Added request cmpl-4633f1ba86cf4f0c80b06f4a751fcd48-0.
INFO 03-02 01:03:27 [logger.py:42] Received request cmpl-061330c19ae344d580d51cf5a7954660-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:03:27 [async_llm.py:261] Added request cmpl-061330c19ae344d580d51cf5a7954660-0.
INFO 03-02 01:03:28 [loggers.py:116] Engine 000: Avg prompt throughput: 24.8 tokens/s, Avg generation throughput: 4.0 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.6%
INFO 03-02 01:03:28 [logger.py:42] Received request cmpl-b24065b7a4cc45c097f78f0ff34394d6-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:03:28 [async_llm.py:261] Added request cmpl-b24065b7a4cc45c097f78f0ff34394d6-0.
INFO 03-02 01:03:29 [logger.py:42] Received request cmpl-53d3a9281d194a1d9b58de32eeaa5852-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:03:29 [async_llm.py:261] Added request cmpl-53d3a9281d194a1d9b58de32eeaa5852-0.
INFO 03-02 01:03:30 [logger.py:42] Received request cmpl-22679bf8aed34f74b2f3fcba83ec8f1f-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:03:30 [async_llm.py:261] Added request cmpl-22679bf8aed34f74b2f3fcba83ec8f1f-0.
INFO 03-02 01:03:31 [logger.py:42] Received request cmpl-b572efeb42b94e5382bd38b4791ffb98-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:03:31 [async_llm.py:261] Added request cmpl-b572efeb42b94e5382bd38b4791ffb98-0.
INFO 03-02 01:03:32 [logger.py:42] Received request cmpl-ae58e02fc9954148818c4d63f20969eb-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:03:32 [async_llm.py:261] Added request cmpl-ae58e02fc9954148818c4d63f20969eb-0.
INFO 03-02 01:03:33 [logger.py:42] Received request cmpl-37c70b026b58474dbdc04b4a61faf04b-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:03:33 [async_llm.py:261] Added request cmpl-37c70b026b58474dbdc04b4a61faf04b-0.
INFO 03-02 01:03:35 [logger.py:42] Received request cmpl-7d227f65b0ec47e299277e72bdbf3da8-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:03:35 [async_llm.py:261] Added request cmpl-7d227f65b0ec47e299277e72bdbf3da8-0.
INFO 03-02 01:03:36 [logger.py:42] Received request cmpl-58d52661e66e4d31a9aa712f1c62c9ab-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:03:36 [async_llm.py:261] Added request cmpl-58d52661e66e4d31a9aa712f1c62c9ab-0.
INFO 03-02 01:03:37 [logger.py:42] Received request cmpl-311de8cf93194d4681385797b7326717-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:03:37 [async_llm.py:261] Added request cmpl-311de8cf93194d4681385797b7326717-0.
INFO 03-02 01:03:38 [loggers.py:116] Engine 000: Avg prompt throughput: 27.9 tokens/s, Avg generation throughput: 4.5 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.6%
INFO 03-02 01:03:38 [logger.py:42] Received request cmpl-0294611fa1bb40468d8e1145a2f43f19-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:03:38 [async_llm.py:261] Added request cmpl-0294611fa1bb40468d8e1145a2f43f19-0.
INFO 03-02 01:03:39 [logger.py:42] Received request cmpl-f6422ac7544a4c6abb8b8764d8d48c7e-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:03:39 [async_llm.py:261] Added request cmpl-f6422ac7544a4c6abb8b8764d8d48c7e-0.
INFO 03-02 01:03:40 [logger.py:42] Received request cmpl-c463aa25fca14acb8c6dd5802173c023-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:03:40 [async_llm.py:261] Added request cmpl-c463aa25fca14acb8c6dd5802173c023-0.
INFO 03-02 01:03:41 [logger.py:42] Received request cmpl-48638c177f95447ca314c08bc291d72f-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:03:41 [async_llm.py:261] Added request cmpl-48638c177f95447ca314c08bc291d72f-0.
INFO 03-02 01:03:43 [logger.py:42] Received request cmpl-98c8daf0077443aab884061b82b09938-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:03:43 [async_llm.py:261] Added request cmpl-98c8daf0077443aab884061b82b09938-0.
INFO 03-02 01:03:44 [logger.py:42] Received request cmpl-3fd6da545dc84394bc2c96b5aeacd5e3-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:03:44 [async_llm.py:261] Added request cmpl-3fd6da545dc84394bc2c96b5aeacd5e3-0.
INFO 03-02 01:03:45 [logger.py:42] Received request cmpl-1371e3d9da634cdea0354613d2aae9a1-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:03:45 [async_llm.py:261] Added request cmpl-1371e3d9da634cdea0354613d2aae9a1-0.
INFO 03-02 01:03:46 [logger.py:42] Received request cmpl-6144309f96b54162a630fb2e336fcfa7-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:03:46 [async_llm.py:261] Added request cmpl-6144309f96b54162a630fb2e336fcfa7-0.
INFO 03-02 01:03:47 [logger.py:42] Received request cmpl-3a9e884fe4fc4687b1c72718ea29eac6-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:03:47 [async_llm.py:261] Added request cmpl-3a9e884fe4fc4687b1c72718ea29eac6-0.
INFO 03-02 01:03:48 [loggers.py:116] Engine 000: Avg prompt throughput: 27.9 tokens/s, Avg generation throughput: 4.5 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.6%
INFO 03-02 01:03:48 [logger.py:42] Received request cmpl-f2034cb3801c48aca9e477298be02ade-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:03:48 [async_llm.py:261] Added request cmpl-f2034cb3801c48aca9e477298be02ade-0.
INFO 03-02 01:03:50 [logger.py:42] Received request cmpl-b292b01fc3704792b45e35718353f41b-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:03:50 [async_llm.py:261] Added request cmpl-b292b01fc3704792b45e35718353f41b-0.
INFO 03-02 01:03:51 [logger.py:42] Received request cmpl-fd562bef900243f899ac42779add97a0-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:03:51 [async_llm.py:261] Added request cmpl-fd562bef900243f899ac42779add97a0-0.
INFO 03-02 01:03:52 [logger.py:42] Received request cmpl-810f28eea56243cb90fd4d6792072106-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:03:52 [async_llm.py:261] Added request cmpl-810f28eea56243cb90fd4d6792072106-0.
INFO 03-02 01:03:53 [logger.py:42] Received request cmpl-3ac59f5c88514d2bad79c0d443372a17-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:03:53 [async_llm.py:261] Added request cmpl-3ac59f5c88514d2bad79c0d443372a17-0.
INFO 03-02 01:03:54 [logger.py:42] Received request cmpl-66fa685af8a74d0595746a139d1f5798-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:03:54 [async_llm.py:261] Added request cmpl-66fa685af8a74d0595746a139d1f5798-0.
INFO 03-02 01:03:55 [logger.py:42] Received request cmpl-ae2af6e0eff64856851f75f867c847ef-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:03:55 [async_llm.py:261] Added request cmpl-ae2af6e0eff64856851f75f867c847ef-0.
INFO 03-02 01:03:56 [logger.py:42] Received request cmpl-31f68a316c474a8a93ee3eac4e37a652-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:03:56 [async_llm.py:261] Added request cmpl-31f68a316c474a8a93ee3eac4e37a652-0.
INFO 03-02 01:03:58 [logger.py:42] Received request cmpl-1dc62facf6244793b8be6fdbad3508cd-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:03:58 [async_llm.py:261] Added request cmpl-1dc62facf6244793b8be6fdbad3508cd-0.
INFO 03-02 01:03:58 [loggers.py:116] Engine 000: Avg prompt throughput: 24.8 tokens/s, Avg generation throughput: 4.0 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.6%
INFO 03-02 01:03:59 [logger.py:42] Received request cmpl-f0b2246d05c44ccf9d50e8ceeabcea33-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:03:59 [async_llm.py:261] Added request cmpl-f0b2246d05c44ccf9d50e8ceeabcea33-0.
INFO 03-02 01:04:00 [logger.py:42] Received request cmpl-e5612fc7a82645c0b8bc2b92b06286dd-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:04:00 [async_llm.py:261] Added request cmpl-e5612fc7a82645c0b8bc2b92b06286dd-0.
INFO 03-02 01:04:01 [logger.py:42] Received request cmpl-8a0f1393e58e43089c6c9e1b97c03fc2-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:04:01 [async_llm.py:261] Added request cmpl-8a0f1393e58e43089c6c9e1b97c03fc2-0.
INFO 03-02 01:04:02 [logger.py:42] Received request cmpl-ef87f4c01775458792069a19d2470b11-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:04:02 [async_llm.py:261] Added request cmpl-ef87f4c01775458792069a19d2470b11-0.
INFO 03-02 01:04:03 [logger.py:42] Received request cmpl-6967f4645b9949448f9b01c5fb48d1ba-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:04:03 [async_llm.py:261] Added request cmpl-6967f4645b9949448f9b01c5fb48d1ba-0.
INFO 03-02 01:04:05 [logger.py:42] Received request cmpl-d7cf06eecef14d32acae9196766e00cb-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:04:05 [async_llm.py:261] Added request cmpl-d7cf06eecef14d32acae9196766e00cb-0.
INFO 03-02 01:04:06 [logger.py:42] Received request cmpl-c5215ce0a0484619ba084e3f8843fbb0-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:04:06 [async_llm.py:261] Added request cmpl-c5215ce0a0484619ba084e3f8843fbb0-0.
INFO 03-02 01:04:07 [logger.py:42] Received request cmpl-669ac89f1d844e89a479c8973bbf275a-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:04:07 [async_llm.py:261] Added request cmpl-669ac89f1d844e89a479c8973bbf275a-0.
INFO 03-02 01:04:08 [loggers.py:116] Engine 000: Avg prompt throughput: 27.9 tokens/s, Avg generation throughput: 4.5 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.6%
INFO 03-02 01:04:08 [logger.py:42] Received request cmpl-3def3a5075d74c02bcb7b8c6c223ed61-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:04:08 [async_llm.py:261] Added request cmpl-3def3a5075d74c02bcb7b8c6c223ed61-0.
INFO 03-02 01:04:09 [logger.py:42] Received request cmpl-65014853227e43a096e7005b4d4a2e20-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:04:09 [async_llm.py:261] Added request cmpl-65014853227e43a096e7005b4d4a2e20-0.
INFO 03-02 01:04:10 [logger.py:42] Received request cmpl-40ce419446274262b990848a85a79e9c-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:04:10 [async_llm.py:261] Added request cmpl-40ce419446274262b990848a85a79e9c-0.
INFO 03-02 01:04:11 [logger.py:42] Received request cmpl-dc296f35c1d34ef891c30cca3059b9e0-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:04:11 [async_llm.py:261] Added request cmpl-dc296f35c1d34ef891c30cca3059b9e0-0.
INFO 03-02 01:04:13 [logger.py:42] Received request cmpl-1adce6ecb2f340dfa5c59db182df2822-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:04:13 [async_llm.py:261] Added request cmpl-1adce6ecb2f340dfa5c59db182df2822-0.
INFO 03-02 01:04:14 [logger.py:42] Received request cmpl-a0b2c91f19e542ef891cfd996ba3e9b1-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:04:14 [async_llm.py:261] Added request cmpl-a0b2c91f19e542ef891cfd996ba3e9b1-0.
INFO 03-02 01:04:15 [logger.py:42] Received request cmpl-f3ff674cdd9c46459d76307b7c96e4fa-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:04:15 [async_llm.py:261] Added request cmpl-f3ff674cdd9c46459d76307b7c96e4fa-0.
INFO 03-02 01:04:16 [logger.py:42] Received request cmpl-aa90e27ae618488e81b80c1175286623-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:04:16 [async_llm.py:261] Added request cmpl-aa90e27ae618488e81b80c1175286623-0.
INFO 03-02 01:04:17 [logger.py:42] Received request cmpl-0094fad16d5642858d06f3a1301e240c-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:04:17 [async_llm.py:261] Added request cmpl-0094fad16d5642858d06f3a1301e240c-0.
INFO 03-02 01:04:18 [loggers.py:116] Engine 000: Avg prompt throughput: 27.9 tokens/s, Avg generation throughput: 4.5 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.6%
INFO 03-02 01:04:18 [logger.py:42] Received request cmpl-d9f1d5684c7e4f09ae6c7027ebddbbf6-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:04:18 [async_llm.py:261] Added request cmpl-d9f1d5684c7e4f09ae6c7027ebddbbf6-0.
INFO 03-02 01:04:20 [logger.py:42] Received request cmpl-59533a2480a6477f8ad84cba089d1813-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:04:20 [async_llm.py:261] Added request cmpl-59533a2480a6477f8ad84cba089d1813-0.
INFO 03-02 01:04:21 [logger.py:42] Received request cmpl-47e6b7152f8240b1b76fbffc8a861663-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:04:21 [async_llm.py:261] Added request cmpl-47e6b7152f8240b1b76fbffc8a861663-0.
INFO 03-02 01:04:22 [logger.py:42] Received request cmpl-2de73ef12ba74ca79d5141e82a4b6a71-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:04:22 [async_llm.py:261] Added request cmpl-2de73ef12ba74ca79d5141e82a4b6a71-0.
INFO 03-02 01:04:23 [logger.py:42] Received request cmpl-8eef74e49309413fbb85795f2cc39e60-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:04:23 [async_llm.py:261] Added request cmpl-8eef74e49309413fbb85795f2cc39e60-0.
INFO 03-02 01:04:24 [logger.py:42] Received request cmpl-538fbfdad77b40218a1b71652ac3b7a3-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:04:24 [async_llm.py:261] Added request cmpl-538fbfdad77b40218a1b71652ac3b7a3-0.
INFO 03-02 01:04:25 [logger.py:42] Received request cmpl-3c0e733470cc486d8802b0843ed0ecbd-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:04:25 [async_llm.py:261] Added request cmpl-3c0e733470cc486d8802b0843ed0ecbd-0.
INFO 03-02 01:04:26 [logger.py:42] Received request cmpl-8c6035112d804747b42c30181a896147-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:04:26 [async_llm.py:261] Added request cmpl-8c6035112d804747b42c30181a896147-0.
INFO 03-02 01:04:28 [logger.py:42] Received request cmpl-ba3a468e680a455a946a82b50c7ec51b-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:04:28 [async_llm.py:261] Added request cmpl-ba3a468e680a455a946a82b50c7ec51b-0.
INFO 03-02 01:04:28 [loggers.py:116] Engine 000: Avg prompt throughput: 27.9 tokens/s, Avg generation throughput: 4.1 tokens/s, Running: 1 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.3%, Prefix cache hit rate: 51.6%
INFO 03-02 01:04:29 [logger.py:42] Received request cmpl-96fc51523a96469ba233068f61e81e7b-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:04:29 [async_llm.py:261] Added request cmpl-96fc51523a96469ba233068f61e81e7b-0.
INFO 03-02 01:04:30 [logger.py:42] Received request cmpl-44488558dbd143439614aa62e92243f8-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:04:30 [async_llm.py:261] Added request cmpl-44488558dbd143439614aa62e92243f8-0.
INFO 03-02 01:04:31 [logger.py:42] Received request cmpl-f222e73834af455c99591547fa7deffe-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:04:31 [async_llm.py:261] Added request cmpl-f222e73834af455c99591547fa7deffe-0.
INFO 03-02 01:04:32 [logger.py:42] Received request cmpl-8c252286d9b84bcf9f44d3ba7f38b70a-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:04:32 [async_llm.py:261] Added request cmpl-8c252286d9b84bcf9f44d3ba7f38b70a-0.
INFO 03-02 01:04:33 [logger.py:42] Received request cmpl-4525325074394fbdafc5fb863f6d03d8-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:04:33 [async_llm.py:261] Added request cmpl-4525325074394fbdafc5fb863f6d03d8-0.
INFO 03-02 01:04:35 [logger.py:42] Received request cmpl-023d2ead4bd2461bb99d9b3de5214d98-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:04:35 [async_llm.py:261] Added request cmpl-023d2ead4bd2461bb99d9b3de5214d98-0.
INFO 03-02 01:04:36 [logger.py:42] Received request cmpl-0ce591d932d8433083bfb860256c47cf-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:04:36 [async_llm.py:261] Added request cmpl-0ce591d932d8433083bfb860256c47cf-0.
INFO 03-02 01:04:37 [logger.py:42] Received request cmpl-25cfd686c6334f43a15db83e1eec68ad-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:04:37 [async_llm.py:261] Added request cmpl-25cfd686c6334f43a15db83e1eec68ad-0.
INFO 03-02 01:04:38 [loggers.py:116] Engine 000: Avg prompt throughput: 24.8 tokens/s, Avg generation throughput: 4.4 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.6%
INFO 03-02 01:04:38 [logger.py:42] Received request cmpl-046b097161494f8689b10ec2f6f7d888-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:04:38 [async_llm.py:261] Added request cmpl-046b097161494f8689b10ec2f6f7d888-0.
INFO 03-02 01:04:39 [logger.py:42] Received request cmpl-58d78b3873aa4d82b71165fa8bf53030-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:04:39 [async_llm.py:261] Added request cmpl-58d78b3873aa4d82b71165fa8bf53030-0.
INFO 03-02 01:04:40 [logger.py:42] Received request cmpl-fb49a065cccd43289dac249f4e202cca-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:04:40 [async_llm.py:261] Added request cmpl-fb49a065cccd43289dac249f4e202cca-0.
INFO 03-02 01:04:41 [logger.py:42] Received request cmpl-3686c0012d71415899dc50be6ac90b54-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:04:41 [async_llm.py:261] Added request cmpl-3686c0012d71415899dc50be6ac90b54-0.
INFO 03-02 01:04:43 [logger.py:42] Received request cmpl-51644f28b1f046318de2fb7bc01fd311-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:04:43 [async_llm.py:261] Added request cmpl-51644f28b1f046318de2fb7bc01fd311-0.
INFO 03-02 01:04:44 [logger.py:42] Received request cmpl-6714c1b98104449ebfe1783f28752318-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:04:44 [async_llm.py:261] Added request cmpl-6714c1b98104449ebfe1783f28752318-0.
INFO 03-02 01:04:45 [logger.py:42] Received request cmpl-095b56941ae4449ab6ede70d8259213b-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:04:45 [async_llm.py:261] Added request cmpl-095b56941ae4449ab6ede70d8259213b-0.
INFO 03-02 01:04:46 [logger.py:42] Received request cmpl-78e2a1276ed94e89be236ed779a964c7-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:04:46 [async_llm.py:261] Added request cmpl-78e2a1276ed94e89be236ed779a964c7-0.
INFO 03-02 01:04:47 [logger.py:42] Received request cmpl-453572cac54f4de2aabef732ede28ac6-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:04:47 [async_llm.py:261] Added request cmpl-453572cac54f4de2aabef732ede28ac6-0.
INFO 03-02 01:04:48 [loggers.py:116] Engine 000: Avg prompt throughput: 27.9 tokens/s, Avg generation throughput: 4.5 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.6%
INFO 03-02 01:04:48 [logger.py:42] Received request cmpl-eeb9710f422f41b6b5aaf10a3bbecbc6-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:04:48 [async_llm.py:261] Added request cmpl-eeb9710f422f41b6b5aaf10a3bbecbc6-0.
INFO 03-02 01:04:50 [logger.py:42] Received request cmpl-270d8a0d6567474b941c79b70ad46bbb-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:04:50 [async_llm.py:261] Added request cmpl-270d8a0d6567474b941c79b70ad46bbb-0.
INFO 03-02 01:04:51 [logger.py:42] Received request cmpl-b6b1725303034678a577a9aa3832b725-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:04:51 [async_llm.py:261] Added request cmpl-b6b1725303034678a577a9aa3832b725-0.
INFO 03-02 01:04:52 [logger.py:42] Received request cmpl-0dad1cd0ea8c4e0da779ace88f11cea6-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:04:52 [async_llm.py:261] Added request cmpl-0dad1cd0ea8c4e0da779ace88f11cea6-0.
INFO 03-02 01:04:53 [logger.py:42] Received request cmpl-18a99d6d74b74f2f860ea6cf918976e1-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:04:53 [async_llm.py:261] Added request cmpl-18a99d6d74b74f2f860ea6cf918976e1-0.
INFO 03-02 01:04:54 [logger.py:42] Received request cmpl-50dd19d304b94ff0bc207a6066f9dfab-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:04:54 [async_llm.py:261] Added request cmpl-50dd19d304b94ff0bc207a6066f9dfab-0.
INFO 03-02 01:04:55 [logger.py:42] Received request cmpl-27c0862205634b28b03f56b8bc1f1b8d-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:04:55 [async_llm.py:261] Added request cmpl-27c0862205634b28b03f56b8bc1f1b8d-0.
INFO 03-02 01:04:56 [logger.py:42] Received request cmpl-de654d8cad8849419503fe245dda682b-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:04:56 [async_llm.py:261] Added request cmpl-de654d8cad8849419503fe245dda682b-0.
INFO 03-02 01:04:58 [logger.py:42] Received request cmpl-077e61171b2442f1aa19ce2e68784ff6-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:04:58 [async_llm.py:261] Added request cmpl-077e61171b2442f1aa19ce2e68784ff6-0.
INFO 03-02 01:04:58 [loggers.py:116] Engine 000: Avg prompt throughput: 27.9 tokens/s, Avg generation throughput: 4.2 tokens/s, Running: 1 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.3%, Prefix cache hit rate: 51.6%
INFO 03-02 01:04:59 [logger.py:42] Received request cmpl-519fdc1333994dc1ae6ce1ee3fdbe664-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:04:59 [async_llm.py:261] Added request cmpl-519fdc1333994dc1ae6ce1ee3fdbe664-0.
INFO 03-02 01:05:00 [logger.py:42] Received request cmpl-5343492259284f5f9c6084138a1b205f-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:05:00 [async_llm.py:261] Added request cmpl-5343492259284f5f9c6084138a1b205f-0.
INFO 03-02 01:05:01 [logger.py:42] Received request cmpl-ec020d4f8ff140228ec6d235318cbd49-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:05:01 [async_llm.py:261] Added request cmpl-ec020d4f8ff140228ec6d235318cbd49-0.
INFO 03-02 01:05:02 [logger.py:42] Received request cmpl-5f5c591ed00941c78900464f6882346e-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:05:02 [async_llm.py:261] Added request cmpl-5f5c591ed00941c78900464f6882346e-0.
INFO 03-02 01:05:03 [logger.py:42] Received request cmpl-4f00b3aea087460db8c490d0e5bcd5fa-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:05:03 [async_llm.py:261] Added request cmpl-4f00b3aea087460db8c490d0e5bcd5fa-0.
INFO 03-02 01:05:05 [logger.py:42] Received request cmpl-618bacb7b59d46c0a5133c90ab9a1958-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:05:05 [async_llm.py:261] Added request cmpl-618bacb7b59d46c0a5133c90ab9a1958-0.
INFO 03-02 01:05:06 [logger.py:42] Received request cmpl-8e6fdf8a5b0e42539261607556d0dc04-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:05:06 [async_llm.py:261] Added request cmpl-8e6fdf8a5b0e42539261607556d0dc04-0.
INFO 03-02 01:05:07 [logger.py:42] Received request cmpl-2d9435d3e7ce48229772fe2a4ab49a3b-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:05:07 [async_llm.py:261] Added request cmpl-2d9435d3e7ce48229772fe2a4ab49a3b-0.
INFO 03-02 01:05:08 [loggers.py:116] Engine 000: Avg prompt throughput: 24.8 tokens/s, Avg generation throughput: 4.3 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.6%
INFO 03-02 01:05:08 [logger.py:42] Received request cmpl-0040255ac76f4828bf09a90f392e1b23-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:05:08 [async_llm.py:261] Added request cmpl-0040255ac76f4828bf09a90f392e1b23-0.
INFO 03-02 01:05:09 [logger.py:42] Received request cmpl-4af865527c6a432188f36f9c2ebf1709-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:05:09 [async_llm.py:261] Added request cmpl-4af865527c6a432188f36f9c2ebf1709-0.
INFO 03-02 01:05:10 [logger.py:42] Received request cmpl-b4a3ffa06aa548ce96546b76bdb3a0d1-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:05:10 [async_llm.py:261] Added request cmpl-b4a3ffa06aa548ce96546b76bdb3a0d1-0.
INFO 03-02 01:05:11 [logger.py:42] Received request cmpl-96332765791f4f5db0ba97c37e784c3c-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:05:11 [async_llm.py:261] Added request cmpl-96332765791f4f5db0ba97c37e784c3c-0.
INFO 03-02 01:05:13 [logger.py:42] Received request cmpl-6b48c1271bdd40469c48279effd30fd8-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:05:13 [async_llm.py:261] Added request cmpl-6b48c1271bdd40469c48279effd30fd8-0.
INFO 03-02 01:05:14 [logger.py:42] Received request cmpl-ffa57c6024f04c5b92cdf2c6db71f862-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:05:14 [async_llm.py:261] Added request cmpl-ffa57c6024f04c5b92cdf2c6db71f862-0.
INFO 03-02 01:05:15 [logger.py:42] Received request cmpl-c433e8de5c3941a38a6ba6ce8581f64e-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:05:15 [async_llm.py:261] Added request cmpl-c433e8de5c3941a38a6ba6ce8581f64e-0.
INFO 03-02 01:05:16 [logger.py:42] Received request cmpl-dcb7370d311b466b9c42f5d677fc6e28-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:05:16 [async_llm.py:261] Added request cmpl-dcb7370d311b466b9c42f5d677fc6e28-0.
INFO 03-02 01:05:17 [logger.py:42] Received request cmpl-4cc5c689ed5f4beeae9b5b9d567ae296-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:05:17 [async_llm.py:261] Added request cmpl-4cc5c689ed5f4beeae9b5b9d567ae296-0.
INFO 03-02 01:05:18 [loggers.py:116] Engine 000: Avg prompt throughput: 27.9 tokens/s, Avg generation throughput: 4.5 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.6%
INFO 03-02 01:05:18 [logger.py:42] Received request cmpl-d4156fd25b7e4101ba7818b3fd08c03c-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:05:18 [async_llm.py:261] Added request cmpl-d4156fd25b7e4101ba7818b3fd08c03c-0.
INFO 03-02 01:05:19 [logger.py:42] Received request cmpl-bdf07fc616784ade83bbd6051f062ec6-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:05:19 [async_llm.py:261] Added request cmpl-bdf07fc616784ade83bbd6051f062ec6-0.
INFO 03-02 01:05:21 [logger.py:42] Received request cmpl-6ebdafb137a04c2a8994b8312ed29e55-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:05:21 [async_llm.py:261] Added request cmpl-6ebdafb137a04c2a8994b8312ed29e55-0.
INFO 03-02 01:05:22 [logger.py:42] Received request cmpl-fa285e0f6b06482387012c8bc1f7ee4b-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:05:22 [async_llm.py:261] Added request cmpl-fa285e0f6b06482387012c8bc1f7ee4b-0.
INFO 03-02 01:05:23 [logger.py:42] Received request cmpl-8a760f5dc7aa4b97a3ae35343fd55534-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:05:23 [async_llm.py:261] Added request cmpl-8a760f5dc7aa4b97a3ae35343fd55534-0.
INFO 03-02 01:05:24 [logger.py:42] Received request cmpl-e000616eab984230b6e6789c8c0826d5-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:05:24 [async_llm.py:261] Added request cmpl-e000616eab984230b6e6789c8c0826d5-0.
INFO 03-02 01:05:25 [logger.py:42] Received request cmpl-ea7b821100524d4e8da6ac49fa490e3c-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:05:25 [async_llm.py:261] Added request cmpl-ea7b821100524d4e8da6ac49fa490e3c-0.
INFO 03-02 01:05:26 [logger.py:42] Received request cmpl-a4abc9a8bd5f4dfa85c9657cddf74f15-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:05:26 [async_llm.py:261] Added request cmpl-a4abc9a8bd5f4dfa85c9657cddf74f15-0.
INFO 03-02 01:05:28 [logger.py:42] Received request cmpl-788e078d3d5146569f19dbca5a194555-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:05:28 [async_llm.py:261] Added request cmpl-788e078d3d5146569f19dbca5a194555-0.
INFO 03-02 01:05:28 [loggers.py:116] Engine 000: Avg prompt throughput: 27.9 tokens/s, Avg generation throughput: 4.2 tokens/s, Running: 1 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.3%, Prefix cache hit rate: 51.6%
INFO 03-02 01:05:29 [logger.py:42] Received request cmpl-f95b592822934a0ebdfe1d73307e7c6b-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:05:29 [async_llm.py:261] Added request cmpl-f95b592822934a0ebdfe1d73307e7c6b-0.
INFO 03-02 01:05:30 [logger.py:42] Received request cmpl-20cc76af8970415198b103ae81497b38-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:05:30 [async_llm.py:261] Added request cmpl-20cc76af8970415198b103ae81497b38-0.
INFO 03-02 01:05:31 [logger.py:42] Received request cmpl-4c7e1bbc7a9b4fd2ba012c84e0dae782-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:05:31 [async_llm.py:261] Added request cmpl-4c7e1bbc7a9b4fd2ba012c84e0dae782-0.
INFO 03-02 01:05:32 [logger.py:42] Received request cmpl-8e7a447cd4d7438da780de0b2be5e2b1-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:05:32 [async_llm.py:261] Added request cmpl-8e7a447cd4d7438da780de0b2be5e2b1-0.
INFO 03-02 01:05:33 [logger.py:42] Received request cmpl-dd50de325592493f9af42ef0a26fc9fa-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:05:33 [async_llm.py:261] Added request cmpl-dd50de325592493f9af42ef0a26fc9fa-0.
INFO 03-02 01:05:34 [logger.py:42] Received request cmpl-cf8a0d7d883041a2803e2659f72fa4f2-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:05:34 [async_llm.py:261] Added request cmpl-cf8a0d7d883041a2803e2659f72fa4f2-0.
INFO 03-02 01:05:36 [logger.py:42] Received request cmpl-50c67767a0674ec880aaee482e9e4638-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:05:36 [async_llm.py:261] Added request cmpl-50c67767a0674ec880aaee482e9e4638-0.
INFO 03-02 01:05:37 [logger.py:42] Received request cmpl-2e18c255980c498fb267e3a5e999cec2-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:05:37 [async_llm.py:261] Added request cmpl-2e18c255980c498fb267e3a5e999cec2-0.
INFO 03-02 01:05:38 [loggers.py:116] Engine 000: Avg prompt throughput: 24.8 tokens/s, Avg generation throughput: 4.3 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.6%
INFO 03-02 01:05:38 [logger.py:42] Received request cmpl-e964026c30b94257acd692cb3a65342a-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:05:38 [async_llm.py:261] Added request cmpl-e964026c30b94257acd692cb3a65342a-0.
INFO 03-02 01:05:39 [logger.py:42] Received request cmpl-df944427c8984375a1ddaeb42a23ed07-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:05:39 [async_llm.py:261] Added request cmpl-df944427c8984375a1ddaeb42a23ed07-0.
INFO 03-02 01:05:40 [logger.py:42] Received request cmpl-b2768e2cc1e2479aa87a72d38af52979-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:05:40 [async_llm.py:261] Added request cmpl-b2768e2cc1e2479aa87a72d38af52979-0.
INFO 03-02 01:05:41 [logger.py:42] Received request cmpl-c57c3ca0c15243a2ab49983543675243-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:05:41 [async_llm.py:261] Added request cmpl-c57c3ca0c15243a2ab49983543675243-0.
INFO 03-02 01:05:43 [logger.py:42] Received request cmpl-eb024b498b7f48ecbc33f5123b0ef800-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:05:43 [async_llm.py:261] Added request cmpl-eb024b498b7f48ecbc33f5123b0ef800-0.
INFO 03-02 01:05:44 [logger.py:42] Received request cmpl-6320af66216c4bf686a31d071c9b2e9e-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:05:44 [async_llm.py:261] Added request cmpl-6320af66216c4bf686a31d071c9b2e9e-0.
INFO 03-02 01:05:45 [logger.py:42] Received request cmpl-a4e8b8c165804615bfd240c275fe9a48-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:05:45 [async_llm.py:261] Added request cmpl-a4e8b8c165804615bfd240c275fe9a48-0.
INFO 03-02 01:05:46 [logger.py:42] Received request cmpl-9f3eb5fb1cc24fde97a603004243e780-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:05:46 [async_llm.py:261] Added request cmpl-9f3eb5fb1cc24fde97a603004243e780-0.
INFO 03-02 01:05:47 [logger.py:42] Received request cmpl-9a8b8ec30e2447948256c6b152c11c27-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:05:47 [async_llm.py:261] Added request cmpl-9a8b8ec30e2447948256c6b152c11c27-0.
INFO 03-02 01:05:48 [loggers.py:116] Engine 000: Avg prompt throughput: 27.9 tokens/s, Avg generation throughput: 4.5 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.6%
INFO 03-02 01:05:48 [logger.py:42] Received request cmpl-de6d9c8916c748d0ba4f27ff8506bc78-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:05:48 [async_llm.py:261] Added request cmpl-de6d9c8916c748d0ba4f27ff8506bc78-0.
INFO 03-02 01:05:49 [logger.py:42] Received request cmpl-1e95c69622bc480dbcfa6540673a1b81-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:05:49 [async_llm.py:261] Added request cmpl-1e95c69622bc480dbcfa6540673a1b81-0.
INFO 03-02 01:05:51 [logger.py:42] Received request cmpl-ac8fb099b2e1490aa9688a934c2d9a87-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:05:51 [async_llm.py:261] Added request cmpl-ac8fb099b2e1490aa9688a934c2d9a87-0.
INFO 03-02 01:05:52 [logger.py:42] Received request cmpl-0cae79d5354146c1be653a6052259f1c-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:05:52 [async_llm.py:261] Added request cmpl-0cae79d5354146c1be653a6052259f1c-0.
INFO 03-02 01:05:53 [logger.py:42] Received request cmpl-d277497d09ac432382a395da78abc8ee-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:05:53 [async_llm.py:261] Added request cmpl-d277497d09ac432382a395da78abc8ee-0.
INFO 03-02 01:05:54 [logger.py:42] Received request cmpl-c33430bc31ae46a6b93fe2289bc91d1d-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:05:54 [async_llm.py:261] Added request cmpl-c33430bc31ae46a6b93fe2289bc91d1d-0.
INFO 03-02 01:05:55 [logger.py:42] Received request cmpl-81f543ce69c74261821816fd5c02a769-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:05:55 [async_llm.py:261] Added request cmpl-81f543ce69c74261821816fd5c02a769-0.
INFO 03-02 01:05:56 [logger.py:42] Received request cmpl-2bc72a866a2348acae9c3bcf7e531705-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:05:56 [async_llm.py:261] Added request cmpl-2bc72a866a2348acae9c3bcf7e531705-0.
INFO 03-02 01:05:58 [logger.py:42] Received request cmpl-6d56ce0d16764088ad76cf99e2e67233-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:05:58 [async_llm.py:261] Added request cmpl-6d56ce0d16764088ad76cf99e2e67233-0.
INFO 03-02 01:05:58 [loggers.py:116] Engine 000: Avg prompt throughput: 27.9 tokens/s, Avg generation throughput: 4.2 tokens/s, Running: 1 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.3%, Prefix cache hit rate: 51.6%
INFO 03-02 01:05:59 [logger.py:42] Received request cmpl-cb950b837a5344e1ab778e3bf3b87774-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:05:59 [async_llm.py:261] Added request cmpl-cb950b837a5344e1ab778e3bf3b87774-0.
INFO 03-02 01:06:00 [logger.py:42] Received request cmpl-b31149d462d04e168ad445626e7635c9-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:06:00 [async_llm.py:261] Added request cmpl-b31149d462d04e168ad445626e7635c9-0.
INFO 03-02 01:06:01 [logger.py:42] Received request cmpl-fbcba17242374e23930bbe31a2635ffe-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:06:01 [async_llm.py:261] Added request cmpl-fbcba17242374e23930bbe31a2635ffe-0.
INFO 03-02 01:06:02 [logger.py:42] Received request cmpl-249d523f27fd4574875532d5215b9783-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:06:02 [async_llm.py:261] Added request cmpl-249d523f27fd4574875532d5215b9783-0.
INFO 03-02 01:06:03 [logger.py:42] Received request cmpl-9f78a3f93ae8470181e2d6c26f72587f-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:06:03 [async_llm.py:261] Added request cmpl-9f78a3f93ae8470181e2d6c26f72587f-0.
INFO 03-02 01:06:04 [logger.py:42] Received request cmpl-ee0de3a329994670b65deebacb33f3f3-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:06:04 [async_llm.py:261] Added request cmpl-ee0de3a329994670b65deebacb33f3f3-0.
INFO 03-02 01:06:06 [logger.py:42] Received request cmpl-67bfe999d3b74011a2a1dba44a6dac93-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:06:06 [async_llm.py:261] Added request cmpl-67bfe999d3b74011a2a1dba44a6dac93-0.
INFO 03-02 01:06:07 [logger.py:42] Received request cmpl-28cb6916426342b399b4d2337498c125-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:06:07 [async_llm.py:261] Added request cmpl-28cb6916426342b399b4d2337498c125-0.
INFO 03-02 01:06:08 [loggers.py:116] Engine 000: Avg prompt throughput: 24.8 tokens/s, Avg generation throughput: 4.3 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.6%
INFO 03-02 01:06:08 [logger.py:42] Received request cmpl-9b621117567b4b5a99ffb4d9c6ebe302-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:06:08 [async_llm.py:261] Added request cmpl-9b621117567b4b5a99ffb4d9c6ebe302-0.
INFO 03-02 01:06:09 [logger.py:42] Received request cmpl-6aabc4f9520a4eec9cdb8477ee3edc01-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:06:09 [async_llm.py:261] Added request cmpl-6aabc4f9520a4eec9cdb8477ee3edc01-0.
INFO 03-02 01:06:10 [logger.py:42] Received request cmpl-78723cae4010440094bad31d0fc2596b-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:06:10 [async_llm.py:261] Added request cmpl-78723cae4010440094bad31d0fc2596b-0.
INFO 03-02 01:06:11 [logger.py:42] Received request cmpl-ab3cbe302522412a81bdc5cb79997844-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:06:11 [async_llm.py:261] Added request cmpl-ab3cbe302522412a81bdc5cb79997844-0.
INFO 03-02 01:06:13 [logger.py:42] Received request cmpl-e46b9b2b00274841b470a4ade65a7b27-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:06:13 [async_llm.py:261] Added request cmpl-e46b9b2b00274841b470a4ade65a7b27-0.
INFO 03-02 01:06:14 [logger.py:42] Received request cmpl-45dfbf02249c4e29af1ea7aa660e4de2-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:06:14 [async_llm.py:261] Added request cmpl-45dfbf02249c4e29af1ea7aa660e4de2-0.
INFO 03-02 01:06:15 [logger.py:42] Received request cmpl-b8c7e6edcc494b559c1d296465aaa60c-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:06:15 [async_llm.py:261] Added request cmpl-b8c7e6edcc494b559c1d296465aaa60c-0.
INFO 03-02 01:06:16 [logger.py:42] Received request cmpl-6eb00f0abda94fb2bb678f197ef69470-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:06:16 [async_llm.py:261] Added request cmpl-6eb00f0abda94fb2bb678f197ef69470-0.
INFO 03-02 01:06:17 [logger.py:42] Received request cmpl-15182363f07a4146bfe654bf0ae5f731-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:06:17 [async_llm.py:261] Added request cmpl-15182363f07a4146bfe654bf0ae5f731-0.
INFO 03-02 01:06:18 [loggers.py:116] Engine 000: Avg prompt throughput: 27.9 tokens/s, Avg generation throughput: 4.5 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.6%
INFO 03-02 01:06:18 [logger.py:42] Received request cmpl-4eb67bde02de4a46bdc4fd89c01db6c3-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:06:18 [async_llm.py:261] Added request cmpl-4eb67bde02de4a46bdc4fd89c01db6c3-0.
INFO 03-02 01:06:19 [logger.py:42] Received request cmpl-176781e7e07d4383974e1085b244892d-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:06:19 [async_llm.py:261] Added request cmpl-176781e7e07d4383974e1085b244892d-0.
INFO 03-02 01:06:21 [logger.py:42] Received request cmpl-79f4c77374784dff98e8f5fbca05d97d-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:06:21 [async_llm.py:261] Added request cmpl-79f4c77374784dff98e8f5fbca05d97d-0.
INFO 03-02 01:06:22 [logger.py:42] Received request cmpl-8f05313220b849168786e59d805557b3-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:06:22 [async_llm.py:261] Added request cmpl-8f05313220b849168786e59d805557b3-0.
INFO 03-02 01:06:23 [logger.py:42] Received request cmpl-131a00f05b344fb7a8ab97586e9060cc-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:06:23 [async_llm.py:261] Added request cmpl-131a00f05b344fb7a8ab97586e9060cc-0.
INFO 03-02 01:06:24 [logger.py:42] Received request cmpl-ff45a5c1c62145e585dd557355b8de51-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:06:24 [async_llm.py:261] Added request cmpl-ff45a5c1c62145e585dd557355b8de51-0.
INFO 03-02 01:06:25 [logger.py:42] Received request cmpl-a57dbe732d8c462f81182af501049eff-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:06:25 [async_llm.py:261] Added request cmpl-a57dbe732d8c462f81182af501049eff-0.
INFO 03-02 01:06:26 [logger.py:42] Received request cmpl-e313220f4ebf4356b9e7af92d9fe38ca-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:06:26 [async_llm.py:261] Added request cmpl-e313220f4ebf4356b9e7af92d9fe38ca-0.
INFO 03-02 01:06:28 [logger.py:42] Received request cmpl-3eb9eaba44504cd18b0dc3ee8042a745-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:06:28 [async_llm.py:261] Added request cmpl-3eb9eaba44504cd18b0dc3ee8042a745-0.
INFO 03-02 01:06:28 [loggers.py:116] Engine 000: Avg prompt throughput: 27.9 tokens/s, Avg generation throughput: 4.3 tokens/s, Running: 1 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.4%, Prefix cache hit rate: 51.6%
INFO 03-02 01:06:29 [logger.py:42] Received request cmpl-470bd2ac412245deaf31fd32951272db-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:06:29 [async_llm.py:261] Added request cmpl-470bd2ac412245deaf31fd32951272db-0.
INFO 03-02 01:06:30 [logger.py:42] Received request cmpl-5b91f8c9855749c99c29c87ee5ef8a84-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:06:30 [async_llm.py:261] Added request cmpl-5b91f8c9855749c99c29c87ee5ef8a84-0.
INFO 03-02 01:06:31 [logger.py:42] Received request cmpl-04af2b4fec2e4e5c9d9aaf31d00f0f13-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:06:31 [async_llm.py:261] Added request cmpl-04af2b4fec2e4e5c9d9aaf31d00f0f13-0.
INFO 03-02 01:06:32 [logger.py:42] Received request cmpl-92497ff366da41faab4dd921a41fa6d7-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:06:32 [async_llm.py:261] Added request cmpl-92497ff366da41faab4dd921a41fa6d7-0.
INFO 03-02 01:06:33 [logger.py:42] Received request cmpl-bbbf13a4bf4d414eba3dd23fb1fe237d-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:06:33 [async_llm.py:261] Added request cmpl-bbbf13a4bf4d414eba3dd23fb1fe237d-0.
INFO 03-02 01:06:34 [logger.py:42] Received request cmpl-62a581d959444d2faca85ce346ae5759-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:06:34 [async_llm.py:261] Added request cmpl-62a581d959444d2faca85ce346ae5759-0.
INFO 03-02 01:06:36 [logger.py:42] Received request cmpl-d679af7143214a35a9a4e9b98d4f353e-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:06:36 [async_llm.py:261] Added request cmpl-d679af7143214a35a9a4e9b98d4f353e-0.
INFO 03-02 01:06:37 [logger.py:42] Received request cmpl-0d4e65afa91540eabb0ebd9c78bfbe8b-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:06:37 [async_llm.py:261] Added request cmpl-0d4e65afa91540eabb0ebd9c78bfbe8b-0.
INFO 03-02 01:06:38 [loggers.py:116] Engine 000: Avg prompt throughput: 24.8 tokens/s, Avg generation throughput: 4.2 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.6%
INFO 03-02 01:06:38 [logger.py:42] Received request cmpl-04dacef217924a09ac52900916976a65-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:06:38 [async_llm.py:261] Added request cmpl-04dacef217924a09ac52900916976a65-0.
INFO 03-02 01:06:39 [logger.py:42] Received request cmpl-f527b63a2e834a589114747c2724325a-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:06:39 [async_llm.py:261] Added request cmpl-f527b63a2e834a589114747c2724325a-0.
INFO 03-02 01:06:40 [logger.py:42] Received request cmpl-68ebe29006ee4496b72638d9ecd38eee-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:06:40 [async_llm.py:261] Added request cmpl-68ebe29006ee4496b72638d9ecd38eee-0.
INFO 03-02 01:06:41 [logger.py:42] Received request cmpl-68b1d21206fb440887bfa9f016faf476-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:06:41 [async_llm.py:261] Added request cmpl-68b1d21206fb440887bfa9f016faf476-0.
INFO 03-02 01:06:43 [logger.py:42] Received request cmpl-bae421ac71e8408b9d93694445435c72-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:06:43 [async_llm.py:261] Added request cmpl-bae421ac71e8408b9d93694445435c72-0.
INFO 03-02 01:06:44 [logger.py:42] Received request cmpl-8827990728a94f299f59f3d58c4024ab-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:06:44 [async_llm.py:261] Added request cmpl-8827990728a94f299f59f3d58c4024ab-0.
INFO 03-02 01:06:45 [logger.py:42] Received request cmpl-674989c5744140a6882d1c953c458a27-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:06:45 [async_llm.py:261] Added request cmpl-674989c5744140a6882d1c953c458a27-0.
INFO 03-02 01:06:46 [logger.py:42] Received request cmpl-2905ec26f3404f2b9df5542e35675571-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:06:46 [async_llm.py:261] Added request cmpl-2905ec26f3404f2b9df5542e35675571-0.
INFO 03-02 01:06:47 [logger.py:42] Received request cmpl-8b1fddc8133941a6bad29237e8e7c082-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:06:47 [async_llm.py:261] Added request cmpl-8b1fddc8133941a6bad29237e8e7c082-0.
INFO 03-02 01:06:48 [loggers.py:116] Engine 000: Avg prompt throughput: 27.9 tokens/s, Avg generation throughput: 4.5 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.6%
INFO 03-02 01:06:48 [logger.py:42] Received request cmpl-b279377b37784ec38ef731a29560f9ef-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:06:48 [async_llm.py:261] Added request cmpl-b279377b37784ec38ef731a29560f9ef-0.
INFO 03-02 01:06:49 [logger.py:42] Received request cmpl-6846f90bfeca4a4bb9ebd7ecdda89076-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:06:49 [async_llm.py:261] Added request cmpl-6846f90bfeca4a4bb9ebd7ecdda89076-0.
INFO 03-02 01:06:51 [logger.py:42] Received request cmpl-47cb787d3431479e855e85d96e3c3a4e-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:06:51 [async_llm.py:261] Added request cmpl-47cb787d3431479e855e85d96e3c3a4e-0.
INFO 03-02 01:06:52 [logger.py:42] Received request cmpl-88ffb2c3396e42f88e93612a0805c72f-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:06:52 [async_llm.py:261] Added request cmpl-88ffb2c3396e42f88e93612a0805c72f-0.
INFO 03-02 01:06:53 [logger.py:42] Received request cmpl-a6ac917d5bde4553a306f45c41fac035-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:06:53 [async_llm.py:261] Added request cmpl-a6ac917d5bde4553a306f45c41fac035-0.
INFO 03-02 01:06:54 [logger.py:42] Received request cmpl-99bd94cb9a6d4104b99235a1396c7897-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:06:54 [async_llm.py:261] Added request cmpl-99bd94cb9a6d4104b99235a1396c7897-0.
INFO 03-02 01:06:55 [logger.py:42] Received request cmpl-b12ca3166e96421483ab6d747ccae24b-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:06:55 [async_llm.py:261] Added request cmpl-b12ca3166e96421483ab6d747ccae24b-0.
INFO 03-02 01:06:56 [logger.py:42] Received request cmpl-fbb433191075420c803b2492008c8c16-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:06:56 [async_llm.py:261] Added request cmpl-fbb433191075420c803b2492008c8c16-0.
INFO 03-02 01:06:58 [logger.py:42] Received request cmpl-b8b1f680de184b039277d0baf2c7fc2b-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:06:58 [async_llm.py:261] Added request cmpl-b8b1f680de184b039277d0baf2c7fc2b-0.
INFO 03-02 01:06:58 [loggers.py:116] Engine 000: Avg prompt throughput: 27.9 tokens/s, Avg generation throughput: 4.2 tokens/s, Running: 1 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.3%, Prefix cache hit rate: 51.6%
INFO 03-02 01:06:59 [logger.py:42] Received request cmpl-4ac0011b978546f586a6036d37caee4f-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:06:59 [async_llm.py:261] Added request cmpl-4ac0011b978546f586a6036d37caee4f-0.
INFO 03-02 01:07:00 [logger.py:42] Received request cmpl-ee84c24f3fc14ca8961c19179ba2e9b6-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:07:00 [async_llm.py:261] Added request cmpl-ee84c24f3fc14ca8961c19179ba2e9b6-0.
INFO 03-02 01:07:01 [logger.py:42] Received request cmpl-ed1e112d35f54d12b47d3346752b1267-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:07:01 [async_llm.py:261] Added request cmpl-ed1e112d35f54d12b47d3346752b1267-0.
INFO 03-02 01:07:02 [logger.py:42] Received request cmpl-4c413925660f413f89a91e46bbfe6051-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:07:02 [async_llm.py:261] Added request cmpl-4c413925660f413f89a91e46bbfe6051-0.
INFO 03-02 01:07:03 [logger.py:42] Received request cmpl-41aa11d19b1d46a0948b3578bd0346ac-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:07:03 [async_llm.py:261] Added request cmpl-41aa11d19b1d46a0948b3578bd0346ac-0.
INFO 03-02 01:07:05 [logger.py:42] Received request cmpl-2f8f3176db074e028519f1c91f8c0bb1-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:07:05 [async_llm.py:261] Added request cmpl-2f8f3176db074e028519f1c91f8c0bb1-0.
INFO 03-02 01:07:06 [logger.py:42] Received request cmpl-f6588debb10942da92bad085f7d87216-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:07:06 [async_llm.py:261] Added request cmpl-f6588debb10942da92bad085f7d87216-0.
INFO 03-02 01:07:07 [logger.py:42] Received request cmpl-83fcf3c53f484bc5af1bf1297955ec5d-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:07:07 [async_llm.py:261] Added request cmpl-83fcf3c53f484bc5af1bf1297955ec5d-0.
INFO 03-02 01:07:08 [loggers.py:116] Engine 000: Avg prompt throughput: 24.8 tokens/s, Avg generation throughput: 4.3 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.6%
INFO 03-02 01:07:08 [logger.py:42] Received request cmpl-9849f5f3c5f7457cb3003772fab9e88f-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:07:08 [async_llm.py:261] Added request cmpl-9849f5f3c5f7457cb3003772fab9e88f-0.
INFO 03-02 01:07:09 [logger.py:42] Received request cmpl-54b8136ca2f44f5893bad7375e50fb12-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:07:09 [async_llm.py:261] Added request cmpl-54b8136ca2f44f5893bad7375e50fb12-0.
INFO 03-02 01:07:10 [logger.py:42] Received request cmpl-bac8a8d7dd9c46bfb7a14ea973ec97ad-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:07:10 [async_llm.py:261] Added request cmpl-bac8a8d7dd9c46bfb7a14ea973ec97ad-0.
INFO 03-02 01:07:11 [logger.py:42] Received request cmpl-e0e8da6cb715414dae04996962de7f6d-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:07:11 [async_llm.py:261] Added request cmpl-e0e8da6cb715414dae04996962de7f6d-0.
INFO 03-02 01:07:13 [logger.py:42] Received request cmpl-835a2770c1434c36baf3a3aea5be8d83-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:07:13 [async_llm.py:261] Added request cmpl-835a2770c1434c36baf3a3aea5be8d83-0.
INFO 03-02 01:07:14 [logger.py:42] Received request cmpl-579ad7a0e256448b997bd3a3f547d098-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:07:14 [async_llm.py:261] Added request cmpl-579ad7a0e256448b997bd3a3f547d098-0.
INFO 03-02 01:07:15 [logger.py:42] Received request cmpl-a2ab195f76db436e85251e4a2b7fbf09-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:07:15 [async_llm.py:261] Added request cmpl-a2ab195f76db436e85251e4a2b7fbf09-0.
INFO 03-02 01:07:16 [logger.py:42] Received request cmpl-f5128bc9b15b413ea74a02350c831550-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:07:16 [async_llm.py:261] Added request cmpl-f5128bc9b15b413ea74a02350c831550-0.
INFO 03-02 01:07:17 [logger.py:42] Received request cmpl-281f5b20db8e4b16ab3f7faaec330b4b-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:07:17 [async_llm.py:261] Added request cmpl-281f5b20db8e4b16ab3f7faaec330b4b-0.
INFO 03-02 01:07:18 [loggers.py:116] Engine 000: Avg prompt throughput: 27.9 tokens/s, Avg generation throughput: 4.5 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.6%
INFO 03-02 01:07:18 [logger.py:42] Received request cmpl-65d0ab0367084f299187cf1db970e353-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:07:18 [async_llm.py:261] Added request cmpl-65d0ab0367084f299187cf1db970e353-0.
INFO 03-02 01:07:20 [logger.py:42] Received request cmpl-0c5b03d98176433e805591be183c6020-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:07:20 [async_llm.py:261] Added request cmpl-0c5b03d98176433e805591be183c6020-0.
INFO 03-02 01:07:21 [logger.py:42] Received request cmpl-4638f562017c4b9699d7ab1984e648f2-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:07:21 [async_llm.py:261] Added request cmpl-4638f562017c4b9699d7ab1984e648f2-0.
INFO 03-02 01:07:22 [logger.py:42] Received request cmpl-b9c3ed3acbaf41d08a6360c3a4254f36-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:07:22 [async_llm.py:261] Added request cmpl-b9c3ed3acbaf41d08a6360c3a4254f36-0.
INFO 03-02 01:07:23 [logger.py:42] Received request cmpl-015cc694ad674f41ae1a5d5ad9478dee-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:07:23 [async_llm.py:261] Added request cmpl-015cc694ad674f41ae1a5d5ad9478dee-0.
INFO 03-02 01:07:24 [logger.py:42] Received request cmpl-9f1f1eadf5c04442b0707defb9c8ddbb-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:07:24 [async_llm.py:261] Added request cmpl-9f1f1eadf5c04442b0707defb9c8ddbb-0.
INFO 03-02 01:07:25 [logger.py:42] Received request cmpl-e7a970fc93654a30be180bc20b3aa78f-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:07:25 [async_llm.py:261] Added request cmpl-e7a970fc93654a30be180bc20b3aa78f-0.
INFO 03-02 01:07:26 [logger.py:42] Received request cmpl-544cc6e6eb3a4542b87f14fb64b2906e-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:07:26 [async_llm.py:261] Added request cmpl-544cc6e6eb3a4542b87f14fb64b2906e-0.
INFO 03-02 01:07:28 [logger.py:42] Received request cmpl-314275f290de48de9648bfcd741878b2-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:07:28 [async_llm.py:261] Added request cmpl-314275f290de48de9648bfcd741878b2-0.
INFO 03-02 01:07:28 [loggers.py:116] Engine 000: Avg prompt throughput: 27.9 tokens/s, Avg generation throughput: 4.1 tokens/s, Running: 1 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.3%, Prefix cache hit rate: 51.6%
INFO 03-02 01:07:29 [logger.py:42] Received request cmpl-41c92c00c1684b54a538073f2c92f1cd-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:07:29 [async_llm.py:261] Added request cmpl-41c92c00c1684b54a538073f2c92f1cd-0.
INFO 03-02 01:07:30 [logger.py:42] Received request cmpl-3ce1f88dbae043e28cd2e8362a17f059-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:07:30 [async_llm.py:261] Added request cmpl-3ce1f88dbae043e28cd2e8362a17f059-0.
INFO 03-02 01:07:31 [logger.py:42] Received request cmpl-8c3173bb22d041239d3aefddc2dbfa28-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:07:31 [async_llm.py:261] Added request cmpl-8c3173bb22d041239d3aefddc2dbfa28-0.
INFO 03-02 01:07:32 [logger.py:42] Received request cmpl-fae5366a13f94d64a24c9abc1b8574be-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:07:32 [async_llm.py:261] Added request cmpl-fae5366a13f94d64a24c9abc1b8574be-0.
INFO 03-02 01:07:33 [logger.py:42] Received request cmpl-b08b63a010b642378298750e62edd427-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:07:33 [async_llm.py:261] Added request cmpl-b08b63a010b642378298750e62edd427-0.
INFO 03-02 01:07:35 [logger.py:42] Received request cmpl-741bb0c94a0a4d529851a5e70a1f0376-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:07:35 [async_llm.py:261] Added request cmpl-741bb0c94a0a4d529851a5e70a1f0376-0.
INFO 03-02 01:07:36 [logger.py:42] Received request cmpl-8c40b86474124ab4aff02a5f86b17e71-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:07:36 [async_llm.py:261] Added request cmpl-8c40b86474124ab4aff02a5f86b17e71-0.
INFO 03-02 01:07:37 [logger.py:42] Received request cmpl-9e9a0b47ed8748c692b6bf0db7d1ea13-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:07:37 [async_llm.py:261] Added request cmpl-9e9a0b47ed8748c692b6bf0db7d1ea13-0.
INFO 03-02 01:07:38 [loggers.py:116] Engine 000: Avg prompt throughput: 24.8 tokens/s, Avg generation throughput: 4.4 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.6%
INFO 03-02 01:07:38 [logger.py:42] Received request cmpl-7dd550c9dec1420ea3c1cf2a75f54d1f-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:07:38 [async_llm.py:261] Added request cmpl-7dd550c9dec1420ea3c1cf2a75f54d1f-0.
INFO 03-02 01:07:39 [logger.py:42] Received request cmpl-32a21130534a4784abb9ca3c1d77aac9-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:07:39 [async_llm.py:261] Added request cmpl-32a21130534a4784abb9ca3c1d77aac9-0.
INFO 03-02 01:07:40 [logger.py:42] Received request cmpl-6a9398ef9cba4b7fa1ac0f4df32d10ed-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:07:40 [async_llm.py:261] Added request cmpl-6a9398ef9cba4b7fa1ac0f4df32d10ed-0.
INFO 03-02 01:07:41 [logger.py:42] Received request cmpl-57a213260f714c84a42c7daab5c465df-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:07:41 [async_llm.py:261] Added request cmpl-57a213260f714c84a42c7daab5c465df-0.
INFO 03-02 01:07:43 [logger.py:42] Received request cmpl-16c51569020446cf8f21625fe6e95c93-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:07:43 [async_llm.py:261] Added request cmpl-16c51569020446cf8f21625fe6e95c93-0.
INFO 03-02 01:07:44 [logger.py:42] Received request cmpl-b569b4669cc1412fbdaf1ad641bedfa1-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:07:44 [async_llm.py:261] Added request cmpl-b569b4669cc1412fbdaf1ad641bedfa1-0.
INFO 03-02 01:07:45 [logger.py:42] Received request cmpl-6c1f8d98c56543c2b26fc0a8e8ba7128-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:07:45 [async_llm.py:261] Added request cmpl-6c1f8d98c56543c2b26fc0a8e8ba7128-0.
INFO 03-02 01:07:46 [logger.py:42] Received request cmpl-3e6237a4294e4f51b4b5746497bc8ca8-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:07:46 [async_llm.py:261] Added request cmpl-3e6237a4294e4f51b4b5746497bc8ca8-0.
INFO 03-02 01:07:47 [logger.py:42] Received request cmpl-9700e046c2594a46b377d9f1b451c7ff-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:07:47 [async_llm.py:261] Added request cmpl-9700e046c2594a46b377d9f1b451c7ff-0.
INFO 03-02 01:07:48 [loggers.py:116] Engine 000: Avg prompt throughput: 27.9 tokens/s, Avg generation throughput: 4.5 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.6%
INFO 03-02 01:07:48 [logger.py:42] Received request cmpl-cca4c71e8c3548af964498ace0620196-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:07:48 [async_llm.py:261] Added request cmpl-cca4c71e8c3548af964498ace0620196-0.
INFO 03-02 01:07:50 [logger.py:42] Received request cmpl-8f029989e33a48438a9a96f1137f29b1-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:07:50 [async_llm.py:261] Added request cmpl-8f029989e33a48438a9a96f1137f29b1-0.
INFO 03-02 01:07:51 [logger.py:42] Received request cmpl-f7f209753acd487598457b302bc98762-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:07:51 [async_llm.py:261] Added request cmpl-f7f209753acd487598457b302bc98762-0.
INFO 03-02 01:07:52 [logger.py:42] Received request cmpl-94c530d090f64e0cba2a8f598a0cf6dc-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:07:52 [async_llm.py:261] Added request cmpl-94c530d090f64e0cba2a8f598a0cf6dc-0.
INFO 03-02 01:07:53 [logger.py:42] Received request cmpl-d020c23588f04b8e9309f3b822212792-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:07:53 [async_llm.py:261] Added request cmpl-d020c23588f04b8e9309f3b822212792-0.
INFO 03-02 01:07:54 [logger.py:42] Received request cmpl-e30d1f1eb4634a2c92444992bfad3948-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:07:54 [async_llm.py:261] Added request cmpl-e30d1f1eb4634a2c92444992bfad3948-0.
INFO 03-02 01:07:55 [logger.py:42] Received request cmpl-d99fda2999cd4e0299ffa4b618d271a6-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:07:55 [async_llm.py:261] Added request cmpl-d99fda2999cd4e0299ffa4b618d271a6-0.
INFO 03-02 01:07:56 [logger.py:42] Received request cmpl-ad485c9c859748b79b49cef7cd187994-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:07:56 [async_llm.py:261] Added request cmpl-ad485c9c859748b79b49cef7cd187994-0.
INFO 03-02 01:07:58 [logger.py:42] Received request cmpl-263b9cbd058c4ca68786871794725921-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:07:58 [async_llm.py:261] Added request cmpl-263b9cbd058c4ca68786871794725921-0.
INFO 03-02 01:07:58 [loggers.py:116] Engine 000: Avg prompt throughput: 24.8 tokens/s, Avg generation throughput: 4.0 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.6%
INFO 03-02 01:07:59 [logger.py:42] Received request cmpl-20f937e734e242299e19b4fe5fb6d575-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:07:59 [async_llm.py:261] Added request cmpl-20f937e734e242299e19b4fe5fb6d575-0.
INFO 03-02 01:08:00 [logger.py:42] Received request cmpl-347fd21da34840028f35d9a6e82d5d95-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:08:00 [async_llm.py:261] Added request cmpl-347fd21da34840028f35d9a6e82d5d95-0.
INFO 03-02 01:08:01 [logger.py:42] Received request cmpl-9c17d90aa4a04b6c82b074b35d5d5135-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:08:01 [async_llm.py:261] Added request cmpl-9c17d90aa4a04b6c82b074b35d5d5135-0.
INFO 03-02 01:08:02 [logger.py:42] Received request cmpl-578ab357de9e4cb7a27f52080463c4cd-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:08:02 [async_llm.py:261] Added request cmpl-578ab357de9e4cb7a27f52080463c4cd-0.
INFO 03-02 01:08:03 [logger.py:42] Received request cmpl-530660c3f61e4e3d9604aeae5c0d7705-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:08:03 [async_llm.py:261] Added request cmpl-530660c3f61e4e3d9604aeae5c0d7705-0.
INFO 03-02 01:08:05 [logger.py:42] Received request cmpl-473e0f12a07944b2991e89f163a04db7-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:08:05 [async_llm.py:261] Added request cmpl-473e0f12a07944b2991e89f163a04db7-0.
INFO 03-02 01:08:06 [logger.py:42] Received request cmpl-8b879c00024a46a49de9ba8a1e268c9e-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:08:06 [async_llm.py:261] Added request cmpl-8b879c00024a46a49de9ba8a1e268c9e-0.
INFO 03-02 01:08:07 [logger.py:42] Received request cmpl-bdcf2e0733094f68b2621221c9d9b604-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:08:07 [async_llm.py:261] Added request cmpl-bdcf2e0733094f68b2621221c9d9b604-0.
INFO 03-02 01:08:08 [loggers.py:116] Engine 000: Avg prompt throughput: 27.9 tokens/s, Avg generation throughput: 4.5 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.6%
INFO 03-02 01:08:08 [logger.py:42] Received request cmpl-4135a98db4ed46a5bac67515f8de08eb-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:08:08 [async_llm.py:261] Added request cmpl-4135a98db4ed46a5bac67515f8de08eb-0.
INFO 03-02 01:08:09 [logger.py:42] Received request cmpl-29185d8718ea4aecab5fb997fd00968c-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:08:09 [async_llm.py:261] Added request cmpl-29185d8718ea4aecab5fb997fd00968c-0.
INFO 03-02 01:08:10 [logger.py:42] Received request cmpl-44af1c52cdb34f008d6cfcc4e234237f-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:08:10 [async_llm.py:261] Added request cmpl-44af1c52cdb34f008d6cfcc4e234237f-0.
INFO 03-02 01:08:12 [logger.py:42] Received request cmpl-b4341d7385c140e68bd28f475f159682-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:08:12 [async_llm.py:261] Added request cmpl-b4341d7385c140e68bd28f475f159682-0.
INFO 03-02 01:08:13 [logger.py:42] Received request cmpl-fc9c0a4e6677465386933da9c5896d58-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:08:13 [async_llm.py:261] Added request cmpl-fc9c0a4e6677465386933da9c5896d58-0.
INFO 03-02 01:08:14 [logger.py:42] Received request cmpl-81c1f22cad7747e887356fccfca7e6da-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:08:14 [async_llm.py:261] Added request cmpl-81c1f22cad7747e887356fccfca7e6da-0.
INFO 03-02 01:08:15 [logger.py:42] Received request cmpl-d7305f53ed0b4d80b79c59a8477a2546-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:08:15 [async_llm.py:261] Added request cmpl-d7305f53ed0b4d80b79c59a8477a2546-0.
INFO 03-02 01:08:16 [logger.py:42] Received request cmpl-4bdfab0a6bd0489eba257d03b145979c-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:08:16 [async_llm.py:261] Added request cmpl-4bdfab0a6bd0489eba257d03b145979c-0.
INFO 03-02 01:08:17 [logger.py:42] Received request cmpl-15b93ab77cbf4982b9192efc2b2ff92d-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:08:17 [async_llm.py:261] Added request cmpl-15b93ab77cbf4982b9192efc2b2ff92d-0.
INFO 03-02 01:08:18 [loggers.py:116] Engine 000: Avg prompt throughput: 27.9 tokens/s, Avg generation throughput: 4.5 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.6%
INFO 03-02 01:08:18 [logger.py:42] Received request cmpl-ef2997c4c94c4616ac12791ec04e6da8-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:08:18 [async_llm.py:261] Added request cmpl-ef2997c4c94c4616ac12791ec04e6da8-0.
INFO 03-02 01:08:20 [logger.py:42] Received request cmpl-47d53c953961407e8ae256416ae4e9fc-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:08:20 [async_llm.py:261] Added request cmpl-47d53c953961407e8ae256416ae4e9fc-0.
INFO 03-02 01:08:21 [logger.py:42] Received request cmpl-f232be90ff574d74a1a9f1864d5524d8-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:08:21 [async_llm.py:261] Added request cmpl-f232be90ff574d74a1a9f1864d5524d8-0.
INFO 03-02 01:08:22 [logger.py:42] Received request cmpl-75198451a2a6484e93c923c4727553b3-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:08:22 [async_llm.py:261] Added request cmpl-75198451a2a6484e93c923c4727553b3-0.
INFO 03-02 01:08:23 [logger.py:42] Received request cmpl-306af9135f7f4e8a97dcb4902aa01879-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:08:23 [async_llm.py:261] Added request cmpl-306af9135f7f4e8a97dcb4902aa01879-0.
INFO 03-02 01:08:24 [logger.py:42] Received request cmpl-8ed10e409ad74da780b4b946fc63f3f8-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:08:24 [async_llm.py:261] Added request cmpl-8ed10e409ad74da780b4b946fc63f3f8-0.
INFO 03-02 01:08:25 [logger.py:42] Received request cmpl-ae0ba67096694907afe1870fca778c7c-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:08:25 [async_llm.py:261] Added request cmpl-ae0ba67096694907afe1870fca778c7c-0.
INFO 03-02 01:08:27 [logger.py:42] Received request cmpl-6c4856067acb4813b76ca29519d02939-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:08:27 [async_llm.py:261] Added request cmpl-6c4856067acb4813b76ca29519d02939-0.
INFO 03-02 01:08:28 [loggers.py:116] Engine 000: Avg prompt throughput: 24.8 tokens/s, Avg generation throughput: 4.0 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.6%
INFO 03-02 01:08:28 [logger.py:42] Received request cmpl-b468dcad92d14c9685143655e312c0ef-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:08:28 [async_llm.py:261] Added request cmpl-b468dcad92d14c9685143655e312c0ef-0.
INFO 03-02 01:08:29 [logger.py:42] Received request cmpl-b01f1f024b504094aa26f5f24e002c10-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:08:29 [async_llm.py:261] Added request cmpl-b01f1f024b504094aa26f5f24e002c10-0.
INFO 03-02 01:08:30 [logger.py:42] Received request cmpl-402e790bdf184dbf9d7e3539bf9d4c2b-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:08:30 [async_llm.py:261] Added request cmpl-402e790bdf184dbf9d7e3539bf9d4c2b-0.
INFO 03-02 01:08:31 [logger.py:42] Received request cmpl-3376ffe8a507490c99d0328edcf04b1c-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:08:31 [async_llm.py:261] Added request cmpl-3376ffe8a507490c99d0328edcf04b1c-0.
INFO 03-02 01:08:32 [logger.py:42] Received request cmpl-ad736a4601914a88aaada1a948bff1c6-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:08:32 [async_llm.py:261] Added request cmpl-ad736a4601914a88aaada1a948bff1c6-0.
INFO 03-02 01:08:33 [logger.py:42] Received request cmpl-3eff068385284aacac325b74eb6643f4-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:08:33 [async_llm.py:261] Added request cmpl-3eff068385284aacac325b74eb6643f4-0.
INFO 03-02 01:08:35 [logger.py:42] Received request cmpl-f699d06a63ae413cb55a178cdb430d20-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:08:35 [async_llm.py:261] Added request cmpl-f699d06a63ae413cb55a178cdb430d20-0.
INFO 03-02 01:08:36 [logger.py:42] Received request cmpl-8778220ca055414cb5d6225923fdd39c-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:08:36 [async_llm.py:261] Added request cmpl-8778220ca055414cb5d6225923fdd39c-0.
INFO 03-02 01:08:37 [logger.py:42] Received request cmpl-dcb6e5f579744b8faa880e80542ce444-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:08:37 [async_llm.py:261] Added request cmpl-dcb6e5f579744b8faa880e80542ce444-0.
INFO 03-02 01:08:38 [loggers.py:116] Engine 000: Avg prompt throughput: 27.9 tokens/s, Avg generation throughput: 4.5 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.6%
INFO 03-02 01:08:38 [logger.py:42] Received request cmpl-a672035e49814150802eee7b0392a76e-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:08:38 [async_llm.py:261] Added request cmpl-a672035e49814150802eee7b0392a76e-0.
INFO 03-02 01:08:39 [logger.py:42] Received request cmpl-a8d4872df89a4de4b794d903a9bd6d15-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:08:39 [async_llm.py:261] Added request cmpl-a8d4872df89a4de4b794d903a9bd6d15-0.
INFO 03-02 01:08:40 [logger.py:42] Received request cmpl-849765ef9f0548a7b007aec554349a00-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:08:40 [async_llm.py:261] Added request cmpl-849765ef9f0548a7b007aec554349a00-0.
INFO 03-02 01:08:42 [logger.py:42] Received request cmpl-719854a5078144d58ef7351cea77fed3-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:08:42 [async_llm.py:261] Added request cmpl-719854a5078144d58ef7351cea77fed3-0.
INFO 03-02 01:08:43 [logger.py:42] Received request cmpl-16f4ef2021714a72a1de253f55363e3b-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:08:43 [async_llm.py:261] Added request cmpl-16f4ef2021714a72a1de253f55363e3b-0.
INFO 03-02 01:08:44 [logger.py:42] Received request cmpl-28c59dd46a1846d394fedeb226c1d5b9-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:08:44 [async_llm.py:261] Added request cmpl-28c59dd46a1846d394fedeb226c1d5b9-0.
INFO 03-02 01:08:45 [logger.py:42] Received request cmpl-99bdef5345bf4947a940940512572724-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:08:45 [async_llm.py:261] Added request cmpl-99bdef5345bf4947a940940512572724-0.
INFO 03-02 01:08:46 [logger.py:42] Received request cmpl-318bfc2d93a14553b2719a2d50275ea7-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:08:46 [async_llm.py:261] Added request cmpl-318bfc2d93a14553b2719a2d50275ea7-0.
INFO 03-02 01:08:47 [logger.py:42] Received request cmpl-1619f477e8964d4d94f0a58bfad01521-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:08:47 [async_llm.py:261] Added request cmpl-1619f477e8964d4d94f0a58bfad01521-0.
INFO 03-02 01:08:48 [loggers.py:116] Engine 000: Avg prompt throughput: 27.9 tokens/s, Avg generation throughput: 4.5 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.6%
INFO 03-02 01:08:48 [logger.py:42] Received request cmpl-c2763a15c15e4be4b2e02eeef25718b8-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:08:48 [async_llm.py:261] Added request cmpl-c2763a15c15e4be4b2e02eeef25718b8-0.
INFO 03-02 01:08:50 [logger.py:42] Received request cmpl-155688aaba434f3ca89db9733288da76-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:08:50 [async_llm.py:261] Added request cmpl-155688aaba434f3ca89db9733288da76-0.
INFO 03-02 01:08:51 [logger.py:42] Received request cmpl-5a9c187114e14f58a89df468f3471cbc-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:08:51 [async_llm.py:261] Added request cmpl-5a9c187114e14f58a89df468f3471cbc-0.
INFO 03-02 01:08:52 [logger.py:42] Received request cmpl-51fbb8109fbe41298fa31ea84a274a48-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:08:52 [async_llm.py:261] Added request cmpl-51fbb8109fbe41298fa31ea84a274a48-0.
INFO 03-02 01:08:53 [logger.py:42] Received request cmpl-b3d6bd1c63514089bc2daebca36f729a-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:08:53 [async_llm.py:261] Added request cmpl-b3d6bd1c63514089bc2daebca36f729a-0.
INFO 03-02 01:08:54 [logger.py:42] Received request cmpl-a2ee042c4879402bafb7de774fde70fc-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:08:54 [async_llm.py:261] Added request cmpl-a2ee042c4879402bafb7de774fde70fc-0.
INFO 03-02 01:08:55 [logger.py:42] Received request cmpl-a5a32798496f4f209239b635b0775552-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:08:55 [async_llm.py:261] Added request cmpl-a5a32798496f4f209239b635b0775552-0.
INFO 03-02 01:08:57 [logger.py:42] Received request cmpl-ae83d154ba5c450fb2620f547b389f20-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:08:57 [async_llm.py:261] Added request cmpl-ae83d154ba5c450fb2620f547b389f20-0.
INFO 03-02 01:08:58 [loggers.py:116] Engine 000: Avg prompt throughput: 24.8 tokens/s, Avg generation throughput: 4.0 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.6%
INFO 03-02 01:08:58 [logger.py:42] Received request cmpl-f985195516874488911625a498d85ffe-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:08:58 [async_llm.py:261] Added request cmpl-f985195516874488911625a498d85ffe-0.
INFO 03-02 01:08:59 [logger.py:42] Received request cmpl-7d73866f4ae84084a8925a29c9d22457-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:08:59 [async_llm.py:261] Added request cmpl-7d73866f4ae84084a8925a29c9d22457-0.
INFO 03-02 01:09:00 [logger.py:42] Received request cmpl-4bb9d95b2cd94d63bab58f19dd4805ca-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:09:00 [async_llm.py:261] Added request cmpl-4bb9d95b2cd94d63bab58f19dd4805ca-0.
INFO 03-02 01:09:01 [logger.py:42] Received request cmpl-0908a583a95341c2b8ed6159c3d3a177-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:09:01 [async_llm.py:261] Added request cmpl-0908a583a95341c2b8ed6159c3d3a177-0.
INFO 03-02 01:09:02 [logger.py:42] Received request cmpl-b7d963d42436426b9027430f05244b64-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:09:02 [async_llm.py:261] Added request cmpl-b7d963d42436426b9027430f05244b64-0.
INFO 03-02 01:09:03 [logger.py:42] Received request cmpl-6b9afd3ffab943029093e221e8aaaf0a-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:09:03 [async_llm.py:261] Added request cmpl-6b9afd3ffab943029093e221e8aaaf0a-0.
INFO 03-02 01:09:05 [logger.py:42] Received request cmpl-6fe884c6207945b0bbe4f280fb59f07e-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:09:05 [async_llm.py:261] Added request cmpl-6fe884c6207945b0bbe4f280fb59f07e-0.
INFO 03-02 01:09:06 [logger.py:42] Received request cmpl-b063d76b0663426895554174b355ae29-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:09:06 [async_llm.py:261] Added request cmpl-b063d76b0663426895554174b355ae29-0.
INFO 03-02 01:09:07 [logger.py:42] Received request cmpl-b9f69a1cc08e45ffa87920609f848f3d-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:09:07 [async_llm.py:261] Added request cmpl-b9f69a1cc08e45ffa87920609f848f3d-0.
INFO 03-02 01:09:08 [loggers.py:116] Engine 000: Avg prompt throughput: 27.9 tokens/s, Avg generation throughput: 4.5 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.6%
INFO 03-02 01:09:08 [logger.py:42] Received request cmpl-7fd0c63ebc9d4426b4a1957fea5b47d5-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:09:08 [async_llm.py:261] Added request cmpl-7fd0c63ebc9d4426b4a1957fea5b47d5-0.
INFO 03-02 01:09:09 [logger.py:42] Received request cmpl-d50ea11fbeed4f6e924c9ff0b972b588-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:09:09 [async_llm.py:261] Added request cmpl-d50ea11fbeed4f6e924c9ff0b972b588-0.
INFO 03-02 01:09:10 [logger.py:42] Received request cmpl-2d459de581d2469ea43bce89afcfddad-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:09:10 [async_llm.py:261] Added request cmpl-2d459de581d2469ea43bce89afcfddad-0.
INFO 03-02 01:09:12 [logger.py:42] Received request cmpl-4b30c36917264970b34a381446ad4ec5-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:09:12 [async_llm.py:261] Added request cmpl-4b30c36917264970b34a381446ad4ec5-0.
INFO 03-02 01:09:13 [logger.py:42] Received request cmpl-312055316f764b76b5fc6f4c958d1917-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:09:13 [async_llm.py:261] Added request cmpl-312055316f764b76b5fc6f4c958d1917-0.
INFO 03-02 01:09:14 [logger.py:42] Received request cmpl-f9746ddb88c7408f9e2ab427501d5c09-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:09:14 [async_llm.py:261] Added request cmpl-f9746ddb88c7408f9e2ab427501d5c09-0.
INFO 03-02 01:09:15 [logger.py:42] Received request cmpl-b85d8585104b477bb65d4a44fd9371ac-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:09:15 [async_llm.py:261] Added request cmpl-b85d8585104b477bb65d4a44fd9371ac-0.
INFO 03-02 01:09:16 [logger.py:42] Received request cmpl-f7bdc6646a2b4795b0c817bccd9802de-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:09:16 [async_llm.py:261] Added request cmpl-f7bdc6646a2b4795b0c817bccd9802de-0.
INFO 03-02 01:09:17 [logger.py:42] Received request cmpl-42c23d55d76f40e6aa650d935f9a92bb-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:09:17 [async_llm.py:261] Added request cmpl-42c23d55d76f40e6aa650d935f9a92bb-0.
INFO 03-02 01:09:18 [loggers.py:116] Engine 000: Avg prompt throughput: 27.9 tokens/s, Avg generation throughput: 4.5 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.6%
INFO 03-02 01:09:18 [logger.py:42] Received request cmpl-350a202972f9470f9e960b753783aaa6-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:09:18 [async_llm.py:261] Added request cmpl-350a202972f9470f9e960b753783aaa6-0.
INFO 03-02 01:09:20 [logger.py:42] Received request cmpl-3eaa0ca310fd412c9ce66a6a0b7e7f87-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:09:20 [async_llm.py:261] Added request cmpl-3eaa0ca310fd412c9ce66a6a0b7e7f87-0.
INFO 03-02 01:09:21 [logger.py:42] Received request cmpl-43a56d53540f4377aa6957757e65a01b-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:09:21 [async_llm.py:261] Added request cmpl-43a56d53540f4377aa6957757e65a01b-0.
INFO 03-02 01:09:22 [logger.py:42] Received request cmpl-e8589b8a611f42b59a2ff1877a5d69ff-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:09:22 [async_llm.py:261] Added request cmpl-e8589b8a611f42b59a2ff1877a5d69ff-0.
INFO 03-02 01:09:23 [logger.py:42] Received request cmpl-008d0a057b0b4366ba6a11870c297233-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:09:23 [async_llm.py:261] Added request cmpl-008d0a057b0b4366ba6a11870c297233-0.
INFO 03-02 01:09:24 [logger.py:42] Received request cmpl-45733dcba213434a9fbdba8a837982a5-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:09:24 [async_llm.py:261] Added request cmpl-45733dcba213434a9fbdba8a837982a5-0.
INFO 03-02 01:09:25 [logger.py:42] Received request cmpl-f905d5eb028b468a8588baf9190a8c71-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:09:25 [async_llm.py:261] Added request cmpl-f905d5eb028b468a8588baf9190a8c71-0.
INFO 03-02 01:09:27 [logger.py:42] Received request cmpl-8c71277baaca43d4b57d45f40c8ad96f-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:09:27 [async_llm.py:261] Added request cmpl-8c71277baaca43d4b57d45f40c8ad96f-0.
INFO 03-02 01:09:28 [loggers.py:116] Engine 000: Avg prompt throughput: 24.8 tokens/s, Avg generation throughput: 4.0 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.6%
INFO 03-02 01:09:28 [logger.py:42] Received request cmpl-aa95a5b538694dbdb95715b3dcd3c7bc-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:09:28 [async_llm.py:261] Added request cmpl-aa95a5b538694dbdb95715b3dcd3c7bc-0.
INFO 03-02 01:09:29 [logger.py:42] Received request cmpl-87140fd38afc43d4a9ff9b87923f0e6f-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:09:29 [async_llm.py:261] Added request cmpl-87140fd38afc43d4a9ff9b87923f0e6f-0.
INFO 03-02 01:09:30 [logger.py:42] Received request cmpl-b6ccdc0b67294a6bb1fd5f3892d0d4ef-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:09:30 [async_llm.py:261] Added request cmpl-b6ccdc0b67294a6bb1fd5f3892d0d4ef-0.
INFO 03-02 01:09:31 [logger.py:42] Received request cmpl-d6fea226122f46ac90969fe7ba1c3758-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:09:31 [async_llm.py:261] Added request cmpl-d6fea226122f46ac90969fe7ba1c3758-0.
INFO 03-02 01:09:32 [logger.py:42] Received request cmpl-99b9be68a22543c48e51a58e9f8b4dc8-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:09:32 [async_llm.py:261] Added request cmpl-99b9be68a22543c48e51a58e9f8b4dc8-0.
INFO 03-02 01:09:33 [logger.py:42] Received request cmpl-0e62e22bca2048ec964f92d68ddea1fb-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:09:33 [async_llm.py:261] Added request cmpl-0e62e22bca2048ec964f92d68ddea1fb-0.
INFO 03-02 01:09:35 [logger.py:42] Received request cmpl-8149abb9b3c0469d8f03d14f367576ef-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:09:35 [async_llm.py:261] Added request cmpl-8149abb9b3c0469d8f03d14f367576ef-0.
INFO 03-02 01:09:36 [logger.py:42] Received request cmpl-5b5663f9d28f424dbb87904bfa3d09cf-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:09:36 [async_llm.py:261] Added request cmpl-5b5663f9d28f424dbb87904bfa3d09cf-0.
INFO 03-02 01:09:37 [logger.py:42] Received request cmpl-0e7735e620d847efb872e00660aab2cd-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:09:37 [async_llm.py:261] Added request cmpl-0e7735e620d847efb872e00660aab2cd-0.
INFO 03-02 01:09:38 [loggers.py:116] Engine 000: Avg prompt throughput: 27.9 tokens/s, Avg generation throughput: 4.5 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.6%
INFO 03-02 01:09:38 [logger.py:42] Received request cmpl-67e85db28129431e9e8fd03eb9dea0a8-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:09:38 [async_llm.py:261] Added request cmpl-67e85db28129431e9e8fd03eb9dea0a8-0.
INFO 03-02 01:09:39 [logger.py:42] Received request cmpl-88cdd09432f6471a93525b18f3f5ac38-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:09:39 [async_llm.py:261] Added request cmpl-88cdd09432f6471a93525b18f3f5ac38-0.
INFO 03-02 01:09:40 [logger.py:42] Received request cmpl-d55bcbc235d343f8806fdfbe7bda79ed-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:09:40 [async_llm.py:261] Added request cmpl-d55bcbc235d343f8806fdfbe7bda79ed-0.
INFO 03-02 01:09:42 [logger.py:42] Received request cmpl-93c4c7e3db3a483994c7d3ab97d68e0f-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:09:42 [async_llm.py:261] Added request cmpl-93c4c7e3db3a483994c7d3ab97d68e0f-0.
INFO 03-02 01:09:43 [logger.py:42] Received request cmpl-ec2ec4ddccfd45d1bb50f6bad669f9f3-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:09:43 [async_llm.py:261] Added request cmpl-ec2ec4ddccfd45d1bb50f6bad669f9f3-0.
INFO 03-02 01:09:44 [logger.py:42] Received request cmpl-5743f5d0bcd6454886d404f5ceddd627-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:09:44 [async_llm.py:261] Added request cmpl-5743f5d0bcd6454886d404f5ceddd627-0.
INFO 03-02 01:09:45 [logger.py:42] Received request cmpl-de25728c505243f8abb5f470cb91f581-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:09:45 [async_llm.py:261] Added request cmpl-de25728c505243f8abb5f470cb91f581-0.
INFO 03-02 01:09:46 [logger.py:42] Received request cmpl-74cdb0e5baf54df6b5259f2f1bbced62-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:09:46 [async_llm.py:261] Added request cmpl-74cdb0e5baf54df6b5259f2f1bbced62-0.
INFO 03-02 01:09:47 [logger.py:42] Received request cmpl-f3b51a95845e4899bd32e3ca27445b2e-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:09:47 [async_llm.py:261] Added request cmpl-f3b51a95845e4899bd32e3ca27445b2e-0.
INFO 03-02 01:09:48 [loggers.py:116] Engine 000: Avg prompt throughput: 27.9 tokens/s, Avg generation throughput: 4.5 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.6%
INFO 03-02 01:09:48 [logger.py:42] Received request cmpl-2a633c7e16654a57949e0222a034a0ab-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:09:48 [async_llm.py:261] Added request cmpl-2a633c7e16654a57949e0222a034a0ab-0.
INFO 03-02 01:09:50 [logger.py:42] Received request cmpl-ffec8315366c43c7b5e35e377f238060-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:09:50 [async_llm.py:261] Added request cmpl-ffec8315366c43c7b5e35e377f238060-0.
INFO 03-02 01:09:51 [logger.py:42] Received request cmpl-2f0e8cf4ed7c4db8b271040e9f4d86ab-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:09:51 [async_llm.py:261] Added request cmpl-2f0e8cf4ed7c4db8b271040e9f4d86ab-0.
INFO 03-02 01:09:52 [logger.py:42] Received request cmpl-12df564e14844dbcb2d438517c920357-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:09:52 [async_llm.py:261] Added request cmpl-12df564e14844dbcb2d438517c920357-0.
INFO 03-02 01:09:53 [logger.py:42] Received request cmpl-6c15eb48b911435d99a97741cfca182d-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:09:53 [async_llm.py:261] Added request cmpl-6c15eb48b911435d99a97741cfca182d-0.
INFO 03-02 01:09:54 [logger.py:42] Received request cmpl-e8b40b8868fa49b4ab618908803f29df-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:09:54 [async_llm.py:261] Added request cmpl-e8b40b8868fa49b4ab618908803f29df-0.
INFO 03-02 01:09:55 [logger.py:42] Received request cmpl-b7d33a23e1dd494da7a9cdea5350ac45-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:09:55 [async_llm.py:261] Added request cmpl-b7d33a23e1dd494da7a9cdea5350ac45-0.
INFO 03-02 01:09:57 [logger.py:42] Received request cmpl-7e37ffd553dc4bbd80aa5f370a54e41d-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:09:57 [async_llm.py:261] Added request cmpl-7e37ffd553dc4bbd80aa5f370a54e41d-0.
INFO 03-02 01:09:58 [loggers.py:116] Engine 000: Avg prompt throughput: 24.8 tokens/s, Avg generation throughput: 4.0 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.6%
INFO 03-02 01:09:58 [logger.py:42] Received request cmpl-a81c6fa07d7a4b28a78ac3c9a09acbe6-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:09:58 [async_llm.py:261] Added request cmpl-a81c6fa07d7a4b28a78ac3c9a09acbe6-0.
INFO 03-02 01:09:59 [logger.py:42] Received request cmpl-b800a0c5fed6436da98821f65c6495c9-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:09:59 [async_llm.py:261] Added request cmpl-b800a0c5fed6436da98821f65c6495c9-0.
INFO 03-02 01:10:00 [logger.py:42] Received request cmpl-f18e40188fa0432f9fbc9e2366420aa2-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:10:00 [async_llm.py:261] Added request cmpl-f18e40188fa0432f9fbc9e2366420aa2-0.
INFO 03-02 01:10:01 [logger.py:42] Received request cmpl-5ce40fab5df54a9a84659462574e5089-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:10:01 [async_llm.py:261] Added request cmpl-5ce40fab5df54a9a84659462574e5089-0.
INFO 03-02 01:10:02 [logger.py:42] Received request cmpl-0389992c031946088aa9ab4477e8a231-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:10:02 [async_llm.py:261] Added request cmpl-0389992c031946088aa9ab4477e8a231-0.
INFO 03-02 01:10:03 [logger.py:42] Received request cmpl-ac770176daff4d9f972c30424ba35f3d-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:10:03 [async_llm.py:261] Added request cmpl-ac770176daff4d9f972c30424ba35f3d-0.
INFO 03-02 01:10:05 [logger.py:42] Received request cmpl-35e7bb9a33d74cd682041b879bc232b9-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:10:05 [async_llm.py:261] Added request cmpl-35e7bb9a33d74cd682041b879bc232b9-0.
INFO 03-02 01:10:06 [logger.py:42] Received request cmpl-904543ee98c7418296d7455f762bc9b2-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:10:06 [async_llm.py:261] Added request cmpl-904543ee98c7418296d7455f762bc9b2-0.
INFO 03-02 01:10:07 [logger.py:42] Received request cmpl-24268334ef1b41beb92e9fc0e7446a83-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:10:07 [async_llm.py:261] Added request cmpl-24268334ef1b41beb92e9fc0e7446a83-0.
INFO 03-02 01:10:08 [loggers.py:116] Engine 000: Avg prompt throughput: 27.9 tokens/s, Avg generation throughput: 4.5 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.6%
INFO 03-02 01:10:08 [logger.py:42] Received request cmpl-4249b1f9f2d747378aa9e6e7b8291194-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:10:08 [async_llm.py:261] Added request cmpl-4249b1f9f2d747378aa9e6e7b8291194-0.
INFO 03-02 01:10:09 [logger.py:42] Received request cmpl-f5d71b00f704498fb679d975d073a323-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:10:09 [async_llm.py:261] Added request cmpl-f5d71b00f704498fb679d975d073a323-0.
INFO 03-02 01:10:10 [logger.py:42] Received request cmpl-a9f26458869c4c90a56dc5f5efd9f7b6-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:10:10 [async_llm.py:261] Added request cmpl-a9f26458869c4c90a56dc5f5efd9f7b6-0.
INFO 03-02 01:10:12 [logger.py:42] Received request cmpl-f3525a1a2e5f42e4b0f5d7d08e5f6e38-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:10:12 [async_llm.py:261] Added request cmpl-f3525a1a2e5f42e4b0f5d7d08e5f6e38-0.
INFO 03-02 01:10:13 [logger.py:42] Received request cmpl-c1f93cd652a7455c9aea6ff95e442f73-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:10:13 [async_llm.py:261] Added request cmpl-c1f93cd652a7455c9aea6ff95e442f73-0.
INFO 03-02 01:10:14 [logger.py:42] Received request cmpl-f94bcde21b4146dcab9d23a963521bd0-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:10:14 [async_llm.py:261] Added request cmpl-f94bcde21b4146dcab9d23a963521bd0-0.
INFO 03-02 01:10:15 [logger.py:42] Received request cmpl-9ddf8586bfe44f7eacc5c962a7cb4dde-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:10:15 [async_llm.py:261] Added request cmpl-9ddf8586bfe44f7eacc5c962a7cb4dde-0.
INFO 03-02 01:10:16 [logger.py:42] Received request cmpl-42584db24d8e4fd98759fb19ce442029-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:10:16 [async_llm.py:261] Added request cmpl-42584db24d8e4fd98759fb19ce442029-0.
INFO 03-02 01:10:17 [logger.py:42] Received request cmpl-7b0c468a94514ca082ba8f0b740b5d4a-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:10:17 [async_llm.py:261] Added request cmpl-7b0c468a94514ca082ba8f0b740b5d4a-0.
INFO 03-02 01:10:18 [loggers.py:116] Engine 000: Avg prompt throughput: 27.9 tokens/s, Avg generation throughput: 4.5 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.6%
INFO 03-02 01:10:18 [logger.py:42] Received request cmpl-8b38b29762f548c6a99da42764d70dd4-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:10:18 [async_llm.py:261] Added request cmpl-8b38b29762f548c6a99da42764d70dd4-0.
INFO 03-02 01:10:20 [logger.py:42] Received request cmpl-f1c7ef002c324fccb9630199ef851f7f-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:10:20 [async_llm.py:261] Added request cmpl-f1c7ef002c324fccb9630199ef851f7f-0.
INFO 03-02 01:10:21 [logger.py:42] Received request cmpl-0c77a6a40572409c8a16a7c0e97a9168-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:10:21 [async_llm.py:261] Added request cmpl-0c77a6a40572409c8a16a7c0e97a9168-0.
INFO 03-02 01:10:22 [logger.py:42] Received request cmpl-d2f06f1d55cc41bb91d8585233c7848a-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:10:22 [async_llm.py:261] Added request cmpl-d2f06f1d55cc41bb91d8585233c7848a-0.
INFO 03-02 01:10:23 [logger.py:42] Received request cmpl-d580a5d018af471080a72a7b76a8b7cf-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:10:23 [async_llm.py:261] Added request cmpl-d580a5d018af471080a72a7b76a8b7cf-0.
INFO 03-02 01:10:24 [logger.py:42] Received request cmpl-d37b36e9f29d4904bb7bbb00a8a74689-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:10:24 [async_llm.py:261] Added request cmpl-d37b36e9f29d4904bb7bbb00a8a74689-0.
INFO 03-02 01:10:25 [logger.py:42] Received request cmpl-7f3e0f2ccab84588bfde05c0b91a2d7e-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:10:25 [async_llm.py:261] Added request cmpl-7f3e0f2ccab84588bfde05c0b91a2d7e-0.
INFO 03-02 01:10:27 [logger.py:42] Received request cmpl-a7744c5263ee41a7b570dc1e41a5cd84-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:10:27 [async_llm.py:261] Added request cmpl-a7744c5263ee41a7b570dc1e41a5cd84-0.
INFO 03-02 01:10:28 [loggers.py:116] Engine 000: Avg prompt throughput: 24.8 tokens/s, Avg generation throughput: 4.0 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.6%
INFO 03-02 01:10:28 [logger.py:42] Received request cmpl-247a174c7d3e41eca8a3005982c30720-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:10:28 [async_llm.py:261] Added request cmpl-247a174c7d3e41eca8a3005982c30720-0.
INFO 03-02 01:10:29 [logger.py:42] Received request cmpl-f1340de46ad14441a5f2d152e90c32ec-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:10:29 [async_llm.py:261] Added request cmpl-f1340de46ad14441a5f2d152e90c32ec-0.
INFO 03-02 01:10:30 [logger.py:42] Received request cmpl-7ca91e2a02ed4a6e97ee1cf91488d533-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:10:30 [async_llm.py:261] Added request cmpl-7ca91e2a02ed4a6e97ee1cf91488d533-0.
INFO 03-02 01:10:31 [logger.py:42] Received request cmpl-4d33b7bad71a464fb90937d38eee03ab-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:10:31 [async_llm.py:261] Added request cmpl-4d33b7bad71a464fb90937d38eee03ab-0.
INFO 03-02 01:10:32 [logger.py:42] Received request cmpl-ab1e44a824de46409cfe7e5f529abc0f-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:10:32 [async_llm.py:261] Added request cmpl-ab1e44a824de46409cfe7e5f529abc0f-0.
INFO 03-02 01:10:33 [logger.py:42] Received request cmpl-ace218b1d84649f8a1eca7ca626dead0-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:10:33 [async_llm.py:261] Added request cmpl-ace218b1d84649f8a1eca7ca626dead0-0.
INFO 03-02 01:10:35 [logger.py:42] Received request cmpl-413f4ce8dba6473080a56197061a692f-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:10:35 [async_llm.py:261] Added request cmpl-413f4ce8dba6473080a56197061a692f-0.
INFO 03-02 01:10:36 [logger.py:42] Received request cmpl-eeeb793b6e9941e5b6d167157a3892c9-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:10:36 [async_llm.py:261] Added request cmpl-eeeb793b6e9941e5b6d167157a3892c9-0.
INFO 03-02 01:10:37 [logger.py:42] Received request cmpl-987fa4367af44c5082a91d25a5824c3a-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:10:37 [async_llm.py:261] Added request cmpl-987fa4367af44c5082a91d25a5824c3a-0.
INFO 03-02 01:10:38 [loggers.py:116] Engine 000: Avg prompt throughput: 27.9 tokens/s, Avg generation throughput: 4.5 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.6%
INFO 03-02 01:10:38 [logger.py:42] Received request cmpl-a787a284928a410ca0cd46c87fed0918-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:10:38 [async_llm.py:261] Added request cmpl-a787a284928a410ca0cd46c87fed0918-0.
INFO 03-02 01:10:39 [logger.py:42] Received request cmpl-93eadd3c25de40f8a3791b1ca37e5c79-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:10:39 [async_llm.py:261] Added request cmpl-93eadd3c25de40f8a3791b1ca37e5c79-0.
INFO 03-02 01:10:40 [logger.py:42] Received request cmpl-e94eb7e652474f3c81e5f5502f5a8dd1-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:10:40 [async_llm.py:261] Added request cmpl-e94eb7e652474f3c81e5f5502f5a8dd1-0.
INFO 03-02 01:10:42 [logger.py:42] Received request cmpl-8a74693337324addacfdc47fdd8437d2-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:10:42 [async_llm.py:261] Added request cmpl-8a74693337324addacfdc47fdd8437d2-0.
INFO 03-02 01:10:43 [logger.py:42] Received request cmpl-d650198647464faea4aec4bc805fb12a-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:10:43 [async_llm.py:261] Added request cmpl-d650198647464faea4aec4bc805fb12a-0.
INFO 03-02 01:10:44 [logger.py:42] Received request cmpl-60b625f190304e8191b6af11b3b75be7-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:10:44 [async_llm.py:261] Added request cmpl-60b625f190304e8191b6af11b3b75be7-0.
INFO 03-02 01:10:45 [logger.py:42] Received request cmpl-6d94d339b5064db0b0bb107497aec59f-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:10:45 [async_llm.py:261] Added request cmpl-6d94d339b5064db0b0bb107497aec59f-0.
INFO 03-02 01:10:46 [logger.py:42] Received request cmpl-30b88731f69c46bb99309434256aba41-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:10:46 [async_llm.py:261] Added request cmpl-30b88731f69c46bb99309434256aba41-0.
INFO 03-02 01:10:47 [logger.py:42] Received request cmpl-e74d19c1b331426e8aa9d7aed70f1a7a-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:10:47 [async_llm.py:261] Added request cmpl-e74d19c1b331426e8aa9d7aed70f1a7a-0.
INFO 03-02 01:10:48 [loggers.py:116] Engine 000: Avg prompt throughput: 27.9 tokens/s, Avg generation throughput: 4.5 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.6%
INFO 03-02 01:10:48 [logger.py:42] Received request cmpl-486ca8aa26d14f05a7cb149a231ab303-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:10:48 [async_llm.py:261] Added request cmpl-486ca8aa26d14f05a7cb149a231ab303-0.
INFO 03-02 01:10:50 [logger.py:42] Received request cmpl-09d9f1a11b1a488b997326299a3a3681-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:10:50 [async_llm.py:261] Added request cmpl-09d9f1a11b1a488b997326299a3a3681-0.
INFO 03-02 01:10:51 [logger.py:42] Received request cmpl-7d515d7534424fec9e7cd29ac898dfdf-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:10:51 [async_llm.py:261] Added request cmpl-7d515d7534424fec9e7cd29ac898dfdf-0.
INFO 03-02 01:10:52 [logger.py:42] Received request cmpl-ece4a99e65d0476a8b7c4b014f2fe7c1-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:10:52 [async_llm.py:261] Added request cmpl-ece4a99e65d0476a8b7c4b014f2fe7c1-0.
INFO 03-02 01:10:53 [logger.py:42] Received request cmpl-58453dd7c12a4dacb584cb6ef096f437-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:10:53 [async_llm.py:261] Added request cmpl-58453dd7c12a4dacb584cb6ef096f437-0.
INFO 03-02 01:10:54 [logger.py:42] Received request cmpl-d3bf979d906b44918beee1072a440b67-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:10:54 [async_llm.py:261] Added request cmpl-d3bf979d906b44918beee1072a440b67-0.
INFO 03-02 01:10:55 [logger.py:42] Received request cmpl-55627cc7880f4b3e933368743c9a4790-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:10:55 [async_llm.py:261] Added request cmpl-55627cc7880f4b3e933368743c9a4790-0.
INFO 03-02 01:10:57 [logger.py:42] Received request cmpl-ef2d82abbe1142da8db7cfc642d5c914-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:10:57 [async_llm.py:261] Added request cmpl-ef2d82abbe1142da8db7cfc642d5c914-0.
INFO 03-02 01:10:58 [loggers.py:116] Engine 000: Avg prompt throughput: 24.8 tokens/s, Avg generation throughput: 4.0 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.6%
INFO 03-02 01:10:58 [logger.py:42] Received request cmpl-67941df3423644ae922c3c5e68ce09fa-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:10:58 [async_llm.py:261] Added request cmpl-67941df3423644ae922c3c5e68ce09fa-0.
INFO 03-02 01:10:59 [logger.py:42] Received request cmpl-f320871e4b6249df9fc58bc883eeb11f-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:10:59 [async_llm.py:261] Added request cmpl-f320871e4b6249df9fc58bc883eeb11f-0.
INFO 03-02 01:11:00 [logger.py:42] Received request cmpl-39c532b66c204077bb929e717f3c7fa7-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:11:00 [async_llm.py:261] Added request cmpl-39c532b66c204077bb929e717f3c7fa7-0.
INFO 03-02 01:11:01 [logger.py:42] Received request cmpl-384421d3be98476fb667f573b7760ca0-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:11:01 [async_llm.py:261] Added request cmpl-384421d3be98476fb667f573b7760ca0-0.
INFO 03-02 01:11:02 [logger.py:42] Received request cmpl-fd99530c201b44e3b4326c0f0c247920-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:11:02 [async_llm.py:261] Added request cmpl-fd99530c201b44e3b4326c0f0c247920-0.
INFO 03-02 01:11:03 [logger.py:42] Received request cmpl-dd881b6ce6fc4b4db0751eeae4d1c7f1-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:11:03 [async_llm.py:261] Added request cmpl-dd881b6ce6fc4b4db0751eeae4d1c7f1-0.
INFO 03-02 01:11:05 [logger.py:42] Received request cmpl-14400ae7ae6d49af8ac4e55d695ffdb1-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:11:05 [async_llm.py:261] Added request cmpl-14400ae7ae6d49af8ac4e55d695ffdb1-0.
INFO 03-02 01:11:06 [logger.py:42] Received request cmpl-08af658bfaff44f0a24cea1cd9096a9e-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:11:06 [async_llm.py:261] Added request cmpl-08af658bfaff44f0a24cea1cd9096a9e-0.
INFO 03-02 01:11:07 [logger.py:42] Received request cmpl-318126d7fe06489cb173004931287fd9-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:11:07 [async_llm.py:261] Added request cmpl-318126d7fe06489cb173004931287fd9-0.
INFO 03-02 01:11:08 [loggers.py:116] Engine 000: Avg prompt throughput: 27.9 tokens/s, Avg generation throughput: 4.5 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.6%
INFO 03-02 01:11:08 [logger.py:42] Received request cmpl-c4676da1cd064f198d9bccb6e8f30f81-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:11:08 [async_llm.py:261] Added request cmpl-c4676da1cd064f198d9bccb6e8f30f81-0.
INFO 03-02 01:11:09 [logger.py:42] Received request cmpl-c7e2481e4d6246b9b10338bd582e3503-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:11:09 [async_llm.py:261] Added request cmpl-c7e2481e4d6246b9b10338bd582e3503-0.
INFO 03-02 01:11:10 [logger.py:42] Received request cmpl-d4ece977a0ec4a949352bf00ff3a1076-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:11:10 [async_llm.py:261] Added request cmpl-d4ece977a0ec4a949352bf00ff3a1076-0.
INFO 03-02 01:11:12 [logger.py:42] Received request cmpl-f09eeba0e7dc4ec2adf48df22805a90f-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:11:12 [async_llm.py:261] Added request cmpl-f09eeba0e7dc4ec2adf48df22805a90f-0.
INFO 03-02 01:11:13 [logger.py:42] Received request cmpl-f1c59002ef3d401e92319f9dbf320a8d-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:11:13 [async_llm.py:261] Added request cmpl-f1c59002ef3d401e92319f9dbf320a8d-0.
INFO 03-02 01:11:14 [logger.py:42] Received request cmpl-f7dc3af0bdc046609fa442949d79de8f-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:11:14 [async_llm.py:261] Added request cmpl-f7dc3af0bdc046609fa442949d79de8f-0.
INFO 03-02 01:11:15 [logger.py:42] Received request cmpl-92df6849c92a4a7e82b4723a97f7e05d-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:11:15 [async_llm.py:261] Added request cmpl-92df6849c92a4a7e82b4723a97f7e05d-0.
INFO 03-02 01:11:16 [logger.py:42] Received request cmpl-0d046b54b8e64e44aae579219fd72805-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:11:16 [async_llm.py:261] Added request cmpl-0d046b54b8e64e44aae579219fd72805-0.
INFO 03-02 01:11:17 [logger.py:42] Received request cmpl-a508671d257641b9a9d066ade0778d92-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:11:17 [async_llm.py:261] Added request cmpl-a508671d257641b9a9d066ade0778d92-0.
INFO 03-02 01:11:18 [loggers.py:116] Engine 000: Avg prompt throughput: 27.9 tokens/s, Avg generation throughput: 4.5 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.6%
INFO 03-02 01:11:18 [logger.py:42] Received request cmpl-72a9b6ee685e4ee1bf7318a3bff4adfa-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:11:18 [async_llm.py:261] Added request cmpl-72a9b6ee685e4ee1bf7318a3bff4adfa-0.
INFO 03-02 01:11:20 [logger.py:42] Received request cmpl-68ccc06b66d64903a688d5082bfd56b2-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:11:20 [async_llm.py:261] Added request cmpl-68ccc06b66d64903a688d5082bfd56b2-0.
INFO 03-02 01:11:21 [logger.py:42] Received request cmpl-bed2290e8c5c49c3bf18ecdcf6e6bfe0-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:11:21 [async_llm.py:261] Added request cmpl-bed2290e8c5c49c3bf18ecdcf6e6bfe0-0.
INFO 03-02 01:11:22 [logger.py:42] Received request cmpl-afb92a18f217436db36ad3ebd7734170-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:11:22 [async_llm.py:261] Added request cmpl-afb92a18f217436db36ad3ebd7734170-0.
INFO 03-02 01:11:23 [logger.py:42] Received request cmpl-bcdc7117ff4648aba4e40b557f17ce8f-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:11:23 [async_llm.py:261] Added request cmpl-bcdc7117ff4648aba4e40b557f17ce8f-0.
INFO 03-02 01:11:24 [logger.py:42] Received request cmpl-b1f99bea134f4341819756d00780bfa5-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:11:24 [async_llm.py:261] Added request cmpl-b1f99bea134f4341819756d00780bfa5-0.
INFO 03-02 01:11:25 [logger.py:42] Received request cmpl-efad16f5d3734ab09d7aa7059eedaa68-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:11:25 [async_llm.py:261] Added request cmpl-efad16f5d3734ab09d7aa7059eedaa68-0.
INFO 03-02 01:11:27 [logger.py:42] Received request cmpl-591cc5ea6ed84fb4a188afb2a5343e8d-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:11:27 [async_llm.py:261] Added request cmpl-591cc5ea6ed84fb4a188afb2a5343e8d-0.
INFO 03-02 01:11:28 [loggers.py:116] Engine 000: Avg prompt throughput: 24.8 tokens/s, Avg generation throughput: 4.0 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.6%
INFO 03-02 01:11:28 [logger.py:42] Received request cmpl-fe5ef1c71a144c0085fed705150da66c-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:11:28 [async_llm.py:261] Added request cmpl-fe5ef1c71a144c0085fed705150da66c-0.
INFO 03-02 01:11:29 [logger.py:42] Received request cmpl-e0532e84f5e944188cd716dfa80039a9-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:11:29 [async_llm.py:261] Added request cmpl-e0532e84f5e944188cd716dfa80039a9-0.
INFO 03-02 01:11:30 [logger.py:42] Received request cmpl-98ae85d8ad5a4f05b76fe68823997e09-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:11:30 [async_llm.py:261] Added request cmpl-98ae85d8ad5a4f05b76fe68823997e09-0.
INFO 03-02 01:11:31 [logger.py:42] Received request cmpl-3c89460183614f4085c52594667b05bf-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:11:31 [async_llm.py:261] Added request cmpl-3c89460183614f4085c52594667b05bf-0.
INFO 03-02 01:11:32 [logger.py:42] Received request cmpl-af85404b66764fb0bfc09b5a08e9a214-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:11:32 [async_llm.py:261] Added request cmpl-af85404b66764fb0bfc09b5a08e9a214-0.
INFO 03-02 01:11:33 [logger.py:42] Received request cmpl-b764bb2ca4ff4c718bf72bfe1b265fad-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:11:33 [async_llm.py:261] Added request cmpl-b764bb2ca4ff4c718bf72bfe1b265fad-0.
INFO 03-02 01:11:35 [logger.py:42] Received request cmpl-4528ddc25ef8413e8e69625ff786f6f4-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:11:35 [async_llm.py:261] Added request cmpl-4528ddc25ef8413e8e69625ff786f6f4-0.
INFO 03-02 01:11:36 [logger.py:42] Received request cmpl-eb696e394ec3467fa52db26ca3fa30f0-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:11:36 [async_llm.py:261] Added request cmpl-eb696e394ec3467fa52db26ca3fa30f0-0.
INFO 03-02 01:11:37 [logger.py:42] Received request cmpl-247383017a774264b90ec99d6d71582a-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:11:37 [async_llm.py:261] Added request cmpl-247383017a774264b90ec99d6d71582a-0.
INFO 03-02 01:11:38 [loggers.py:116] Engine 000: Avg prompt throughput: 27.9 tokens/s, Avg generation throughput: 4.5 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.6%
INFO 03-02 01:11:38 [logger.py:42] Received request cmpl-c7508679984442d68dc50efc3d120c9f-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:11:38 [async_llm.py:261] Added request cmpl-c7508679984442d68dc50efc3d120c9f-0.
INFO 03-02 01:11:39 [logger.py:42] Received request cmpl-784e75d5c776427c8c639d8626db2636-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:11:39 [async_llm.py:261] Added request cmpl-784e75d5c776427c8c639d8626db2636-0.
INFO 03-02 01:11:40 [logger.py:42] Received request cmpl-70ad7cfdbcbe44358afbaee263957aca-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:11:40 [async_llm.py:261] Added request cmpl-70ad7cfdbcbe44358afbaee263957aca-0.
INFO 03-02 01:11:42 [logger.py:42] Received request cmpl-b22fe69ce6054052a50f7b5d525a6910-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:11:42 [async_llm.py:261] Added request cmpl-b22fe69ce6054052a50f7b5d525a6910-0.
INFO 03-02 01:11:43 [logger.py:42] Received request cmpl-dd8c2919ea5c4edc965b1940af22a91e-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:11:43 [async_llm.py:261] Added request cmpl-dd8c2919ea5c4edc965b1940af22a91e-0.
INFO 03-02 01:11:44 [logger.py:42] Received request cmpl-547a9b3be0fc4a6f8bcee11f19d9eb91-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:11:44 [async_llm.py:261] Added request cmpl-547a9b3be0fc4a6f8bcee11f19d9eb91-0.
INFO 03-02 01:11:45 [logger.py:42] Received request cmpl-115b8d802387484fa02a2534ad248145-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:11:45 [async_llm.py:261] Added request cmpl-115b8d802387484fa02a2534ad248145-0.
INFO 03-02 01:11:46 [logger.py:42] Received request cmpl-ae4f8f618f7547fdb22f38113867032f-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:11:46 [async_llm.py:261] Added request cmpl-ae4f8f618f7547fdb22f38113867032f-0.
INFO 03-02 01:11:47 [logger.py:42] Received request cmpl-b453d4858de844858e07f04bd39e5082-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:11:47 [async_llm.py:261] Added request cmpl-b453d4858de844858e07f04bd39e5082-0.
INFO 03-02 01:11:48 [loggers.py:116] Engine 000: Avg prompt throughput: 27.9 tokens/s, Avg generation throughput: 4.5 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.6%
INFO 03-02 01:11:48 [logger.py:42] Received request cmpl-0fde83bd1fd14550bb15b950654ded3b-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:11:48 [async_llm.py:261] Added request cmpl-0fde83bd1fd14550bb15b950654ded3b-0.
INFO 03-02 01:11:50 [logger.py:42] Received request cmpl-f1b2d731d42f468e8918a968ee2ca097-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:11:50 [async_llm.py:261] Added request cmpl-f1b2d731d42f468e8918a968ee2ca097-0.
INFO 03-02 01:11:51 [logger.py:42] Received request cmpl-8a6ef50427cf4d58ae75663dd62dc16d-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:11:51 [async_llm.py:261] Added request cmpl-8a6ef50427cf4d58ae75663dd62dc16d-0.
INFO 03-02 01:11:52 [logger.py:42] Received request cmpl-7278ba00490a45749f3eeec3be0d1d36-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:11:52 [async_llm.py:261] Added request cmpl-7278ba00490a45749f3eeec3be0d1d36-0.
INFO 03-02 01:11:53 [logger.py:42] Received request cmpl-a9254fdd43d3428e9b884cf958b93469-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:11:53 [async_llm.py:261] Added request cmpl-a9254fdd43d3428e9b884cf958b93469-0.
INFO 03-02 01:11:54 [logger.py:42] Received request cmpl-e29ea181ee69406f949cd343cbf6b2e8-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:11:54 [async_llm.py:261] Added request cmpl-e29ea181ee69406f949cd343cbf6b2e8-0.
INFO 03-02 01:11:55 [logger.py:42] Received request cmpl-0ef1c6de365d4b0cbb6b38a0f6f46448-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:11:55 [async_llm.py:261] Added request cmpl-0ef1c6de365d4b0cbb6b38a0f6f46448-0.
INFO 03-02 01:11:57 [logger.py:42] Received request cmpl-5979be54a0df4c24a9a24f19e929cc3c-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:11:57 [async_llm.py:261] Added request cmpl-5979be54a0df4c24a9a24f19e929cc3c-0.
INFO 03-02 01:11:58 [loggers.py:116] Engine 000: Avg prompt throughput: 24.8 tokens/s, Avg generation throughput: 4.0 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.6%
INFO 03-02 01:11:58 [logger.py:42] Received request cmpl-aa983b3ba69c45eabcab510e8861da97-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:11:58 [async_llm.py:261] Added request cmpl-aa983b3ba69c45eabcab510e8861da97-0.
INFO 03-02 01:11:59 [logger.py:42] Received request cmpl-9d55906b7c5741ed971ae59a0982eaae-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:11:59 [async_llm.py:261] Added request cmpl-9d55906b7c5741ed971ae59a0982eaae-0.
INFO 03-02 01:12:00 [logger.py:42] Received request cmpl-83fd300a42bc4aa890e064b410f0c28e-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:12:00 [async_llm.py:261] Added request cmpl-83fd300a42bc4aa890e064b410f0c28e-0.
INFO 03-02 01:12:01 [logger.py:42] Received request cmpl-b10047c89dae4c1f96668be7af62a1f6-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:12:01 [async_llm.py:261] Added request cmpl-b10047c89dae4c1f96668be7af62a1f6-0.
INFO 03-02 01:12:02 [logger.py:42] Received request cmpl-130320ceca354cbe86871dfbaac51d71-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:12:02 [async_llm.py:261] Added request cmpl-130320ceca354cbe86871dfbaac51d71-0.
INFO 03-02 01:12:03 [logger.py:42] Received request cmpl-a0428bcd75454aac8fe2c0987371ebb7-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:12:03 [async_llm.py:261] Added request cmpl-a0428bcd75454aac8fe2c0987371ebb7-0.
INFO 03-02 01:12:05 [logger.py:42] Received request cmpl-626ce7cfb6ff48dd858fd2beb9da2771-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:12:05 [async_llm.py:261] Added request cmpl-626ce7cfb6ff48dd858fd2beb9da2771-0.
INFO 03-02 01:12:06 [logger.py:42] Received request cmpl-72c47651aeb442bf97679e4dc4262863-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:12:06 [async_llm.py:261] Added request cmpl-72c47651aeb442bf97679e4dc4262863-0.
INFO 03-02 01:12:07 [logger.py:42] Received request cmpl-0c2a1d0236dc4d3b8ff9e581caa30e58-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:12:07 [async_llm.py:261] Added request cmpl-0c2a1d0236dc4d3b8ff9e581caa30e58-0.
INFO 03-02 01:12:08 [loggers.py:116] Engine 000: Avg prompt throughput: 27.9 tokens/s, Avg generation throughput: 4.5 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.6%
INFO 03-02 01:12:08 [logger.py:42] Received request cmpl-6b31bcb724e04c13b33e6d96d3c5e1a1-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:12:08 [async_llm.py:261] Added request cmpl-6b31bcb724e04c13b33e6d96d3c5e1a1-0.
INFO 03-02 01:12:09 [logger.py:42] Received request cmpl-03283f2ea83a48cbb36eb3fcb736c820-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:12:09 [async_llm.py:261] Added request cmpl-03283f2ea83a48cbb36eb3fcb736c820-0.
INFO 03-02 01:12:10 [logger.py:42] Received request cmpl-a82066e5493a4297a615d7165be7c399-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:12:10 [async_llm.py:261] Added request cmpl-a82066e5493a4297a615d7165be7c399-0.
INFO 03-02 01:12:12 [logger.py:42] Received request cmpl-384f3a7dd8d24d28bf9fd28ef167cebb-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:12:12 [async_llm.py:261] Added request cmpl-384f3a7dd8d24d28bf9fd28ef167cebb-0.
INFO 03-02 01:12:13 [logger.py:42] Received request cmpl-5e014b64d9a24181af7f052a9dfe4e69-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:12:13 [async_llm.py:261] Added request cmpl-5e014b64d9a24181af7f052a9dfe4e69-0.
INFO 03-02 01:12:14 [logger.py:42] Received request cmpl-ca002c16172d4176b6c7ac92195ec15c-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:12:14 [async_llm.py:261] Added request cmpl-ca002c16172d4176b6c7ac92195ec15c-0.
INFO 03-02 01:12:15 [logger.py:42] Received request cmpl-4d5cd3ebfc1345d698f387255e239552-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:12:15 [async_llm.py:261] Added request cmpl-4d5cd3ebfc1345d698f387255e239552-0.
INFO 03-02 01:12:16 [logger.py:42] Received request cmpl-968aded7613047b28519c71dbd31655a-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:12:16 [async_llm.py:261] Added request cmpl-968aded7613047b28519c71dbd31655a-0.
INFO 03-02 01:12:17 [logger.py:42] Received request cmpl-f35e470a249e44c78b22b3f6173dd0f2-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:12:17 [async_llm.py:261] Added request cmpl-f35e470a249e44c78b22b3f6173dd0f2-0.
INFO 03-02 01:12:18 [loggers.py:116] Engine 000: Avg prompt throughput: 27.9 tokens/s, Avg generation throughput: 4.5 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.6%
INFO 03-02 01:12:18 [logger.py:42] Received request cmpl-e6d898a2f2c446c5bfb9dbf6f1dd2ec0-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:12:18 [async_llm.py:261] Added request cmpl-e6d898a2f2c446c5bfb9dbf6f1dd2ec0-0.
INFO 03-02 01:12:20 [logger.py:42] Received request cmpl-0269728896f54a16b84680cc5c08a3fa-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:12:20 [async_llm.py:261] Added request cmpl-0269728896f54a16b84680cc5c08a3fa-0.
INFO 03-02 01:12:21 [logger.py:42] Received request cmpl-cbcef57877d84bc69b552de6ffba1a66-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:12:21 [async_llm.py:261] Added request cmpl-cbcef57877d84bc69b552de6ffba1a66-0.
INFO 03-02 01:12:22 [logger.py:42] Received request cmpl-74e5601f17cf4aa3989f063a477d5e30-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:12:22 [async_llm.py:261] Added request cmpl-74e5601f17cf4aa3989f063a477d5e30-0.
INFO 03-02 01:12:23 [logger.py:42] Received request cmpl-c383914e905641369993a2ec19d24c3a-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:12:23 [async_llm.py:261] Added request cmpl-c383914e905641369993a2ec19d24c3a-0.
INFO 03-02 01:12:24 [logger.py:42] Received request cmpl-a57c699caf09404393ca359f39bdf373-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:12:24 [async_llm.py:261] Added request cmpl-a57c699caf09404393ca359f39bdf373-0.
INFO 03-02 01:12:25 [logger.py:42] Received request cmpl-c74805dbeaf0464abf970044cfe54e66-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:12:25 [async_llm.py:261] Added request cmpl-c74805dbeaf0464abf970044cfe54e66-0.
INFO 03-02 01:12:27 [logger.py:42] Received request cmpl-18a1fe781f694998b82c31a4ec59a775-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:12:27 [async_llm.py:261] Added request cmpl-18a1fe781f694998b82c31a4ec59a775-0.
INFO 03-02 01:12:28 [logger.py:42] Received request cmpl-0249244e93334043ad2af796417ade17-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:12:28 [async_llm.py:261] Added request cmpl-0249244e93334043ad2af796417ade17-0.
INFO 03-02 01:12:28 [loggers.py:116] Engine 000: Avg prompt throughput: 24.8 tokens/s, Avg generation throughput: 4.0 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.6%
INFO 03-02 01:12:29 [logger.py:42] Received request cmpl-8b1697cd1b1d41abad6184375436ecf3-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:12:29 [async_llm.py:261] Added request cmpl-8b1697cd1b1d41abad6184375436ecf3-0.
INFO 03-02 01:12:30 [logger.py:42] Received request cmpl-44681d4555a74d3a8f1a190159184d61-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:12:30 [async_llm.py:261] Added request cmpl-44681d4555a74d3a8f1a190159184d61-0.
INFO 03-02 01:12:31 [logger.py:42] Received request cmpl-09137a1c7cd241f69f528944e7e717f1-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:12:31 [async_llm.py:261] Added request cmpl-09137a1c7cd241f69f528944e7e717f1-0.
INFO 03-02 01:12:32 [logger.py:42] Received request cmpl-48c6380c7f944803b598f73519589f6b-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:12:32 [async_llm.py:261] Added request cmpl-48c6380c7f944803b598f73519589f6b-0.
INFO 03-02 01:12:33 [logger.py:42] Received request cmpl-ef6dc2521f3e48ac9c14194833622933-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:12:33 [async_llm.py:261] Added request cmpl-ef6dc2521f3e48ac9c14194833622933-0.
INFO 03-02 01:12:35 [logger.py:42] Received request cmpl-1e4f89c37ef24ad985812b03006b2674-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:12:35 [async_llm.py:261] Added request cmpl-1e4f89c37ef24ad985812b03006b2674-0.
INFO 03-02 01:12:36 [logger.py:42] Received request cmpl-3888413f554f486cb5d163250c7eccee-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:12:36 [async_llm.py:261] Added request cmpl-3888413f554f486cb5d163250c7eccee-0.
INFO 03-02 01:12:37 [logger.py:42] Received request cmpl-9f6b0629308743c790cace56acdffa0a-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:12:37 [async_llm.py:261] Added request cmpl-9f6b0629308743c790cace56acdffa0a-0.
INFO 03-02 01:12:38 [loggers.py:116] Engine 000: Avg prompt throughput: 27.9 tokens/s, Avg generation throughput: 4.5 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.6%
INFO 03-02 01:12:38 [logger.py:42] Received request cmpl-319e607a188d4b83b54f7c42f215ef0a-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:12:38 [async_llm.py:261] Added request cmpl-319e607a188d4b83b54f7c42f215ef0a-0.
INFO 03-02 01:12:39 [logger.py:42] Received request cmpl-3bd05856e5594583a167becb06c4c939-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:12:39 [async_llm.py:261] Added request cmpl-3bd05856e5594583a167becb06c4c939-0.
INFO 03-02 01:12:40 [logger.py:42] Received request cmpl-428ff696125f4c5ba2e63cbf21012f60-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:12:40 [async_llm.py:261] Added request cmpl-428ff696125f4c5ba2e63cbf21012f60-0.
INFO 03-02 01:12:42 [logger.py:42] Received request cmpl-62fa845cb68540659d8d93b6291e651e-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:12:42 [async_llm.py:261] Added request cmpl-62fa845cb68540659d8d93b6291e651e-0.
INFO 03-02 01:12:43 [logger.py:42] Received request cmpl-827fcc071e7b4f618f8b15b72c09f8ed-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:12:43 [async_llm.py:261] Added request cmpl-827fcc071e7b4f618f8b15b72c09f8ed-0.
INFO 03-02 01:12:44 [logger.py:42] Received request cmpl-e4c1526299494bf88b982e4fbeaa6d00-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:12:44 [async_llm.py:261] Added request cmpl-e4c1526299494bf88b982e4fbeaa6d00-0.
INFO 03-02 01:12:45 [logger.py:42] Received request cmpl-0b9eeb0499e64e13a6501c3a77b50f6d-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:12:45 [async_llm.py:261] Added request cmpl-0b9eeb0499e64e13a6501c3a77b50f6d-0.
INFO 03-02 01:12:46 [logger.py:42] Received request cmpl-0d3184b3ff25444aa4e789fc30f2c544-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:12:46 [async_llm.py:261] Added request cmpl-0d3184b3ff25444aa4e789fc30f2c544-0.
INFO 03-02 01:12:47 [logger.py:42] Received request cmpl-eb8da2e57a2242cc946d80f54b44dcc4-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:12:47 [async_llm.py:261] Added request cmpl-eb8da2e57a2242cc946d80f54b44dcc4-0.
INFO 03-02 01:12:48 [loggers.py:116] Engine 000: Avg prompt throughput: 27.9 tokens/s, Avg generation throughput: 4.5 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.6%
INFO 03-02 01:12:48 [logger.py:42] Received request cmpl-d97278b10807488ea7a80952122d48fa-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:12:48 [async_llm.py:261] Added request cmpl-d97278b10807488ea7a80952122d48fa-0.
INFO 03-02 01:12:50 [logger.py:42] Received request cmpl-bc73dd7fea654bda939fb8441d0ca51d-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:12:50 [async_llm.py:261] Added request cmpl-bc73dd7fea654bda939fb8441d0ca51d-0.
INFO 03-02 01:12:51 [logger.py:42] Received request cmpl-2ffa6e06b2c442e5b63ef14fddc1dc5b-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:12:51 [async_llm.py:261] Added request cmpl-2ffa6e06b2c442e5b63ef14fddc1dc5b-0.
INFO 03-02 01:12:52 [logger.py:42] Received request cmpl-42984af987284759b1abc7380dc8b588-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:12:52 [async_llm.py:261] Added request cmpl-42984af987284759b1abc7380dc8b588-0.
INFO 03-02 01:12:53 [logger.py:42] Received request cmpl-3d83989511f54186a6e0281c2c849698-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:12:53 [async_llm.py:261] Added request cmpl-3d83989511f54186a6e0281c2c849698-0.
INFO 03-02 01:12:54 [logger.py:42] Received request cmpl-3b09ced269aa40b8ab20d870a39bda80-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:12:54 [async_llm.py:261] Added request cmpl-3b09ced269aa40b8ab20d870a39bda80-0.
INFO 03-02 01:12:55 [logger.py:42] Received request cmpl-ff5b6ab6bdec4fe19ff3e864e4c686d6-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:12:55 [async_llm.py:261] Added request cmpl-ff5b6ab6bdec4fe19ff3e864e4c686d6-0.
INFO 03-02 01:12:57 [logger.py:42] Received request cmpl-b7f7781b41c94837b70c536ca37b7419-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:12:57 [async_llm.py:261] Added request cmpl-b7f7781b41c94837b70c536ca37b7419-0.
INFO 03-02 01:12:58 [loggers.py:116] Engine 000: Avg prompt throughput: 24.8 tokens/s, Avg generation throughput: 4.0 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.6%
INFO 03-02 01:12:58 [logger.py:42] Received request cmpl-5fe7ae93c47443b3b9d978b85530b653-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:12:58 [async_llm.py:261] Added request cmpl-5fe7ae93c47443b3b9d978b85530b653-0.
INFO 03-02 01:12:59 [logger.py:42] Received request cmpl-6e6a5497821042fbb78fdf876f72e944-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:12:59 [async_llm.py:261] Added request cmpl-6e6a5497821042fbb78fdf876f72e944-0.
INFO 03-02 01:13:00 [logger.py:42] Received request cmpl-9f6a82438ac94ebc821c4725cc177631-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:13:00 [async_llm.py:261] Added request cmpl-9f6a82438ac94ebc821c4725cc177631-0.
INFO 03-02 01:13:01 [logger.py:42] Received request cmpl-3ba47946264b4eb2b7b26496d4963c94-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:13:01 [async_llm.py:261] Added request cmpl-3ba47946264b4eb2b7b26496d4963c94-0.
INFO 03-02 01:13:02 [logger.py:42] Received request cmpl-3368ec0d49464cb3a743ecebfd376778-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:13:02 [async_llm.py:261] Added request cmpl-3368ec0d49464cb3a743ecebfd376778-0.
INFO 03-02 01:13:03 [logger.py:42] Received request cmpl-ff5589d5d24c4e018c0f80e60819cda9-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:13:03 [async_llm.py:261] Added request cmpl-ff5589d5d24c4e018c0f80e60819cda9-0.
INFO 03-02 01:13:05 [logger.py:42] Received request cmpl-999ea6d2f5cf4b4e84ceef3240832589-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:13:05 [async_llm.py:261] Added request cmpl-999ea6d2f5cf4b4e84ceef3240832589-0.
INFO 03-02 01:13:06 [logger.py:42] Received request cmpl-517857f3f59b400cbbd575a086bd0aff-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:13:06 [async_llm.py:261] Added request cmpl-517857f3f59b400cbbd575a086bd0aff-0.
INFO 03-02 01:13:07 [logger.py:42] Received request cmpl-403f16defd7c41e785d4119a69e13231-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:13:07 [async_llm.py:261] Added request cmpl-403f16defd7c41e785d4119a69e13231-0.
INFO 03-02 01:13:08 [loggers.py:116] Engine 000: Avg prompt throughput: 27.9 tokens/s, Avg generation throughput: 4.5 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.6%
INFO 03-02 01:13:08 [logger.py:42] Received request cmpl-ea4cf72af6cf4bbdb9d78ac56a1c7fbc-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:13:08 [async_llm.py:261] Added request cmpl-ea4cf72af6cf4bbdb9d78ac56a1c7fbc-0.
INFO 03-02 01:13:09 [logger.py:42] Received request cmpl-9fa443afc3524fb5a41c7aa20dc8196d-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:13:09 [async_llm.py:261] Added request cmpl-9fa443afc3524fb5a41c7aa20dc8196d-0.
INFO 03-02 01:13:10 [logger.py:42] Received request cmpl-c36fad2647d5442790a5ff5ee425cbdc-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:13:10 [async_llm.py:261] Added request cmpl-c36fad2647d5442790a5ff5ee425cbdc-0.
INFO 03-02 01:13:12 [logger.py:42] Received request cmpl-08c6fad4e54341688a86f8b0c2c5e697-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:13:12 [async_llm.py:261] Added request cmpl-08c6fad4e54341688a86f8b0c2c5e697-0.
INFO 03-02 01:13:13 [logger.py:42] Received request cmpl-5702397460ef40b2b8ec6fe75d4a4a05-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:13:13 [async_llm.py:261] Added request cmpl-5702397460ef40b2b8ec6fe75d4a4a05-0.
INFO 03-02 01:13:14 [logger.py:42] Received request cmpl-626af2173f2b43239f4f10747e4d75aa-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:13:14 [async_llm.py:261] Added request cmpl-626af2173f2b43239f4f10747e4d75aa-0.
INFO 03-02 01:13:15 [logger.py:42] Received request cmpl-f4a3f53c06ad4f0498455ee3b10a95b2-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:13:15 [async_llm.py:261] Added request cmpl-f4a3f53c06ad4f0498455ee3b10a95b2-0.
INFO 03-02 01:13:16 [logger.py:42] Received request cmpl-a3f6b8d81e71488ba446f8d7bcc6dd0a-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:13:16 [async_llm.py:261] Added request cmpl-a3f6b8d81e71488ba446f8d7bcc6dd0a-0.
INFO 03-02 01:13:17 [logger.py:42] Received request cmpl-83ed5fe18d8645d7896485d8e47e5ba5-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:13:17 [async_llm.py:261] Added request cmpl-83ed5fe18d8645d7896485d8e47e5ba5-0.
INFO 03-02 01:13:18 [loggers.py:116] Engine 000: Avg prompt throughput: 27.9 tokens/s, Avg generation throughput: 4.5 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.6%
INFO 03-02 01:13:18 [logger.py:42] Received request cmpl-73d2674b517848e0865c1e44b6eb9980-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:13:18 [async_llm.py:261] Added request cmpl-73d2674b517848e0865c1e44b6eb9980-0.
INFO 03-02 01:13:20 [logger.py:42] Received request cmpl-0bc77c9abc1c4636b55f58117de014ce-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:13:20 [async_llm.py:261] Added request cmpl-0bc77c9abc1c4636b55f58117de014ce-0.
INFO 03-02 01:13:21 [logger.py:42] Received request cmpl-ac3492b8163b4331824be3b1d26c8a01-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:13:21 [async_llm.py:261] Added request cmpl-ac3492b8163b4331824be3b1d26c8a01-0.
INFO 03-02 01:13:22 [logger.py:42] Received request cmpl-0d6975ca89a042d8a92eea9f303e1fca-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:13:22 [async_llm.py:261] Added request cmpl-0d6975ca89a042d8a92eea9f303e1fca-0.
INFO 03-02 01:13:23 [logger.py:42] Received request cmpl-da899cc8a1374333b6b7cd05ea1585a0-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:13:23 [async_llm.py:261] Added request cmpl-da899cc8a1374333b6b7cd05ea1585a0-0.
INFO 03-02 01:13:24 [logger.py:42] Received request cmpl-08b776661b2e409489229fc89bd6bc53-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:13:24 [async_llm.py:261] Added request cmpl-08b776661b2e409489229fc89bd6bc53-0.
INFO 03-02 01:13:25 [logger.py:42] Received request cmpl-729eeb9c129544b2af9436b212d2c906-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:13:25 [async_llm.py:261] Added request cmpl-729eeb9c129544b2af9436b212d2c906-0.
INFO 03-02 01:13:27 [logger.py:42] Received request cmpl-65eed9388aa34b44a01b9371618a1ea7-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:13:27 [async_llm.py:261] Added request cmpl-65eed9388aa34b44a01b9371618a1ea7-0.
INFO 03-02 01:13:28 [loggers.py:116] Engine 000: Avg prompt throughput: 24.8 tokens/s, Avg generation throughput: 4.0 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.6%
INFO 03-02 01:13:28 [logger.py:42] Received request cmpl-ad9957d4447b418f834784093c866683-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:13:28 [async_llm.py:261] Added request cmpl-ad9957d4447b418f834784093c866683-0.
INFO 03-02 01:13:29 [logger.py:42] Received request cmpl-3b1fed8711514e3cb7bd7be871c9f3ca-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:13:29 [async_llm.py:261] Added request cmpl-3b1fed8711514e3cb7bd7be871c9f3ca-0.
INFO 03-02 01:13:30 [logger.py:42] Received request cmpl-a3e6bb019a58493791be3092e8a6b595-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:13:30 [async_llm.py:261] Added request cmpl-a3e6bb019a58493791be3092e8a6b595-0.
INFO 03-02 01:13:31 [logger.py:42] Received request cmpl-5863d6e612b74001a824f8dd1579586a-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:13:31 [async_llm.py:261] Added request cmpl-5863d6e612b74001a824f8dd1579586a-0.
INFO 03-02 01:13:32 [logger.py:42] Received request cmpl-c3cda648e0644e04aa32a0c6b36f8f9f-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:13:32 [async_llm.py:261] Added request cmpl-c3cda648e0644e04aa32a0c6b36f8f9f-0.
INFO 03-02 01:13:33 [logger.py:42] Received request cmpl-59212e709d16421a8a9b3eefd86fef2a-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:13:33 [async_llm.py:261] Added request cmpl-59212e709d16421a8a9b3eefd86fef2a-0.
INFO 03-02 01:13:35 [logger.py:42] Received request cmpl-ef0e75f3d99f48249a3711b1c2fc397d-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:13:35 [async_llm.py:261] Added request cmpl-ef0e75f3d99f48249a3711b1c2fc397d-0.
INFO 03-02 01:13:36 [logger.py:42] Received request cmpl-065ca6f73386468587909eacb39e55c6-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:13:36 [async_llm.py:261] Added request cmpl-065ca6f73386468587909eacb39e55c6-0.
INFO 03-02 01:13:37 [logger.py:42] Received request cmpl-ab424049b3894c0090870b1ea6b8e1c9-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:13:37 [async_llm.py:261] Added request cmpl-ab424049b3894c0090870b1ea6b8e1c9-0.
INFO 03-02 01:13:38 [loggers.py:116] Engine 000: Avg prompt throughput: 27.9 tokens/s, Avg generation throughput: 4.5 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.6%
INFO 03-02 01:13:38 [logger.py:42] Received request cmpl-bdf0a51d533c4ec7992b2a01d2627b47-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:13:38 [async_llm.py:261] Added request cmpl-bdf0a51d533c4ec7992b2a01d2627b47-0.
INFO 03-02 01:13:39 [logger.py:42] Received request cmpl-bdc1981aafaf4ea999d303758c7a4853-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:13:39 [async_llm.py:261] Added request cmpl-bdc1981aafaf4ea999d303758c7a4853-0.
INFO 03-02 01:13:40 [logger.py:42] Received request cmpl-f17e6eb2dd3d40d38c76fc755a275840-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:13:40 [async_llm.py:261] Added request cmpl-f17e6eb2dd3d40d38c76fc755a275840-0.
INFO 03-02 01:13:42 [logger.py:42] Received request cmpl-9ef3bfff5f3d49a7a31e9e03bf4c02af-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:13:42 [async_llm.py:261] Added request cmpl-9ef3bfff5f3d49a7a31e9e03bf4c02af-0.
INFO 03-02 01:13:43 [logger.py:42] Received request cmpl-fbd1aeeda98645ae99af8e14ab90510d-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:13:43 [async_llm.py:261] Added request cmpl-fbd1aeeda98645ae99af8e14ab90510d-0.
INFO 03-02 01:13:44 [logger.py:42] Received request cmpl-a5683674db5540d6b3f2c2fe268f7026-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:13:44 [async_llm.py:261] Added request cmpl-a5683674db5540d6b3f2c2fe268f7026-0.
INFO 03-02 01:13:45 [logger.py:42] Received request cmpl-6b486c5e8324409fbde5e4bbb8fd1c08-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:13:45 [async_llm.py:261] Added request cmpl-6b486c5e8324409fbde5e4bbb8fd1c08-0.
INFO 03-02 01:13:46 [logger.py:42] Received request cmpl-d3a5147025dd4a15a7c2d276bee39466-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:13:46 [async_llm.py:261] Added request cmpl-d3a5147025dd4a15a7c2d276bee39466-0.
INFO 03-02 01:13:47 [logger.py:42] Received request cmpl-0d1bbf8cc1c24ea690fa4d373d8d444c-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:13:47 [async_llm.py:261] Added request cmpl-0d1bbf8cc1c24ea690fa4d373d8d444c-0.
INFO 03-02 01:13:48 [loggers.py:116] Engine 000: Avg prompt throughput: 27.9 tokens/s, Avg generation throughput: 4.5 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.6%
INFO 03-02 01:13:48 [logger.py:42] Received request cmpl-6341ec88070c442695f323fd1af39eed-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:13:48 [async_llm.py:261] Added request cmpl-6341ec88070c442695f323fd1af39eed-0.
INFO 03-02 01:13:50 [logger.py:42] Received request cmpl-4bbadb7e79ed47e0b4ffdfb73c783a65-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:13:50 [async_llm.py:261] Added request cmpl-4bbadb7e79ed47e0b4ffdfb73c783a65-0.
INFO 03-02 01:13:51 [logger.py:42] Received request cmpl-bfdbdd27379e4ce2b582282999a7107e-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:13:51 [async_llm.py:261] Added request cmpl-bfdbdd27379e4ce2b582282999a7107e-0.
INFO 03-02 01:13:52 [logger.py:42] Received request cmpl-e0c81f30e14040a98b50a08e91ee4047-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:13:52 [async_llm.py:261] Added request cmpl-e0c81f30e14040a98b50a08e91ee4047-0.
INFO 03-02 01:13:53 [logger.py:42] Received request cmpl-b5bdb9ff52b841eea1324b030db28785-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:13:53 [async_llm.py:261] Added request cmpl-b5bdb9ff52b841eea1324b030db28785-0.
INFO 03-02 01:13:54 [logger.py:42] Received request cmpl-107d96807a5c455a8635d8718c0fbae5-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:13:54 [async_llm.py:261] Added request cmpl-107d96807a5c455a8635d8718c0fbae5-0.
INFO 03-02 01:13:55 [logger.py:42] Received request cmpl-362008791cf247aba8739f01b2fd4cf7-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:13:55 [async_llm.py:261] Added request cmpl-362008791cf247aba8739f01b2fd4cf7-0.
INFO 03-02 01:13:57 [logger.py:42] Received request cmpl-de10ee9daeb043eb96d9c7c34af0a812-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:13:57 [async_llm.py:261] Added request cmpl-de10ee9daeb043eb96d9c7c34af0a812-0.
INFO 03-02 01:13:58 [loggers.py:116] Engine 000: Avg prompt throughput: 24.8 tokens/s, Avg generation throughput: 4.0 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.6%
INFO 03-02 01:13:58 [logger.py:42] Received request cmpl-abc48c7bc5cf4cbf95a3e7744172cf27-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:13:58 [async_llm.py:261] Added request cmpl-abc48c7bc5cf4cbf95a3e7744172cf27-0.
INFO 03-02 01:13:59 [logger.py:42] Received request cmpl-eb89faa701df432697fa88fe71f3487f-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:13:59 [async_llm.py:261] Added request cmpl-eb89faa701df432697fa88fe71f3487f-0.
INFO 03-02 01:14:00 [logger.py:42] Received request cmpl-32244e93b5bc45819c09d3fc933981c8-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:14:00 [async_llm.py:261] Added request cmpl-32244e93b5bc45819c09d3fc933981c8-0.
INFO 03-02 01:14:01 [logger.py:42] Received request cmpl-fbe4b6ba087448beb1467f0806601a3b-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:14:01 [async_llm.py:261] Added request cmpl-fbe4b6ba087448beb1467f0806601a3b-0.
INFO 03-02 01:14:02 [logger.py:42] Received request cmpl-295f52969ada46949ec436a3405e03d5-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:14:02 [async_llm.py:261] Added request cmpl-295f52969ada46949ec436a3405e03d5-0.
INFO 03-02 01:14:03 [logger.py:42] Received request cmpl-d06f4b5a607141849a5542450161ff76-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:14:03 [async_llm.py:261] Added request cmpl-d06f4b5a607141849a5542450161ff76-0.
INFO 03-02 01:14:05 [logger.py:42] Received request cmpl-463bff19f652435eb5dfdc2b3a6694f8-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:14:05 [async_llm.py:261] Added request cmpl-463bff19f652435eb5dfdc2b3a6694f8-0.
INFO 03-02 01:14:06 [logger.py:42] Received request cmpl-473de280669d49678d153395f8a9cf16-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:14:06 [async_llm.py:261] Added request cmpl-473de280669d49678d153395f8a9cf16-0.
INFO 03-02 01:14:07 [logger.py:42] Received request cmpl-f29711c3f3544cf698cfeff19c3ae844-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:14:07 [async_llm.py:261] Added request cmpl-f29711c3f3544cf698cfeff19c3ae844-0.
INFO 03-02 01:14:08 [loggers.py:116] Engine 000: Avg prompt throughput: 27.9 tokens/s, Avg generation throughput: 4.5 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.6%
INFO 03-02 01:14:08 [logger.py:42] Received request cmpl-f2cdb3056a0d49b2aec2276fd18dd990-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:14:08 [async_llm.py:261] Added request cmpl-f2cdb3056a0d49b2aec2276fd18dd990-0.
INFO 03-02 01:14:09 [logger.py:42] Received request cmpl-d549e9f66d684e7c86fe385e6147e3a6-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:14:09 [async_llm.py:261] Added request cmpl-d549e9f66d684e7c86fe385e6147e3a6-0.
INFO 03-02 01:14:10 [logger.py:42] Received request cmpl-e48e4321ca6d4bda9a4353db7f98be65-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:14:10 [async_llm.py:261] Added request cmpl-e48e4321ca6d4bda9a4353db7f98be65-0.
INFO 03-02 01:14:12 [logger.py:42] Received request cmpl-b56cb50f92cc4fc9b66086c9a3959e82-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:14:12 [async_llm.py:261] Added request cmpl-b56cb50f92cc4fc9b66086c9a3959e82-0.
INFO 03-02 01:14:13 [logger.py:42] Received request cmpl-7a68eb9f11d5455abb436a3724825382-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:14:13 [async_llm.py:261] Added request cmpl-7a68eb9f11d5455abb436a3724825382-0.
INFO 03-02 01:14:14 [logger.py:42] Received request cmpl-3a65d0ecbce74a76b69ceae47462c935-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:14:14 [async_llm.py:261] Added request cmpl-3a65d0ecbce74a76b69ceae47462c935-0.
INFO 03-02 01:14:15 [logger.py:42] Received request cmpl-4c539d424d114fa183141bc84e5530c6-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:14:15 [async_llm.py:261] Added request cmpl-4c539d424d114fa183141bc84e5530c6-0.
INFO 03-02 01:14:16 [logger.py:42] Received request cmpl-27c0b9a29f6f490f9c256d5e4d3f57c8-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:14:16 [async_llm.py:261] Added request cmpl-27c0b9a29f6f490f9c256d5e4d3f57c8-0.
INFO 03-02 01:14:17 [logger.py:42] Received request cmpl-dabca1d2801d47a6ba1c475216dd17ed-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:14:17 [async_llm.py:261] Added request cmpl-dabca1d2801d47a6ba1c475216dd17ed-0.
INFO 03-02 01:14:18 [loggers.py:116] Engine 000: Avg prompt throughput: 27.9 tokens/s, Avg generation throughput: 4.5 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.6%
INFO 03-02 01:14:18 [logger.py:42] Received request cmpl-b679d12c72494704a86bc46b60e0e15a-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:14:18 [async_llm.py:261] Added request cmpl-b679d12c72494704a86bc46b60e0e15a-0.
INFO 03-02 01:14:20 [logger.py:42] Received request cmpl-8cc8db0166474fa4ade3b45e876abea0-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:14:20 [async_llm.py:261] Added request cmpl-8cc8db0166474fa4ade3b45e876abea0-0.
INFO 03-02 01:14:21 [logger.py:42] Received request cmpl-47b6730a312046b5a790c1ed82121519-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:14:21 [async_llm.py:261] Added request cmpl-47b6730a312046b5a790c1ed82121519-0.
INFO 03-02 01:14:22 [logger.py:42] Received request cmpl-1b5328ab49bb4be798e5ff2533c50883-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:14:22 [async_llm.py:261] Added request cmpl-1b5328ab49bb4be798e5ff2533c50883-0.
INFO 03-02 01:14:23 [logger.py:42] Received request cmpl-fd17a2af76e44fb4837997ec76e76bfd-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:14:23 [async_llm.py:261] Added request cmpl-fd17a2af76e44fb4837997ec76e76bfd-0.
INFO 03-02 01:14:24 [logger.py:42] Received request cmpl-c27ad623938f414d8f6bfe099c935bf0-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:14:24 [async_llm.py:261] Added request cmpl-c27ad623938f414d8f6bfe099c935bf0-0.
INFO 03-02 01:14:25 [logger.py:42] Received request cmpl-cb88f02ac30b486da19769994f94d3ce-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:14:25 [async_llm.py:261] Added request cmpl-cb88f02ac30b486da19769994f94d3ce-0.
INFO 03-02 01:14:27 [logger.py:42] Received request cmpl-8df5f7668ecd452080460d52aa1939a6-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:14:27 [async_llm.py:261] Added request cmpl-8df5f7668ecd452080460d52aa1939a6-0.
INFO 03-02 01:14:28 [loggers.py:116] Engine 000: Avg prompt throughput: 24.8 tokens/s, Avg generation throughput: 4.0 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.6%
INFO 03-02 01:14:28 [logger.py:42] Received request cmpl-757c16452a234175be92a20d236382ac-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:14:28 [async_llm.py:261] Added request cmpl-757c16452a234175be92a20d236382ac-0.
INFO 03-02 01:14:29 [logger.py:42] Received request cmpl-1067215b2b71496baa950e7f55521d02-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:14:29 [async_llm.py:261] Added request cmpl-1067215b2b71496baa950e7f55521d02-0.
INFO 03-02 01:14:30 [logger.py:42] Received request cmpl-cb876d5c3ec746bda0f276bdfb0809de-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:14:30 [async_llm.py:261] Added request cmpl-cb876d5c3ec746bda0f276bdfb0809de-0.
INFO 03-02 01:14:31 [logger.py:42] Received request cmpl-04515532044a4ef3bce510557379e59d-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:14:31 [async_llm.py:261] Added request cmpl-04515532044a4ef3bce510557379e59d-0.
INFO 03-02 01:14:32 [logger.py:42] Received request cmpl-e5a536c4e0064c40b308a79b2e9a345c-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:14:32 [async_llm.py:261] Added request cmpl-e5a536c4e0064c40b308a79b2e9a345c-0.
INFO 03-02 01:14:33 [logger.py:42] Received request cmpl-454a8bb1d3614175affd60e585254e04-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:14:33 [async_llm.py:261] Added request cmpl-454a8bb1d3614175affd60e585254e04-0.
INFO 03-02 01:14:35 [logger.py:42] Received request cmpl-abe6e9cc4741405bba2827145e55f2a0-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:14:35 [async_llm.py:261] Added request cmpl-abe6e9cc4741405bba2827145e55f2a0-0.
INFO 03-02 01:14:36 [logger.py:42] Received request cmpl-930c1d63d1514261921a702c9df42bbf-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:14:36 [async_llm.py:261] Added request cmpl-930c1d63d1514261921a702c9df42bbf-0.
INFO 03-02 01:14:37 [logger.py:42] Received request cmpl-e3aa98e8602b4afa89f22e6962da028c-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:14:37 [async_llm.py:261] Added request cmpl-e3aa98e8602b4afa89f22e6962da028c-0.
INFO 03-02 01:14:38 [loggers.py:116] Engine 000: Avg prompt throughput: 27.9 tokens/s, Avg generation throughput: 4.5 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.6%
INFO 03-02 01:14:38 [logger.py:42] Received request cmpl-9fe502e49f02426cb9c40c302e9f15ea-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:14:38 [async_llm.py:261] Added request cmpl-9fe502e49f02426cb9c40c302e9f15ea-0.
INFO 03-02 01:14:39 [logger.py:42] Received request cmpl-2686ffea3a3244a5ae30122218d8d026-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:14:39 [async_llm.py:261] Added request cmpl-2686ffea3a3244a5ae30122218d8d026-0.
INFO 03-02 01:14:40 [logger.py:42] Received request cmpl-ea9ee4f97b9940abb5734c8e1413c5b1-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:14:40 [async_llm.py:261] Added request cmpl-ea9ee4f97b9940abb5734c8e1413c5b1-0.
INFO 03-02 01:14:42 [logger.py:42] Received request cmpl-bed71187d1534dfbb1b9dec161611dcc-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:14:42 [async_llm.py:261] Added request cmpl-bed71187d1534dfbb1b9dec161611dcc-0.
INFO 03-02 01:14:43 [logger.py:42] Received request cmpl-d1d1825f765f4224bceecc92631a1a04-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:14:43 [async_llm.py:261] Added request cmpl-d1d1825f765f4224bceecc92631a1a04-0.
INFO 03-02 01:14:44 [logger.py:42] Received request cmpl-1aa4237d0efd4621924e44462f63ee76-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:14:44 [async_llm.py:261] Added request cmpl-1aa4237d0efd4621924e44462f63ee76-0.
INFO 03-02 01:14:45 [logger.py:42] Received request cmpl-e48217c2f64e4b2c8669494bfd3607b8-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:14:45 [async_llm.py:261] Added request cmpl-e48217c2f64e4b2c8669494bfd3607b8-0.
INFO 03-02 01:14:46 [logger.py:42] Received request cmpl-994d4cf4d40345aba0bfd065644482a6-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:14:46 [async_llm.py:261] Added request cmpl-994d4cf4d40345aba0bfd065644482a6-0.
INFO 03-02 01:14:47 [logger.py:42] Received request cmpl-275bdd9e2c524264ba5342f5473b4192-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:14:47 [async_llm.py:261] Added request cmpl-275bdd9e2c524264ba5342f5473b4192-0.
INFO 03-02 01:14:48 [loggers.py:116] Engine 000: Avg prompt throughput: 27.9 tokens/s, Avg generation throughput: 4.5 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.6%
INFO 03-02 01:14:49 [logger.py:42] Received request cmpl-cbe255d6938446a9b506ef2d2627afc8-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:14:49 [async_llm.py:261] Added request cmpl-cbe255d6938446a9b506ef2d2627afc8-0.
INFO 03-02 01:14:50 [logger.py:42] Received request cmpl-d5b833d64eb5458ba1f8d24ba3212054-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:14:50 [async_llm.py:261] Added request cmpl-d5b833d64eb5458ba1f8d24ba3212054-0.
INFO 03-02 01:14:51 [logger.py:42] Received request cmpl-db95c9d9037442feada60782a5844960-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:14:51 [async_llm.py:261] Added request cmpl-db95c9d9037442feada60782a5844960-0.
INFO 03-02 01:14:52 [logger.py:42] Received request cmpl-4caa4c8df08f48adb625a086f0688908-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:14:52 [async_llm.py:261] Added request cmpl-4caa4c8df08f48adb625a086f0688908-0.
INFO 03-02 01:14:53 [logger.py:42] Received request cmpl-9928251f4af54abea9c310faf2b2901f-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:14:53 [async_llm.py:261] Added request cmpl-9928251f4af54abea9c310faf2b2901f-0.
INFO 03-02 01:14:54 [logger.py:42] Received request cmpl-51620fdf30b0406c9185de090d1f3273-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:14:54 [async_llm.py:261] Added request cmpl-51620fdf30b0406c9185de090d1f3273-0.
INFO 03-02 01:14:55 [logger.py:42] Received request cmpl-3abd16f2dae24b4eaf962abd0c811831-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:14:55 [async_llm.py:261] Added request cmpl-3abd16f2dae24b4eaf962abd0c811831-0.
INFO 03-02 01:14:57 [logger.py:42] Received request cmpl-a99c3883748d4c11adf73c1b22e06fd2-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:14:57 [async_llm.py:261] Added request cmpl-a99c3883748d4c11adf73c1b22e06fd2-0.
INFO 03-02 01:14:58 [loggers.py:116] Engine 000: Avg prompt throughput: 24.8 tokens/s, Avg generation throughput: 4.0 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.6%
INFO 03-02 01:14:58 [logger.py:42] Received request cmpl-b14dc934a5a3464bb0062df7f66bfae2-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:14:58 [async_llm.py:261] Added request cmpl-b14dc934a5a3464bb0062df7f66bfae2-0.
INFO 03-02 01:14:59 [logger.py:42] Received request cmpl-27e708c6039b4f8d84c4d12d834bc818-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:14:59 [async_llm.py:261] Added request cmpl-27e708c6039b4f8d84c4d12d834bc818-0.
INFO 03-02 01:15:00 [logger.py:42] Received request cmpl-b6024ba64f674d3f831b013eab3ce91f-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:15:00 [async_llm.py:261] Added request cmpl-b6024ba64f674d3f831b013eab3ce91f-0.
INFO 03-02 01:15:01 [logger.py:42] Received request cmpl-c9ca1adfe8f0402dacaf007c275737ed-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:15:01 [async_llm.py:261] Added request cmpl-c9ca1adfe8f0402dacaf007c275737ed-0.
INFO 03-02 01:15:02 [logger.py:42] Received request cmpl-1918260bec4140fa950991040adf9e0c-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:15:02 [async_llm.py:261] Added request cmpl-1918260bec4140fa950991040adf9e0c-0.
INFO 03-02 01:15:04 [logger.py:42] Received request cmpl-d35e081709354f51bd4efe9b0daa2332-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:15:04 [async_llm.py:261] Added request cmpl-d35e081709354f51bd4efe9b0daa2332-0.
INFO 03-02 01:15:05 [logger.py:42] Received request cmpl-ac33a107ec6e46f4bbf85770af87e615-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:15:05 [async_llm.py:261] Added request cmpl-ac33a107ec6e46f4bbf85770af87e615-0.
INFO 03-02 01:15:06 [logger.py:42] Received request cmpl-b99b422a3a3e482f85d8255769a6f923-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:15:06 [async_llm.py:261] Added request cmpl-b99b422a3a3e482f85d8255769a6f923-0.
INFO 03-02 01:15:07 [logger.py:42] Received request cmpl-9a9bd2aab6114c698b645db45cc8f613-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:15:07 [async_llm.py:261] Added request cmpl-9a9bd2aab6114c698b645db45cc8f613-0.
INFO 03-02 01:15:08 [loggers.py:116] Engine 000: Avg prompt throughput: 27.9 tokens/s, Avg generation throughput: 4.5 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.6%
INFO 03-02 01:15:08 [logger.py:42] Received request cmpl-a0e267c5a8874f8fa5848b98be65ef3d-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:15:08 [async_llm.py:261] Added request cmpl-a0e267c5a8874f8fa5848b98be65ef3d-0.
INFO 03-02 01:15:09 [logger.py:42] Received request cmpl-ddc987da2b1045049e11477c5b4360a9-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:15:09 [async_llm.py:261] Added request cmpl-ddc987da2b1045049e11477c5b4360a9-0.
INFO 03-02 01:15:10 [logger.py:42] Received request cmpl-5f96b8f6211c45bcbc213593f8b76836-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:15:10 [async_llm.py:261] Added request cmpl-5f96b8f6211c45bcbc213593f8b76836-0.
INFO 03-02 01:15:12 [logger.py:42] Received request cmpl-5d2b3b0d6dd649179298443073740147-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:15:12 [async_llm.py:261] Added request cmpl-5d2b3b0d6dd649179298443073740147-0.
INFO 03-02 01:15:13 [logger.py:42] Received request cmpl-8c3c5a09766248a2b55c937b2e792062-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:15:13 [async_llm.py:261] Added request cmpl-8c3c5a09766248a2b55c937b2e792062-0.
INFO 03-02 01:15:14 [logger.py:42] Received request cmpl-f49affaf0e7f43fabb634bc0268e8a92-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:15:14 [async_llm.py:261] Added request cmpl-f49affaf0e7f43fabb634bc0268e8a92-0.
INFO 03-02 01:15:15 [logger.py:42] Received request cmpl-8aed063be55a4e03bc7318716fa47c09-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:15:15 [async_llm.py:261] Added request cmpl-8aed063be55a4e03bc7318716fa47c09-0.
INFO 03-02 01:15:16 [logger.py:42] Received request cmpl-580c4e8e8e434b759328b20172cdd626-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:15:16 [async_llm.py:261] Added request cmpl-580c4e8e8e434b759328b20172cdd626-0.
INFO 03-02 01:15:17 [logger.py:42] Received request cmpl-56228fb09bd840469f349771080da522-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:15:17 [async_llm.py:261] Added request cmpl-56228fb09bd840469f349771080da522-0.
INFO 03-02 01:15:18 [loggers.py:116] Engine 000: Avg prompt throughput: 27.9 tokens/s, Avg generation throughput: 4.5 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.6%
INFO 03-02 01:15:19 [logger.py:42] Received request cmpl-a76b1b24cfa64910b894807582aaae39-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:15:19 [async_llm.py:261] Added request cmpl-a76b1b24cfa64910b894807582aaae39-0.
INFO 03-02 01:15:20 [logger.py:42] Received request cmpl-9c00592e403d42aea24b2532ff096d45-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:15:20 [async_llm.py:261] Added request cmpl-9c00592e403d42aea24b2532ff096d45-0.
INFO 03-02 01:15:21 [logger.py:42] Received request cmpl-beda5a75dd3b474aa5f7ccc3c503bf07-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:15:21 [async_llm.py:261] Added request cmpl-beda5a75dd3b474aa5f7ccc3c503bf07-0.
INFO 03-02 01:15:22 [logger.py:42] Received request cmpl-28799500f39c4c98b00f3020b32640c2-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:15:22 [async_llm.py:261] Added request cmpl-28799500f39c4c98b00f3020b32640c2-0.
INFO 03-02 01:15:23 [logger.py:42] Received request cmpl-cf3d35cc11014a81b9eff4e65f23aa7e-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:15:23 [async_llm.py:261] Added request cmpl-cf3d35cc11014a81b9eff4e65f23aa7e-0.
INFO 03-02 01:15:24 [logger.py:42] Received request cmpl-cc95f89906194289813cb314df897b3c-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:15:24 [async_llm.py:261] Added request cmpl-cc95f89906194289813cb314df897b3c-0.
INFO 03-02 01:15:25 [logger.py:42] Received request cmpl-9c57c21569a94beeabddd353185298c6-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:15:25 [async_llm.py:261] Added request cmpl-9c57c21569a94beeabddd353185298c6-0.
INFO 03-02 01:15:27 [logger.py:42] Received request cmpl-9c5a951ad38a4f9a8f5bb4a372dae4d7-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:15:27 [async_llm.py:261] Added request cmpl-9c5a951ad38a4f9a8f5bb4a372dae4d7-0.
INFO 03-02 01:15:28 [loggers.py:116] Engine 000: Avg prompt throughput: 24.8 tokens/s, Avg generation throughput: 4.0 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.6%
INFO 03-02 01:15:28 [logger.py:42] Received request cmpl-2e36cd4460944a00b4bd9c1114817c45-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:15:28 [async_llm.py:261] Added request cmpl-2e36cd4460944a00b4bd9c1114817c45-0.
INFO 03-02 01:15:29 [logger.py:42] Received request cmpl-a658c5dd9e0b4cbc9ae6f5d3815a5754-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:15:29 [async_llm.py:261] Added request cmpl-a658c5dd9e0b4cbc9ae6f5d3815a5754-0.
INFO 03-02 01:15:30 [logger.py:42] Received request cmpl-abc386a6953d412181e5132d45a7d848-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:15:30 [async_llm.py:261] Added request cmpl-abc386a6953d412181e5132d45a7d848-0.
INFO 03-02 01:15:31 [logger.py:42] Received request cmpl-e2fe125f789640febbb8a14b3af42ed0-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:15:31 [async_llm.py:261] Added request cmpl-e2fe125f789640febbb8a14b3af42ed0-0.
INFO 03-02 01:15:32 [logger.py:42] Received request cmpl-1b35dfe9f4c34bda8d744348bc3c6451-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:15:32 [async_llm.py:261] Added request cmpl-1b35dfe9f4c34bda8d744348bc3c6451-0.
INFO 03-02 01:15:34 [logger.py:42] Received request cmpl-87eefe48317b439c9964ccf459f89175-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:15:34 [async_llm.py:261] Added request cmpl-87eefe48317b439c9964ccf459f89175-0.
INFO 03-02 01:15:35 [logger.py:42] Received request cmpl-a352a2b23c05480ea3c2abb092ed2966-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:15:35 [async_llm.py:261] Added request cmpl-a352a2b23c05480ea3c2abb092ed2966-0.
INFO 03-02 01:15:36 [logger.py:42] Received request cmpl-08b75eabd67f4f90b53853e7bea74edd-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:15:36 [async_llm.py:261] Added request cmpl-08b75eabd67f4f90b53853e7bea74edd-0.
INFO 03-02 01:15:37 [logger.py:42] Received request cmpl-c64c5cc13f1b4b5391fdee3891956c01-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:15:37 [async_llm.py:261] Added request cmpl-c64c5cc13f1b4b5391fdee3891956c01-0.
INFO 03-02 01:15:38 [loggers.py:116] Engine 000: Avg prompt throughput: 27.9 tokens/s, Avg generation throughput: 4.5 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.6%
INFO 03-02 01:15:38 [logger.py:42] Received request cmpl-d1b18c0934194c09ab44a4253ae040e9-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:15:38 [async_llm.py:261] Added request cmpl-d1b18c0934194c09ab44a4253ae040e9-0.
INFO 03-02 01:15:39 [logger.py:42] Received request cmpl-985c12d01faf4bd48d347aa196915e78-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:15:39 [async_llm.py:261] Added request cmpl-985c12d01faf4bd48d347aa196915e78-0.
INFO 03-02 01:15:40 [logger.py:42] Received request cmpl-24f7acc7dd214a95aea91c62ce54d1b7-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:15:40 [async_llm.py:261] Added request cmpl-24f7acc7dd214a95aea91c62ce54d1b7-0.
INFO 03-02 01:15:42 [logger.py:42] Received request cmpl-f62c1f0d929f4ae6817c05f275661aee-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:15:42 [async_llm.py:261] Added request cmpl-f62c1f0d929f4ae6817c05f275661aee-0.
INFO 03-02 01:15:43 [logger.py:42] Received request cmpl-f45ca62dc0ce492b959bace8c6074c18-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:15:43 [async_llm.py:261] Added request cmpl-f45ca62dc0ce492b959bace8c6074c18-0.
INFO 03-02 01:15:44 [logger.py:42] Received request cmpl-4749db175abf4506999879e3a0915c06-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:15:44 [async_llm.py:261] Added request cmpl-4749db175abf4506999879e3a0915c06-0.
INFO 03-02 01:15:45 [logger.py:42] Received request cmpl-4c81ae2d627f4cecac6c0f2dbc3c5248-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:15:45 [async_llm.py:261] Added request cmpl-4c81ae2d627f4cecac6c0f2dbc3c5248-0.
INFO 03-02 01:15:46 [logger.py:42] Received request cmpl-bd06d0c61f3b49658fd52099b3f73fe5-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:15:46 [async_llm.py:261] Added request cmpl-bd06d0c61f3b49658fd52099b3f73fe5-0.
INFO 03-02 01:15:47 [logger.py:42] Received request cmpl-c7905b6da5f7408c9537c5897bf55043-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:15:47 [async_llm.py:261] Added request cmpl-c7905b6da5f7408c9537c5897bf55043-0.
INFO 03-02 01:15:48 [loggers.py:116] Engine 000: Avg prompt throughput: 27.9 tokens/s, Avg generation throughput: 4.5 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.6%
INFO 03-02 01:15:49 [logger.py:42] Received request cmpl-a16e1eacf84347f8bb2f9d2cdafa64d5-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:15:49 [async_llm.py:261] Added request cmpl-a16e1eacf84347f8bb2f9d2cdafa64d5-0.
INFO 03-02 01:15:50 [logger.py:42] Received request cmpl-fb906f9ea5fc4541878a7bdd2248b0d9-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:15:50 [async_llm.py:261] Added request cmpl-fb906f9ea5fc4541878a7bdd2248b0d9-0.
INFO 03-02 01:15:51 [logger.py:42] Received request cmpl-f3d36e62aa584771ac2523f0580bb649-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:15:51 [async_llm.py:261] Added request cmpl-f3d36e62aa584771ac2523f0580bb649-0.
INFO 03-02 01:15:52 [logger.py:42] Received request cmpl-6160932117a24a4d8e09f3bac11c5685-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:15:52 [async_llm.py:261] Added request cmpl-6160932117a24a4d8e09f3bac11c5685-0.
INFO 03-02 01:15:53 [logger.py:42] Received request cmpl-c83a737f98ee4518ae3d590a62f86c6d-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:15:53 [async_llm.py:261] Added request cmpl-c83a737f98ee4518ae3d590a62f86c6d-0.
INFO 03-02 01:15:54 [logger.py:42] Received request cmpl-afc138ee3d6d45358309b5ee43c833d9-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:15:54 [async_llm.py:261] Added request cmpl-afc138ee3d6d45358309b5ee43c833d9-0.
INFO 03-02 01:15:55 [logger.py:42] Received request cmpl-ab2abc82f02d4f00b504f4fe7f514ab8-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:15:55 [async_llm.py:261] Added request cmpl-ab2abc82f02d4f00b504f4fe7f514ab8-0.
INFO 03-02 01:15:57 [logger.py:42] Received request cmpl-20550a9262d34ad48cb22db62b9ff381-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:15:57 [async_llm.py:261] Added request cmpl-20550a9262d34ad48cb22db62b9ff381-0.
INFO 03-02 01:15:58 [loggers.py:116] Engine 000: Avg prompt throughput: 24.8 tokens/s, Avg generation throughput: 4.0 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.6%
INFO 03-02 01:15:58 [logger.py:42] Received request cmpl-f305bf68425c444f9a4005690265d081-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:15:58 [async_llm.py:261] Added request cmpl-f305bf68425c444f9a4005690265d081-0.
INFO 03-02 01:15:59 [logger.py:42] Received request cmpl-103985a7b55547f093acbb4b1378edd5-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:15:59 [async_llm.py:261] Added request cmpl-103985a7b55547f093acbb4b1378edd5-0.
INFO 03-02 01:16:00 [logger.py:42] Received request cmpl-5fc08d79d1d24e7b9e86b1fb9c2bd3e4-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:16:00 [async_llm.py:261] Added request cmpl-5fc08d79d1d24e7b9e86b1fb9c2bd3e4-0.
INFO 03-02 01:16:01 [logger.py:42] Received request cmpl-7d0dc9d4ba6d4bf5b186d9f40d133492-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:16:01 [async_llm.py:261] Added request cmpl-7d0dc9d4ba6d4bf5b186d9f40d133492-0.
INFO 03-02 01:16:02 [logger.py:42] Received request cmpl-3dfb767540024ca69a2baf41f4d9654c-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:16:02 [async_llm.py:261] Added request cmpl-3dfb767540024ca69a2baf41f4d9654c-0.
INFO 03-02 01:16:04 [logger.py:42] Received request cmpl-a4a58686b86c4e9b8ba7a89ae06aeea4-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:16:04 [async_llm.py:261] Added request cmpl-a4a58686b86c4e9b8ba7a89ae06aeea4-0.
INFO 03-02 01:16:05 [logger.py:42] Received request cmpl-a054e73767234ba2a2d7b392193ff394-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:16:05 [async_llm.py:261] Added request cmpl-a054e73767234ba2a2d7b392193ff394-0.
INFO 03-02 01:16:06 [logger.py:42] Received request cmpl-999658f75ca54b04891447ecc2de620a-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:16:06 [async_llm.py:261] Added request cmpl-999658f75ca54b04891447ecc2de620a-0.
INFO 03-02 01:16:07 [logger.py:42] Received request cmpl-846883e43fef4d5abd643a6b182793d4-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:16:07 [async_llm.py:261] Added request cmpl-846883e43fef4d5abd643a6b182793d4-0.
INFO 03-02 01:16:08 [loggers.py:116] Engine 000: Avg prompt throughput: 27.9 tokens/s, Avg generation throughput: 4.5 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.6%
INFO 03-02 01:16:08 [logger.py:42] Received request cmpl-3e6879dd943245cebbdf33834110ccb4-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:16:08 [async_llm.py:261] Added request cmpl-3e6879dd943245cebbdf33834110ccb4-0.
INFO 03-02 01:16:09 [logger.py:42] Received request cmpl-9148ccc052f34417b38ec37bc659c70d-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:16:09 [async_llm.py:261] Added request cmpl-9148ccc052f34417b38ec37bc659c70d-0.
INFO 03-02 01:16:11 [logger.py:42] Received request cmpl-5a16eaeb4b504a8f96fb574c914a63ce-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:16:11 [async_llm.py:261] Added request cmpl-5a16eaeb4b504a8f96fb574c914a63ce-0.
INFO 03-02 01:16:12 [logger.py:42] Received request cmpl-0e55b30650c4442b80c7ef26982e8ba8-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:16:12 [async_llm.py:261] Added request cmpl-0e55b30650c4442b80c7ef26982e8ba8-0.
INFO 03-02 01:16:13 [logger.py:42] Received request cmpl-f4375611b66d40cd8ee923b3515d05cb-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:16:13 [async_llm.py:261] Added request cmpl-f4375611b66d40cd8ee923b3515d05cb-0.
INFO 03-02 01:16:14 [logger.py:42] Received request cmpl-c53f14896276497bb9dbc74fede03121-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:16:14 [async_llm.py:261] Added request cmpl-c53f14896276497bb9dbc74fede03121-0.
INFO 03-02 01:16:15 [logger.py:42] Received request cmpl-56e51de9febd4a4d9df8205254d2d776-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:16:15 [async_llm.py:261] Added request cmpl-56e51de9febd4a4d9df8205254d2d776-0.
INFO 03-02 01:16:16 [logger.py:42] Received request cmpl-8d89ad69194b416fae70befdfaae17bf-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:16:16 [async_llm.py:261] Added request cmpl-8d89ad69194b416fae70befdfaae17bf-0.
INFO 03-02 01:16:17 [logger.py:42] Received request cmpl-8fbc4dd023314ecd96b5dc12d1e68e80-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:16:17 [async_llm.py:261] Added request cmpl-8fbc4dd023314ecd96b5dc12d1e68e80-0.
INFO 03-02 01:16:18 [loggers.py:116] Engine 000: Avg prompt throughput: 27.9 tokens/s, Avg generation throughput: 4.5 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.6%
INFO 03-02 01:16:19 [logger.py:42] Received request cmpl-9e2db70ffc9c4f2d8505a681d0b69868-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:16:19 [async_llm.py:261] Added request cmpl-9e2db70ffc9c4f2d8505a681d0b69868-0.
INFO 03-02 01:16:20 [logger.py:42] Received request cmpl-ecc95964f2394bb2b454ea775dd2d52c-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:16:20 [async_llm.py:261] Added request cmpl-ecc95964f2394bb2b454ea775dd2d52c-0.
INFO 03-02 01:16:21 [logger.py:42] Received request cmpl-c72409a456dd483ea3764d7c00b6f18f-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:16:21 [async_llm.py:261] Added request cmpl-c72409a456dd483ea3764d7c00b6f18f-0.
INFO 03-02 01:16:22 [logger.py:42] Received request cmpl-e85ef8bb823f4c019dc5a06bb07ffc21-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:16:22 [async_llm.py:261] Added request cmpl-e85ef8bb823f4c019dc5a06bb07ffc21-0.
INFO 03-02 01:16:23 [logger.py:42] Received request cmpl-ff09e67e439f4d4bab4fb7398c2cf562-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:16:23 [async_llm.py:261] Added request cmpl-ff09e67e439f4d4bab4fb7398c2cf562-0.
INFO 03-02 01:16:24 [logger.py:42] Received request cmpl-9c04a297e30e40f59592d14ab0cd0636-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:16:24 [async_llm.py:261] Added request cmpl-9c04a297e30e40f59592d14ab0cd0636-0.
INFO 03-02 01:16:26 [logger.py:42] Received request cmpl-0b0f35b57de94bb79100c04a86f0df15-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:16:26 [async_llm.py:261] Added request cmpl-0b0f35b57de94bb79100c04a86f0df15-0.
INFO 03-02 01:16:27 [logger.py:42] Received request cmpl-a117a5a91ed84595b6e386c8d923e665-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:16:27 [async_llm.py:261] Added request cmpl-a117a5a91ed84595b6e386c8d923e665-0.
INFO 03-02 01:16:28 [loggers.py:116] Engine 000: Avg prompt throughput: 24.8 tokens/s, Avg generation throughput: 4.0 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.6%
INFO 03-02 01:16:28 [logger.py:42] Received request cmpl-c5eed788517c4e1ebfef251d940b9c08-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:16:28 [async_llm.py:261] Added request cmpl-c5eed788517c4e1ebfef251d940b9c08-0.
INFO 03-02 01:16:29 [logger.py:42] Received request cmpl-a8973a0868cd43419fb6dbd136c03228-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:16:29 [async_llm.py:261] Added request cmpl-a8973a0868cd43419fb6dbd136c03228-0.
INFO 03-02 01:16:30 [logger.py:42] Received request cmpl-332ab1426c3c43f4adeceffb4169d357-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:16:30 [async_llm.py:261] Added request cmpl-332ab1426c3c43f4adeceffb4169d357-0.
INFO 03-02 01:16:31 [logger.py:42] Received request cmpl-b02c0bbe79524010877df3f7a2fbe30f-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:16:31 [async_llm.py:261] Added request cmpl-b02c0bbe79524010877df3f7a2fbe30f-0.
INFO 03-02 01:16:32 [logger.py:42] Received request cmpl-d0cd4964989f47b39dcce83864764902-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:16:32 [async_llm.py:261] Added request cmpl-d0cd4964989f47b39dcce83864764902-0.
INFO 03-02 01:16:34 [logger.py:42] Received request cmpl-4e89819cb7fe45c692a0d02bb9633d4e-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:16:34 [async_llm.py:261] Added request cmpl-4e89819cb7fe45c692a0d02bb9633d4e-0.
INFO 03-02 01:16:35 [logger.py:42] Received request cmpl-066375dda42d48a6a64a4c46b7a4f434-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:16:35 [async_llm.py:261] Added request cmpl-066375dda42d48a6a64a4c46b7a4f434-0.
INFO 03-02 01:16:36 [logger.py:42] Received request cmpl-643e6a1f2b3643f78213b6e82bbd4db3-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:16:36 [async_llm.py:261] Added request cmpl-643e6a1f2b3643f78213b6e82bbd4db3-0.
INFO 03-02 01:16:37 [logger.py:42] Received request cmpl-c4890dbe9958487d8bc6a1bd5fb45dce-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:16:37 [async_llm.py:261] Added request cmpl-c4890dbe9958487d8bc6a1bd5fb45dce-0.
INFO 03-02 01:16:38 [loggers.py:116] Engine 000: Avg prompt throughput: 27.9 tokens/s, Avg generation throughput: 4.5 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.6%
INFO 03-02 01:16:38 [logger.py:42] Received request cmpl-42c4841d729e477e852f10a3315a28b9-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:16:38 [async_llm.py:261] Added request cmpl-42c4841d729e477e852f10a3315a28b9-0.
INFO 03-02 01:16:39 [logger.py:42] Received request cmpl-98b0db0b022745dbb44f94925220fc40-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:16:39 [async_llm.py:261] Added request cmpl-98b0db0b022745dbb44f94925220fc40-0.
INFO 03-02 01:16:41 [logger.py:42] Received request cmpl-f521513ad12c4768ba25ec3bfae12afe-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:16:41 [async_llm.py:261] Added request cmpl-f521513ad12c4768ba25ec3bfae12afe-0.
INFO 03-02 01:16:42 [logger.py:42] Received request cmpl-497dfe0797ea485aa0383c5d03a53788-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:16:42 [async_llm.py:261] Added request cmpl-497dfe0797ea485aa0383c5d03a53788-0.
INFO 03-02 01:16:43 [logger.py:42] Received request cmpl-af9a1a5d5923477c8c7097b422a80fa3-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:16:43 [async_llm.py:261] Added request cmpl-af9a1a5d5923477c8c7097b422a80fa3-0.
INFO 03-02 01:16:44 [logger.py:42] Received request cmpl-878f467d4fc944599f78b0fafeffe033-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:16:44 [async_llm.py:261] Added request cmpl-878f467d4fc944599f78b0fafeffe033-0.
INFO 03-02 01:16:45 [logger.py:42] Received request cmpl-19b6574235a74938bdb502fdb9e6bf09-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:16:45 [async_llm.py:261] Added request cmpl-19b6574235a74938bdb502fdb9e6bf09-0.
INFO 03-02 01:16:46 [logger.py:42] Received request cmpl-53e39f0c5df04d42bae4df2000096903-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:16:46 [async_llm.py:261] Added request cmpl-53e39f0c5df04d42bae4df2000096903-0.
INFO 03-02 01:16:47 [logger.py:42] Received request cmpl-b2a8ba31044f4fbf9a716dd2c1c36711-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:16:47 [async_llm.py:261] Added request cmpl-b2a8ba31044f4fbf9a716dd2c1c36711-0.
INFO 03-02 01:16:48 [loggers.py:116] Engine 000: Avg prompt throughput: 27.9 tokens/s, Avg generation throughput: 4.5 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.6%
INFO 03-02 01:16:49 [logger.py:42] Received request cmpl-91294366fbd7494bb9341926b989b9a4-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:16:49 [async_llm.py:261] Added request cmpl-91294366fbd7494bb9341926b989b9a4-0.
INFO 03-02 01:16:50 [logger.py:42] Received request cmpl-87f395d7ae604ce0b7b577aa759d62fb-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:16:50 [async_llm.py:261] Added request cmpl-87f395d7ae604ce0b7b577aa759d62fb-0.
INFO 03-02 01:16:51 [logger.py:42] Received request cmpl-f5e0d14890f24f188f172f77c6c330d8-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:16:51 [async_llm.py:261] Added request cmpl-f5e0d14890f24f188f172f77c6c330d8-0.
INFO 03-02 01:16:52 [logger.py:42] Received request cmpl-21ca6bcd3e8542979a438170f73ba13c-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:16:52 [async_llm.py:261] Added request cmpl-21ca6bcd3e8542979a438170f73ba13c-0.
INFO 03-02 01:16:53 [logger.py:42] Received request cmpl-0ea9b705538748409dd4a6a11eca0f5e-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:16:53 [async_llm.py:261] Added request cmpl-0ea9b705538748409dd4a6a11eca0f5e-0.
INFO 03-02 01:16:54 [logger.py:42] Received request cmpl-51b9f2c262c845aa9c6466da4b555390-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:16:54 [async_llm.py:261] Added request cmpl-51b9f2c262c845aa9c6466da4b555390-0.
INFO 03-02 01:16:56 [logger.py:42] Received request cmpl-0a96aa7d59554c45939ea38d72c05e9d-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:16:56 [async_llm.py:261] Added request cmpl-0a96aa7d59554c45939ea38d72c05e9d-0.
INFO 03-02 01:16:57 [logger.py:42] Received request cmpl-782ad6b04a1b4a11a2e5d8630805f8de-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:16:57 [async_llm.py:261] Added request cmpl-782ad6b04a1b4a11a2e5d8630805f8de-0.
INFO 03-02 01:16:58 [loggers.py:116] Engine 000: Avg prompt throughput: 24.8 tokens/s, Avg generation throughput: 4.0 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.6%
INFO 03-02 01:16:58 [logger.py:42] Received request cmpl-63171c13fcc94bb69514ac3f2511f08f-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:16:58 [async_llm.py:261] Added request cmpl-63171c13fcc94bb69514ac3f2511f08f-0.
INFO 03-02 01:16:59 [logger.py:42] Received request cmpl-0fbdfe862156409ab37ce8896b62d634-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:16:59 [async_llm.py:261] Added request cmpl-0fbdfe862156409ab37ce8896b62d634-0.
INFO 03-02 01:17:00 [logger.py:42] Received request cmpl-cdb7fa0257ba4a73bed615a8eb81410e-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:17:00 [async_llm.py:261] Added request cmpl-cdb7fa0257ba4a73bed615a8eb81410e-0.
INFO 03-02 01:17:01 [logger.py:42] Received request cmpl-1c4a97a03a7c4f4d9d764a29ccdd3e1e-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:17:01 [async_llm.py:261] Added request cmpl-1c4a97a03a7c4f4d9d764a29ccdd3e1e-0.
INFO 03-02 01:17:02 [logger.py:42] Received request cmpl-5e5bd456f3b0496da56d14c4c4604841-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:17:02 [async_llm.py:261] Added request cmpl-5e5bd456f3b0496da56d14c4c4604841-0.
INFO 03-02 01:17:04 [logger.py:42] Received request cmpl-aec7cfd79b354469a2875a29e6649da3-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:17:04 [async_llm.py:261] Added request cmpl-aec7cfd79b354469a2875a29e6649da3-0.
INFO 03-02 01:17:05 [logger.py:42] Received request cmpl-47196811e889494287f7e61ff893fe1b-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:17:05 [async_llm.py:261] Added request cmpl-47196811e889494287f7e61ff893fe1b-0.
INFO 03-02 01:17:06 [logger.py:42] Received request cmpl-ba2eb8a550da445bbb117368dba351c2-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:17:06 [async_llm.py:261] Added request cmpl-ba2eb8a550da445bbb117368dba351c2-0.
INFO 03-02 01:17:07 [logger.py:42] Received request cmpl-fe11677b270c452a8971d7501190f1ea-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:17:07 [async_llm.py:261] Added request cmpl-fe11677b270c452a8971d7501190f1ea-0.
INFO 03-02 01:17:08 [loggers.py:116] Engine 000: Avg prompt throughput: 27.9 tokens/s, Avg generation throughput: 4.5 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.6%
INFO 03-02 01:17:08 [logger.py:42] Received request cmpl-60f2b49d24084fe0b3f116ae02601ebc-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:17:08 [async_llm.py:261] Added request cmpl-60f2b49d24084fe0b3f116ae02601ebc-0.
INFO 03-02 01:17:09 [logger.py:42] Received request cmpl-b9518ce502ee4cf5959f1205c4f647b7-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:17:09 [async_llm.py:261] Added request cmpl-b9518ce502ee4cf5959f1205c4f647b7-0.
INFO 03-02 01:17:11 [logger.py:42] Received request cmpl-a8f69a712d99461da4c8c6f7f11b1e1d-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:17:11 [async_llm.py:261] Added request cmpl-a8f69a712d99461da4c8c6f7f11b1e1d-0.
INFO 03-02 01:17:12 [logger.py:42] Received request cmpl-83a55d36ba0e48b8ba646dd06d68d1fe-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:17:12 [async_llm.py:261] Added request cmpl-83a55d36ba0e48b8ba646dd06d68d1fe-0.
INFO 03-02 01:17:13 [logger.py:42] Received request cmpl-40ab176bed3b4c7eb8ab4d1d718fefdb-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:17:13 [async_llm.py:261] Added request cmpl-40ab176bed3b4c7eb8ab4d1d718fefdb-0.
INFO 03-02 01:17:14 [logger.py:42] Received request cmpl-14eab0c09e8d45788160cceef101a69e-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:17:14 [async_llm.py:261] Added request cmpl-14eab0c09e8d45788160cceef101a69e-0.
INFO 03-02 01:17:15 [logger.py:42] Received request cmpl-aaaa792b2cff49d3b1acc90b584fc811-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:17:15 [async_llm.py:261] Added request cmpl-aaaa792b2cff49d3b1acc90b584fc811-0.
INFO 03-02 01:17:16 [logger.py:42] Received request cmpl-a93bc71406ef4b6d862bde5cbbb54521-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:17:16 [async_llm.py:261] Added request cmpl-a93bc71406ef4b6d862bde5cbbb54521-0.
INFO 03-02 01:17:17 [logger.py:42] Received request cmpl-03e3230d151944ac86dc0bd51af326a3-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:17:17 [async_llm.py:261] Added request cmpl-03e3230d151944ac86dc0bd51af326a3-0.
INFO 03-02 01:17:18 [loggers.py:116] Engine 000: Avg prompt throughput: 27.9 tokens/s, Avg generation throughput: 4.5 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.6%
INFO 03-02 01:17:19 [logger.py:42] Received request cmpl-75ac7316dfd944b8a434b355372efc31-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:17:19 [async_llm.py:261] Added request cmpl-75ac7316dfd944b8a434b355372efc31-0.
INFO 03-02 01:17:20 [logger.py:42] Received request cmpl-0077c982764b4731893215de7ea08f23-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:17:20 [async_llm.py:261] Added request cmpl-0077c982764b4731893215de7ea08f23-0.
INFO 03-02 01:17:21 [logger.py:42] Received request cmpl-0774ddbb587d4f8daadafe098ee6d616-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:17:21 [async_llm.py:261] Added request cmpl-0774ddbb587d4f8daadafe098ee6d616-0.
INFO 03-02 01:17:22 [logger.py:42] Received request cmpl-ff3b54ca5f94440881288470b5c89638-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:17:22 [async_llm.py:261] Added request cmpl-ff3b54ca5f94440881288470b5c89638-0.
INFO 03-02 01:17:23 [logger.py:42] Received request cmpl-efb98eaf76af4754a951812722718743-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:17:23 [async_llm.py:261] Added request cmpl-efb98eaf76af4754a951812722718743-0.
INFO 03-02 01:17:24 [logger.py:42] Received request cmpl-c280fae6769344998e2185521fcc8fda-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:17:24 [async_llm.py:261] Added request cmpl-c280fae6769344998e2185521fcc8fda-0.
INFO 03-02 01:17:26 [logger.py:42] Received request cmpl-59646a3c3682475899ebf37efd57dc9f-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:17:26 [async_llm.py:261] Added request cmpl-59646a3c3682475899ebf37efd57dc9f-0.
INFO 03-02 01:17:27 [logger.py:42] Received request cmpl-ae482bbd47d54ad4af4a94d7dfa57ff0-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:17:27 [async_llm.py:261] Added request cmpl-ae482bbd47d54ad4af4a94d7dfa57ff0-0.
INFO 03-02 01:17:28 [loggers.py:116] Engine 000: Avg prompt throughput: 24.8 tokens/s, Avg generation throughput: 4.0 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.6%
INFO 03-02 01:17:28 [logger.py:42] Received request cmpl-7191596f00f740f59a93902802cd1ecd-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:17:28 [async_llm.py:261] Added request cmpl-7191596f00f740f59a93902802cd1ecd-0.
INFO 03-02 01:17:29 [logger.py:42] Received request cmpl-46da8274674c4a2798743cd985efdcec-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:17:29 [async_llm.py:261] Added request cmpl-46da8274674c4a2798743cd985efdcec-0.
INFO 03-02 01:17:30 [logger.py:42] Received request cmpl-b8dcd14cc19440d0accacf67e9bb4f6b-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:17:30 [async_llm.py:261] Added request cmpl-b8dcd14cc19440d0accacf67e9bb4f6b-0.
INFO 03-02 01:17:31 [logger.py:42] Received request cmpl-5a44adb7d2404f47af52547f9a789d56-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:17:31 [async_llm.py:261] Added request cmpl-5a44adb7d2404f47af52547f9a789d56-0.
INFO 03-02 01:17:32 [logger.py:42] Received request cmpl-5f57212e6aaa41818268e47563340b9c-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:17:32 [async_llm.py:261] Added request cmpl-5f57212e6aaa41818268e47563340b9c-0.
INFO 03-02 01:17:34 [logger.py:42] Received request cmpl-298eecce913c49a9ba38d315d94e4e62-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:17:34 [async_llm.py:261] Added request cmpl-298eecce913c49a9ba38d315d94e4e62-0.
INFO 03-02 01:17:35 [logger.py:42] Received request cmpl-a8a9eea0378c4c49bcc08df96f96c9a9-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:17:35 [async_llm.py:261] Added request cmpl-a8a9eea0378c4c49bcc08df96f96c9a9-0.
INFO 03-02 01:17:36 [logger.py:42] Received request cmpl-c79f2ada66d94d21bb5eb03167221858-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:17:36 [async_llm.py:261] Added request cmpl-c79f2ada66d94d21bb5eb03167221858-0.
INFO 03-02 01:17:37 [logger.py:42] Received request cmpl-5d0c21b5c9334847a59f07702bddc5a6-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:17:37 [async_llm.py:261] Added request cmpl-5d0c21b5c9334847a59f07702bddc5a6-0.
INFO 03-02 01:17:38 [loggers.py:116] Engine 000: Avg prompt throughput: 27.9 tokens/s, Avg generation throughput: 4.5 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.6%
INFO 03-02 01:17:38 [logger.py:42] Received request cmpl-5b88827b51104579917c9e338446f625-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:17:38 [async_llm.py:261] Added request cmpl-5b88827b51104579917c9e338446f625-0.
INFO 03-02 01:17:39 [logger.py:42] Received request cmpl-659602993c2f47d99f4f107b9bf0ab47-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:17:39 [async_llm.py:261] Added request cmpl-659602993c2f47d99f4f107b9bf0ab47-0.
INFO 03-02 01:17:41 [logger.py:42] Received request cmpl-ee04f9f35cfc469eaeba3b2e6cf0fde4-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:17:41 [async_llm.py:261] Added request cmpl-ee04f9f35cfc469eaeba3b2e6cf0fde4-0.
INFO 03-02 01:17:42 [logger.py:42] Received request cmpl-e5a7b70c0ba7402aaacb0022b908f682-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:17:42 [async_llm.py:261] Added request cmpl-e5a7b70c0ba7402aaacb0022b908f682-0.
INFO 03-02 01:17:43 [logger.py:42] Received request cmpl-300e57de68fb4d64a670e84e85001493-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:17:43 [async_llm.py:261] Added request cmpl-300e57de68fb4d64a670e84e85001493-0.
INFO 03-02 01:17:44 [logger.py:42] Received request cmpl-0014b82a7dbb4af88e15228b2f8cbc97-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:17:44 [async_llm.py:261] Added request cmpl-0014b82a7dbb4af88e15228b2f8cbc97-0.
INFO 03-02 01:17:45 [logger.py:42] Received request cmpl-ea2a0b2ff0c94a598de57e12434eab3b-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:17:45 [async_llm.py:261] Added request cmpl-ea2a0b2ff0c94a598de57e12434eab3b-0.
INFO 03-02 01:17:46 [logger.py:42] Received request cmpl-0dfa5bc11cd9498285d4dac7fb1291ff-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:17:46 [async_llm.py:261] Added request cmpl-0dfa5bc11cd9498285d4dac7fb1291ff-0.
INFO 03-02 01:17:47 [logger.py:42] Received request cmpl-76543c57f4e546a39bd9fcf0d883470e-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:17:47 [async_llm.py:261] Added request cmpl-76543c57f4e546a39bd9fcf0d883470e-0.
INFO 03-02 01:17:48 [loggers.py:116] Engine 000: Avg prompt throughput: 27.9 tokens/s, Avg generation throughput: 4.5 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.6%
INFO 03-02 01:17:49 [logger.py:42] Received request cmpl-e5fd2af1e1964f5da13c21c37d17fdf7-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:17:49 [async_llm.py:261] Added request cmpl-e5fd2af1e1964f5da13c21c37d17fdf7-0.
INFO 03-02 01:17:50 [logger.py:42] Received request cmpl-44f6bbbae09942dab398eb1c06d66780-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:17:50 [async_llm.py:261] Added request cmpl-44f6bbbae09942dab398eb1c06d66780-0.
INFO 03-02 01:17:51 [logger.py:42] Received request cmpl-27c8ff804771417eaafcc79a6e6c7a60-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:17:51 [async_llm.py:261] Added request cmpl-27c8ff804771417eaafcc79a6e6c7a60-0.
INFO 03-02 01:17:52 [logger.py:42] Received request cmpl-a05cffb6791f416c93450af9d7fe781b-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:17:52 [async_llm.py:261] Added request cmpl-a05cffb6791f416c93450af9d7fe781b-0.
INFO 03-02 01:17:53 [logger.py:42] Received request cmpl-71cad4f3162646efa41af29a8daa6021-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:17:53 [async_llm.py:261] Added request cmpl-71cad4f3162646efa41af29a8daa6021-0.
INFO 03-02 01:17:54 [logger.py:42] Received request cmpl-9c8ee4bc90d44532b5b35379da54203d-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:17:54 [async_llm.py:261] Added request cmpl-9c8ee4bc90d44532b5b35379da54203d-0.
INFO 03-02 01:17:55 [logger.py:42] Received request cmpl-b19645dbc0b7433396f3bb41c66ab629-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:17:55 [async_llm.py:261] Added request cmpl-b19645dbc0b7433396f3bb41c66ab629-0.
INFO 03-02 01:17:57 [logger.py:42] Received request cmpl-c9ca48ca859145dbb33381181462da11-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:17:57 [async_llm.py:261] Added request cmpl-c9ca48ca859145dbb33381181462da11-0.
INFO 03-02 01:17:58 [loggers.py:116] Engine 000: Avg prompt throughput: 24.8 tokens/s, Avg generation throughput: 4.0 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.6%
INFO 03-02 01:17:58 [logger.py:42] Received request cmpl-d04cf8c31c174e6490b5f01c01f37373-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:17:58 [async_llm.py:261] Added request cmpl-d04cf8c31c174e6490b5f01c01f37373-0.
INFO 03-02 01:17:59 [logger.py:42] Received request cmpl-72e380f782864dc8a5c086de8ad19878-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:17:59 [async_llm.py:261] Added request cmpl-72e380f782864dc8a5c086de8ad19878-0.
INFO 03-02 01:18:00 [logger.py:42] Received request cmpl-1b6dae10395d4385ac7ff3eb2a2a1f65-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:18:00 [async_llm.py:261] Added request cmpl-1b6dae10395d4385ac7ff3eb2a2a1f65-0.
INFO 03-02 01:18:01 [logger.py:42] Received request cmpl-cae18c1157be49108eca810ddcd797e2-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:18:01 [async_llm.py:261] Added request cmpl-cae18c1157be49108eca810ddcd797e2-0.
INFO 03-02 01:18:02 [logger.py:42] Received request cmpl-6b1cac2d4a704754885c7fa1bed3b874-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:18:02 [async_llm.py:261] Added request cmpl-6b1cac2d4a704754885c7fa1bed3b874-0.
INFO 03-02 01:18:04 [logger.py:42] Received request cmpl-c19a53cdec1a4d19afd2c201bcb16e4b-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:18:04 [async_llm.py:261] Added request cmpl-c19a53cdec1a4d19afd2c201bcb16e4b-0.
INFO 03-02 01:18:05 [logger.py:42] Received request cmpl-de1e22068c4b4aa8b0e04a3d3b915fb1-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:18:05 [async_llm.py:261] Added request cmpl-de1e22068c4b4aa8b0e04a3d3b915fb1-0.
INFO 03-02 01:18:06 [logger.py:42] Received request cmpl-ac3cab13fbad47a98d7083c5c31b0ae3-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:18:06 [async_llm.py:261] Added request cmpl-ac3cab13fbad47a98d7083c5c31b0ae3-0.
INFO 03-02 01:18:07 [logger.py:42] Received request cmpl-b9dc50f7cca94a01b44632ebaa158447-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:18:07 [async_llm.py:261] Added request cmpl-b9dc50f7cca94a01b44632ebaa158447-0.
INFO 03-02 01:18:08 [loggers.py:116] Engine 000: Avg prompt throughput: 27.9 tokens/s, Avg generation throughput: 4.5 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.6%
INFO 03-02 01:18:08 [logger.py:42] Received request cmpl-e9d98be20e2144c4b5edd0eb4a47c546-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:18:08 [async_llm.py:261] Added request cmpl-e9d98be20e2144c4b5edd0eb4a47c546-0.
INFO 03-02 01:18:09 [logger.py:42] Received request cmpl-d44f4322c3be455192ffcf9f98fefbfa-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:18:09 [async_llm.py:261] Added request cmpl-d44f4322c3be455192ffcf9f98fefbfa-0.
INFO 03-02 01:18:10 [logger.py:42] Received request cmpl-e51f1cf510a94faea7fa8b91cb25af87-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:18:10 [async_llm.py:261] Added request cmpl-e51f1cf510a94faea7fa8b91cb25af87-0.
INFO 03-02 01:18:12 [logger.py:42] Received request cmpl-05bc5341ac1f4bc8a6851821123fa683-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:18:12 [async_llm.py:261] Added request cmpl-05bc5341ac1f4bc8a6851821123fa683-0.
INFO 03-02 01:18:13 [logger.py:42] Received request cmpl-2bfe453a8b1e4a619e87a1a89f209f4b-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:18:13 [async_llm.py:261] Added request cmpl-2bfe453a8b1e4a619e87a1a89f209f4b-0.
INFO 03-02 01:18:14 [logger.py:42] Received request cmpl-5b69be579bd14fe6813efadec3582c1d-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:18:14 [async_llm.py:261] Added request cmpl-5b69be579bd14fe6813efadec3582c1d-0.
INFO 03-02 01:18:15 [logger.py:42] Received request cmpl-3bd1095986a84dd5973fb8a3858060bc-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:18:15 [async_llm.py:261] Added request cmpl-3bd1095986a84dd5973fb8a3858060bc-0.
INFO 03-02 01:18:16 [logger.py:42] Received request cmpl-9054af90a19d4f2b983bf84dedef9a9e-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:18:16 [async_llm.py:261] Added request cmpl-9054af90a19d4f2b983bf84dedef9a9e-0.
INFO 03-02 01:18:17 [logger.py:42] Received request cmpl-79dc7bf50abd4608a52bf0658f1141c6-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:18:17 [async_llm.py:261] Added request cmpl-79dc7bf50abd4608a52bf0658f1141c6-0.
INFO 03-02 01:18:18 [loggers.py:116] Engine 000: Avg prompt throughput: 27.9 tokens/s, Avg generation throughput: 4.5 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.6%
INFO 03-02 01:18:19 [logger.py:42] Received request cmpl-a897490c9cf04fd9bd177ec3cc78f753-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:18:19 [async_llm.py:261] Added request cmpl-a897490c9cf04fd9bd177ec3cc78f753-0.
INFO 03-02 01:18:20 [logger.py:42] Received request cmpl-a1905a42b40042c59fbbebc721e7f8c5-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:18:20 [async_llm.py:261] Added request cmpl-a1905a42b40042c59fbbebc721e7f8c5-0.
INFO 03-02 01:18:21 [logger.py:42] Received request cmpl-6931298c05244e3b8b79fa17d383b759-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:18:21 [async_llm.py:261] Added request cmpl-6931298c05244e3b8b79fa17d383b759-0.
INFO 03-02 01:18:22 [logger.py:42] Received request cmpl-5b6ab99367284c40a04db661a357f260-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:18:22 [async_llm.py:261] Added request cmpl-5b6ab99367284c40a04db661a357f260-0.
INFO 03-02 01:18:23 [logger.py:42] Received request cmpl-4cbb1a685b56491cb2b36e13595dba86-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:18:23 [async_llm.py:261] Added request cmpl-4cbb1a685b56491cb2b36e13595dba86-0.
INFO 03-02 01:18:24 [logger.py:42] Received request cmpl-56b6ebb202e847ad988211d5748e73c0-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:18:24 [async_llm.py:261] Added request cmpl-56b6ebb202e847ad988211d5748e73c0-0.
INFO 03-02 01:18:25 [logger.py:42] Received request cmpl-3010cd6194334f30aab3157f510efa57-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:18:25 [async_llm.py:261] Added request cmpl-3010cd6194334f30aab3157f510efa57-0.
INFO 03-02 01:18:27 [logger.py:42] Received request cmpl-b98fd469d5f24f2389ad9da9e9ea33b3-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:18:27 [async_llm.py:261] Added request cmpl-b98fd469d5f24f2389ad9da9e9ea33b3-0.
INFO 03-02 01:18:28 [loggers.py:116] Engine 000: Avg prompt throughput: 24.8 tokens/s, Avg generation throughput: 4.0 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.6%
INFO 03-02 01:18:28 [logger.py:42] Received request cmpl-c7c045d4974b46ca904de2457546300c-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:18:28 [async_llm.py:261] Added request cmpl-c7c045d4974b46ca904de2457546300c-0.
INFO 03-02 01:18:29 [logger.py:42] Received request cmpl-bc8a9679c1b946f6976d63f396a586fc-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:18:29 [async_llm.py:261] Added request cmpl-bc8a9679c1b946f6976d63f396a586fc-0.
INFO 03-02 01:18:30 [logger.py:42] Received request cmpl-dfaf7154196c484e916e7abb40f6da93-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:18:30 [async_llm.py:261] Added request cmpl-dfaf7154196c484e916e7abb40f6da93-0.
INFO 03-02 01:18:31 [logger.py:42] Received request cmpl-056eb2356cd249a0938c53a7fa3f4227-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:18:31 [async_llm.py:261] Added request cmpl-056eb2356cd249a0938c53a7fa3f4227-0.
INFO 03-02 01:18:32 [logger.py:42] Received request cmpl-6df20e654a9c4f40b11cca9287567279-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:18:32 [async_llm.py:261] Added request cmpl-6df20e654a9c4f40b11cca9287567279-0.
INFO 03-02 01:18:34 [logger.py:42] Received request cmpl-877ffaf4fc5f44aaa965f39ab23f278b-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:18:34 [async_llm.py:261] Added request cmpl-877ffaf4fc5f44aaa965f39ab23f278b-0.
INFO 03-02 01:18:35 [logger.py:42] Received request cmpl-1f8d5bece9a5493588901cbf38773c8c-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:18:35 [async_llm.py:261] Added request cmpl-1f8d5bece9a5493588901cbf38773c8c-0.
INFO 03-02 01:18:36 [logger.py:42] Received request cmpl-d489f9b04c274d0ca2839b9bd5f37afd-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:18:36 [async_llm.py:261] Added request cmpl-d489f9b04c274d0ca2839b9bd5f37afd-0.
INFO 03-02 01:18:37 [logger.py:42] Received request cmpl-2e996af5248d406ba35e1040ef6f38f2-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:18:37 [async_llm.py:261] Added request cmpl-2e996af5248d406ba35e1040ef6f38f2-0.
INFO 03-02 01:18:38 [loggers.py:116] Engine 000: Avg prompt throughput: 27.9 tokens/s, Avg generation throughput: 4.5 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.6%
INFO 03-02 01:18:38 [logger.py:42] Received request cmpl-16e54e6febae4928bb6dd1ac7e73b575-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:18:38 [async_llm.py:261] Added request cmpl-16e54e6febae4928bb6dd1ac7e73b575-0.
INFO 03-02 01:18:39 [logger.py:42] Received request cmpl-008a1ebdcb9e47dbb3cf3a1c289795c4-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:18:39 [async_llm.py:261] Added request cmpl-008a1ebdcb9e47dbb3cf3a1c289795c4-0.
INFO 03-02 01:18:40 [logger.py:42] Received request cmpl-b47384eeb3b843278e26880f6a8e788e-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:18:40 [async_llm.py:261] Added request cmpl-b47384eeb3b843278e26880f6a8e788e-0.
INFO 03-02 01:18:42 [logger.py:42] Received request cmpl-43b316aa3c504989926757385813364f-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:18:42 [async_llm.py:261] Added request cmpl-43b316aa3c504989926757385813364f-0.
INFO 03-02 01:18:43 [logger.py:42] Received request cmpl-b936aab5ead741eb87fd0d28fa9475e9-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:18:43 [async_llm.py:261] Added request cmpl-b936aab5ead741eb87fd0d28fa9475e9-0.
INFO 03-02 01:18:44 [logger.py:42] Received request cmpl-0365f189980446519ab8fc472a605677-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:18:44 [async_llm.py:261] Added request cmpl-0365f189980446519ab8fc472a605677-0.
INFO 03-02 01:18:45 [logger.py:42] Received request cmpl-fb4739752e96476495114cee119330e0-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:18:45 [async_llm.py:261] Added request cmpl-fb4739752e96476495114cee119330e0-0.
INFO 03-02 01:18:46 [logger.py:42] Received request cmpl-4cf7b679f58a46a6be5ccd8e38b7d3ae-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:18:46 [async_llm.py:261] Added request cmpl-4cf7b679f58a46a6be5ccd8e38b7d3ae-0.
INFO 03-02 01:18:47 [logger.py:42] Received request cmpl-1affe09dfca54b47ac916c050fd59f20-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:18:47 [async_llm.py:261] Added request cmpl-1affe09dfca54b47ac916c050fd59f20-0.
INFO 03-02 01:18:48 [loggers.py:116] Engine 000: Avg prompt throughput: 27.9 tokens/s, Avg generation throughput: 4.5 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.6%
INFO 03-02 01:18:49 [logger.py:42] Received request cmpl-95a6637c25dd446a80d41cc8f0a8813f-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:18:49 [async_llm.py:261] Added request cmpl-95a6637c25dd446a80d41cc8f0a8813f-0.
INFO 03-02 01:18:50 [logger.py:42] Received request cmpl-2882cb5573f54e0291004e6044194189-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:18:50 [async_llm.py:261] Added request cmpl-2882cb5573f54e0291004e6044194189-0.
INFO 03-02 01:18:51 [logger.py:42] Received request cmpl-d442017c801d4e879ef84fbb180dedcb-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:18:51 [async_llm.py:261] Added request cmpl-d442017c801d4e879ef84fbb180dedcb-0.
INFO 03-02 01:18:52 [logger.py:42] Received request cmpl-5ba9ee63e55c43c6b9d6344547eb51f5-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:18:52 [async_llm.py:261] Added request cmpl-5ba9ee63e55c43c6b9d6344547eb51f5-0.
INFO 03-02 01:18:53 [logger.py:42] Received request cmpl-40a534c1d70b4a3387679b0292246ddd-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:18:53 [async_llm.py:261] Added request cmpl-40a534c1d70b4a3387679b0292246ddd-0.
INFO 03-02 01:18:54 [logger.py:42] Received request cmpl-158be6abe9e74982b663dd4bc11e518f-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:18:54 [async_llm.py:261] Added request cmpl-158be6abe9e74982b663dd4bc11e518f-0.
INFO 03-02 01:18:55 [logger.py:42] Received request cmpl-be776c8b14de426aad08088de86d65d9-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:18:55 [async_llm.py:261] Added request cmpl-be776c8b14de426aad08088de86d65d9-0.
INFO 03-02 01:18:57 [logger.py:42] Received request cmpl-4dd2a2f67c9f4a8d92cc3354538dc7d3-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:18:57 [async_llm.py:261] Added request cmpl-4dd2a2f67c9f4a8d92cc3354538dc7d3-0.
INFO 03-02 01:18:58 [loggers.py:116] Engine 000: Avg prompt throughput: 24.8 tokens/s, Avg generation throughput: 4.0 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.6%
INFO 03-02 01:18:58 [logger.py:42] Received request cmpl-7c47e761fbce4dc7a134e4288e2150f8-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:18:58 [async_llm.py:261] Added request cmpl-7c47e761fbce4dc7a134e4288e2150f8-0.
INFO 03-02 01:18:59 [logger.py:42] Received request cmpl-8e307816d1504856a5259a2469e46f32-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:18:59 [async_llm.py:261] Added request cmpl-8e307816d1504856a5259a2469e46f32-0.
INFO 03-02 01:19:00 [logger.py:42] Received request cmpl-61220db77f614404af8a127a77361622-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:19:00 [async_llm.py:261] Added request cmpl-61220db77f614404af8a127a77361622-0.
INFO 03-02 01:19:01 [logger.py:42] Received request cmpl-329657ca070d4fc8a223a0b92ca2ef9f-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:19:01 [async_llm.py:261] Added request cmpl-329657ca070d4fc8a223a0b92ca2ef9f-0.
INFO 03-02 01:19:02 [logger.py:42] Received request cmpl-403ff79e993a49b1bdb7ceb29449fd4a-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:19:02 [async_llm.py:261] Added request cmpl-403ff79e993a49b1bdb7ceb29449fd4a-0.
INFO 03-02 01:19:04 [logger.py:42] Received request cmpl-58cddf449ef24768a95953d17b7abca4-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:19:04 [async_llm.py:261] Added request cmpl-58cddf449ef24768a95953d17b7abca4-0.
INFO 03-02 01:19:05 [logger.py:42] Received request cmpl-5a9939ced8a34792b732b309a4f5639e-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:19:05 [async_llm.py:261] Added request cmpl-5a9939ced8a34792b732b309a4f5639e-0.
INFO 03-02 01:19:06 [logger.py:42] Received request cmpl-12184d84577242258087da8fd793cdf0-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:19:06 [async_llm.py:261] Added request cmpl-12184d84577242258087da8fd793cdf0-0.
INFO 03-02 01:19:07 [logger.py:42] Received request cmpl-8941881ccef64130ab139bcd821073e8-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:19:07 [async_llm.py:261] Added request cmpl-8941881ccef64130ab139bcd821073e8-0.
INFO 03-02 01:19:08 [loggers.py:116] Engine 000: Avg prompt throughput: 27.9 tokens/s, Avg generation throughput: 4.5 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.6%
INFO 03-02 01:19:08 [logger.py:42] Received request cmpl-f1848b7906324fc8a2ef27800d2e9387-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:19:08 [async_llm.py:261] Added request cmpl-f1848b7906324fc8a2ef27800d2e9387-0.
INFO 03-02 01:19:09 [logger.py:42] Received request cmpl-fac0d179197c4826939150fae4048478-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:19:09 [async_llm.py:261] Added request cmpl-fac0d179197c4826939150fae4048478-0.
INFO 03-02 01:19:10 [logger.py:42] Received request cmpl-d4aa1b77ed174ff58d2fbac006ddf1f1-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:19:10 [async_llm.py:261] Added request cmpl-d4aa1b77ed174ff58d2fbac006ddf1f1-0.
INFO 03-02 01:19:12 [logger.py:42] Received request cmpl-e2346603bbb34da8808a290dfa9abbc6-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:19:12 [async_llm.py:261] Added request cmpl-e2346603bbb34da8808a290dfa9abbc6-0.
INFO 03-02 01:19:13 [logger.py:42] Received request cmpl-3fad3e8a41d84d039699a311e710cbbc-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:19:13 [async_llm.py:261] Added request cmpl-3fad3e8a41d84d039699a311e710cbbc-0.
INFO 03-02 01:19:14 [logger.py:42] Received request cmpl-a34e9592f3df407cbf2b8d14308bf58b-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:19:14 [async_llm.py:261] Added request cmpl-a34e9592f3df407cbf2b8d14308bf58b-0.
INFO 03-02 01:19:15 [logger.py:42] Received request cmpl-552bf9d43f324a898dd6ea93201061d0-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:19:15 [async_llm.py:261] Added request cmpl-552bf9d43f324a898dd6ea93201061d0-0.
INFO 03-02 01:19:16 [logger.py:42] Received request cmpl-62158550cf9249689db157a0936b1aad-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:19:16 [async_llm.py:261] Added request cmpl-62158550cf9249689db157a0936b1aad-0.
INFO 03-02 01:19:17 [logger.py:42] Received request cmpl-3da45df96f044db88656025848773ab2-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:19:17 [async_llm.py:261] Added request cmpl-3da45df96f044db88656025848773ab2-0.
INFO 03-02 01:19:18 [loggers.py:116] Engine 000: Avg prompt throughput: 27.9 tokens/s, Avg generation throughput: 4.5 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.6%
INFO 03-02 01:19:19 [logger.py:42] Received request cmpl-7fa999e7fe914d7a8ad0535963053eb5-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:19:19 [async_llm.py:261] Added request cmpl-7fa999e7fe914d7a8ad0535963053eb5-0.
INFO 03-02 01:19:20 [logger.py:42] Received request cmpl-16c0acc82311485ba4a662ac06378381-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:19:20 [async_llm.py:261] Added request cmpl-16c0acc82311485ba4a662ac06378381-0.
INFO 03-02 01:19:21 [logger.py:42] Received request cmpl-6d694ea67f684664810a883de52a80b0-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:19:21 [async_llm.py:261] Added request cmpl-6d694ea67f684664810a883de52a80b0-0.
INFO 03-02 01:19:22 [logger.py:42] Received request cmpl-38fbf8727cb14043854ca7d480e0c975-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:19:22 [async_llm.py:261] Added request cmpl-38fbf8727cb14043854ca7d480e0c975-0.
INFO 03-02 01:19:23 [logger.py:42] Received request cmpl-7f57b72285374ed1ac474a066baf45a2-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:19:23 [async_llm.py:261] Added request cmpl-7f57b72285374ed1ac474a066baf45a2-0.
INFO 03-02 01:19:24 [logger.py:42] Received request cmpl-b782d52e8ea34df3a25e133ea3ae9589-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:19:24 [async_llm.py:261] Added request cmpl-b782d52e8ea34df3a25e133ea3ae9589-0.
INFO 03-02 01:19:25 [logger.py:42] Received request cmpl-1380af6b504c4839890a3291168b70dd-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:19:25 [async_llm.py:261] Added request cmpl-1380af6b504c4839890a3291168b70dd-0.
INFO 03-02 01:19:27 [logger.py:42] Received request cmpl-fec38dcc89d14913bb0023cdfa831bc1-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:19:27 [async_llm.py:261] Added request cmpl-fec38dcc89d14913bb0023cdfa831bc1-0.
INFO 03-02 01:19:28 [loggers.py:116] Engine 000: Avg prompt throughput: 24.8 tokens/s, Avg generation throughput: 4.0 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.6%
INFO 03-02 01:19:28 [logger.py:42] Received request cmpl-56a84dc0273342ecbef6846796aa4be5-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:19:28 [async_llm.py:261] Added request cmpl-56a84dc0273342ecbef6846796aa4be5-0.
INFO 03-02 01:19:29 [logger.py:42] Received request cmpl-96364ee228554e9f96b88cdebbe4696f-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:19:29 [async_llm.py:261] Added request cmpl-96364ee228554e9f96b88cdebbe4696f-0.
INFO 03-02 01:19:30 [logger.py:42] Received request cmpl-7ccb3288abc2489ba5186f753f9c460f-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:19:30 [async_llm.py:261] Added request cmpl-7ccb3288abc2489ba5186f753f9c460f-0.
INFO 03-02 01:19:31 [logger.py:42] Received request cmpl-c5dc672dbfe2478bb3406bd1e11927e7-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:19:31 [async_llm.py:261] Added request cmpl-c5dc672dbfe2478bb3406bd1e11927e7-0.
INFO 03-02 01:19:32 [logger.py:42] Received request cmpl-2f9120110da14d729dbcd6943404364a-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:19:32 [async_llm.py:261] Added request cmpl-2f9120110da14d729dbcd6943404364a-0.
INFO 03-02 01:19:34 [logger.py:42] Received request cmpl-0d9cd15af9e34edf827c0b23bf430fd9-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:19:34 [async_llm.py:261] Added request cmpl-0d9cd15af9e34edf827c0b23bf430fd9-0.
INFO 03-02 01:19:35 [logger.py:42] Received request cmpl-c40d8cf0005641e7ab325a439a3afd8a-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:19:35 [async_llm.py:261] Added request cmpl-c40d8cf0005641e7ab325a439a3afd8a-0.
INFO 03-02 01:19:36 [logger.py:42] Received request cmpl-90e2c95c8e894df784de0c85fc2a936e-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:19:36 [async_llm.py:261] Added request cmpl-90e2c95c8e894df784de0c85fc2a936e-0.
INFO 03-02 01:19:37 [logger.py:42] Received request cmpl-88098325a5a94f1383aeab651811dc4a-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:19:37 [async_llm.py:261] Added request cmpl-88098325a5a94f1383aeab651811dc4a-0.
INFO 03-02 01:19:38 [loggers.py:116] Engine 000: Avg prompt throughput: 27.9 tokens/s, Avg generation throughput: 4.5 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.6%
INFO 03-02 01:19:38 [logger.py:42] Received request cmpl-253b22ae138b4c97b7605de848942a1f-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:19:38 [async_llm.py:261] Added request cmpl-253b22ae138b4c97b7605de848942a1f-0.
INFO 03-02 01:19:39 [logger.py:42] Received request cmpl-26e489d9b15f4420abd818605e931b9a-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:19:39 [async_llm.py:261] Added request cmpl-26e489d9b15f4420abd818605e931b9a-0.
INFO 03-02 01:19:40 [logger.py:42] Received request cmpl-6bcd56f4112e4a3b806d542af3cd8248-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:19:40 [async_llm.py:261] Added request cmpl-6bcd56f4112e4a3b806d542af3cd8248-0.
INFO 03-02 01:19:42 [logger.py:42] Received request cmpl-89beac156eeb43acaf98d55a22abcba9-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:19:42 [async_llm.py:261] Added request cmpl-89beac156eeb43acaf98d55a22abcba9-0.
INFO 03-02 01:19:43 [logger.py:42] Received request cmpl-efb06c13642544deaf4f9c02f8e54d04-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:19:43 [async_llm.py:261] Added request cmpl-efb06c13642544deaf4f9c02f8e54d04-0.
INFO 03-02 01:19:44 [logger.py:42] Received request cmpl-1dba55ae09ae4e42b4c654bdb5e8006e-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:19:44 [async_llm.py:261] Added request cmpl-1dba55ae09ae4e42b4c654bdb5e8006e-0.
INFO 03-02 01:19:45 [logger.py:42] Received request cmpl-91f58b2f1e194252916498d1e3e6a0b8-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:19:45 [async_llm.py:261] Added request cmpl-91f58b2f1e194252916498d1e3e6a0b8-0.
INFO 03-02 01:19:46 [logger.py:42] Received request cmpl-8a3072f8b5784a9fb0c0829c97f0d721-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:19:46 [async_llm.py:261] Added request cmpl-8a3072f8b5784a9fb0c0829c97f0d721-0.
INFO 03-02 01:19:47 [logger.py:42] Received request cmpl-594a166b49a14f92819ea60019bae96b-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:19:47 [async_llm.py:261] Added request cmpl-594a166b49a14f92819ea60019bae96b-0.
INFO 03-02 01:19:48 [loggers.py:116] Engine 000: Avg prompt throughput: 27.9 tokens/s, Avg generation throughput: 4.5 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.6%
INFO 03-02 01:19:49 [logger.py:42] Received request cmpl-2de35fdaef1847e2ad43cc0e7844c61c-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:19:49 [async_llm.py:261] Added request cmpl-2de35fdaef1847e2ad43cc0e7844c61c-0.
INFO 03-02 01:19:50 [logger.py:42] Received request cmpl-3fa2dca9a2cb4a159dd57d51a044f4be-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:19:50 [async_llm.py:261] Added request cmpl-3fa2dca9a2cb4a159dd57d51a044f4be-0.
INFO 03-02 01:19:51 [logger.py:42] Received request cmpl-e6f59228fe8d4183b3aca4eb18510119-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:19:51 [async_llm.py:261] Added request cmpl-e6f59228fe8d4183b3aca4eb18510119-0.
INFO 03-02 01:19:52 [logger.py:42] Received request cmpl-4563859e3d3c4f21b7bd9842c372ab1a-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:19:52 [async_llm.py:261] Added request cmpl-4563859e3d3c4f21b7bd9842c372ab1a-0.
INFO 03-02 01:19:53 [logger.py:42] Received request cmpl-2f361322af2448c28bea5d2554018f4c-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:19:53 [async_llm.py:261] Added request cmpl-2f361322af2448c28bea5d2554018f4c-0.
INFO 03-02 01:19:54 [logger.py:42] Received request cmpl-b7f1dda47807475f9c487705817fd997-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:19:54 [async_llm.py:261] Added request cmpl-b7f1dda47807475f9c487705817fd997-0.
INFO 03-02 01:19:55 [logger.py:42] Received request cmpl-1b114b1e017b4a8c9d3a1d9e049dfb18-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:19:55 [async_llm.py:261] Added request cmpl-1b114b1e017b4a8c9d3a1d9e049dfb18-0.
INFO 03-02 01:19:57 [logger.py:42] Received request cmpl-877fd8339870418198200a885b5a86c1-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:19:57 [async_llm.py:261] Added request cmpl-877fd8339870418198200a885b5a86c1-0.
INFO 03-02 01:19:58 [loggers.py:116] Engine 000: Avg prompt throughput: 24.8 tokens/s, Avg generation throughput: 4.0 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.6%
INFO 03-02 01:19:58 [logger.py:42] Received request cmpl-e96736f99de6409fa6907176772f8d29-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:19:58 [async_llm.py:261] Added request cmpl-e96736f99de6409fa6907176772f8d29-0.
INFO 03-02 01:19:59 [logger.py:42] Received request cmpl-4a62146f34b94a9e98e3de402bd0972c-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:19:59 [async_llm.py:261] Added request cmpl-4a62146f34b94a9e98e3de402bd0972c-0.
INFO 03-02 01:20:00 [logger.py:42] Received request cmpl-7eba6dc94a424186843051acca0396bb-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:20:00 [async_llm.py:261] Added request cmpl-7eba6dc94a424186843051acca0396bb-0.
INFO 03-02 01:20:01 [logger.py:42] Received request cmpl-1e68f76f7cfe4388878c4aac36cfc245-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:20:01 [async_llm.py:261] Added request cmpl-1e68f76f7cfe4388878c4aac36cfc245-0.
INFO 03-02 01:20:02 [logger.py:42] Received request cmpl-00ac4c95511a45749e2efa97e99862b0-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:20:02 [async_llm.py:261] Added request cmpl-00ac4c95511a45749e2efa97e99862b0-0.
INFO 03-02 01:20:04 [logger.py:42] Received request cmpl-596e76f0528e4387b1769b1ad09b4352-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:20:04 [async_llm.py:261] Added request cmpl-596e76f0528e4387b1769b1ad09b4352-0.
INFO 03-02 01:20:05 [logger.py:42] Received request cmpl-d8beae867b2c462695994a943a541b18-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:20:05 [async_llm.py:261] Added request cmpl-d8beae867b2c462695994a943a541b18-0.
INFO 03-02 01:20:06 [logger.py:42] Received request cmpl-44feb62ec80042cca274f0dc8b21d11c-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:20:06 [async_llm.py:261] Added request cmpl-44feb62ec80042cca274f0dc8b21d11c-0.
INFO 03-02 01:20:07 [logger.py:42] Received request cmpl-b4851ae5168a4b1a93232dcafb477d76-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:20:07 [async_llm.py:261] Added request cmpl-b4851ae5168a4b1a93232dcafb477d76-0.
INFO 03-02 01:20:08 [loggers.py:116] Engine 000: Avg prompt throughput: 27.9 tokens/s, Avg generation throughput: 4.5 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.6%
INFO 03-02 01:20:08 [logger.py:42] Received request cmpl-754664ec09a54534a7f0835de7e6954c-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:20:08 [async_llm.py:261] Added request cmpl-754664ec09a54534a7f0835de7e6954c-0.
INFO 03-02 01:20:09 [logger.py:42] Received request cmpl-738e7204000f4890a0b0f6e0eae21186-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:20:09 [async_llm.py:261] Added request cmpl-738e7204000f4890a0b0f6e0eae21186-0.
INFO 03-02 01:20:10 [logger.py:42] Received request cmpl-a15cf010f4654240ac90ab577d2826f1-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:20:10 [async_llm.py:261] Added request cmpl-a15cf010f4654240ac90ab577d2826f1-0.
INFO 03-02 01:20:12 [logger.py:42] Received request cmpl-16ee33bd6db34ba19047aadff38790b1-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:20:12 [async_llm.py:261] Added request cmpl-16ee33bd6db34ba19047aadff38790b1-0.
INFO 03-02 01:20:13 [logger.py:42] Received request cmpl-6ac3ba1b665c45be885848b67cd9a836-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:20:13 [async_llm.py:261] Added request cmpl-6ac3ba1b665c45be885848b67cd9a836-0.
INFO 03-02 01:20:14 [logger.py:42] Received request cmpl-7860b5d6f921400f8393da7eb9ada9d8-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:20:14 [async_llm.py:261] Added request cmpl-7860b5d6f921400f8393da7eb9ada9d8-0.
INFO 03-02 01:20:15 [logger.py:42] Received request cmpl-94531dd7dac44041a7d19a1c53c28f6a-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:20:15 [async_llm.py:261] Added request cmpl-94531dd7dac44041a7d19a1c53c28f6a-0.
INFO 03-02 01:20:16 [logger.py:42] Received request cmpl-68b1e601b85b4beab5e4e0d556ee682a-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:20:16 [async_llm.py:261] Added request cmpl-68b1e601b85b4beab5e4e0d556ee682a-0.
INFO 03-02 01:20:17 [logger.py:42] Received request cmpl-1447cef6b1994bff9d0c92dfc8e7c8bd-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:20:17 [async_llm.py:261] Added request cmpl-1447cef6b1994bff9d0c92dfc8e7c8bd-0.
INFO 03-02 01:20:18 [loggers.py:116] Engine 000: Avg prompt throughput: 27.9 tokens/s, Avg generation throughput: 4.5 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.6%
INFO 03-02 01:20:19 [logger.py:42] Received request cmpl-246c4d9a0117491cab39cf91473bba3c-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:20:19 [async_llm.py:261] Added request cmpl-246c4d9a0117491cab39cf91473bba3c-0.
INFO 03-02 01:20:20 [logger.py:42] Received request cmpl-6e9055373d83412a8131b4c22c9612c5-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:20:20 [async_llm.py:261] Added request cmpl-6e9055373d83412a8131b4c22c9612c5-0.
INFO 03-02 01:20:21 [logger.py:42] Received request cmpl-6d761aad25064e6582c4058ed692d9bb-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:20:21 [async_llm.py:261] Added request cmpl-6d761aad25064e6582c4058ed692d9bb-0.
INFO 03-02 01:20:22 [logger.py:42] Received request cmpl-c5e531f5e70840fb80cec7062f3211b9-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:20:22 [async_llm.py:261] Added request cmpl-c5e531f5e70840fb80cec7062f3211b9-0.
INFO 03-02 01:20:23 [logger.py:42] Received request cmpl-99dbde4169b5420d818c64ca3733b652-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:20:23 [async_llm.py:261] Added request cmpl-99dbde4169b5420d818c64ca3733b652-0.
INFO 03-02 01:20:24 [logger.py:42] Received request cmpl-c5abdbbec7864a9f9289f93073594d06-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:20:24 [async_llm.py:261] Added request cmpl-c5abdbbec7864a9f9289f93073594d06-0.
INFO 03-02 01:20:25 [logger.py:42] Received request cmpl-92652d66c0a74e1b857871e656aecf75-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:20:25 [async_llm.py:261] Added request cmpl-92652d66c0a74e1b857871e656aecf75-0.
INFO 03-02 01:20:27 [logger.py:42] Received request cmpl-1eefd5f3ab4f4cfdbd42d7de1781b5f2-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:20:27 [async_llm.py:261] Added request cmpl-1eefd5f3ab4f4cfdbd42d7de1781b5f2-0.
INFO 03-02 01:20:28 [loggers.py:116] Engine 000: Avg prompt throughput: 24.8 tokens/s, Avg generation throughput: 4.0 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.6%
INFO 03-02 01:20:28 [logger.py:42] Received request cmpl-a20c6fb05d874c50b000c35aa96db0a5-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:20:28 [async_llm.py:261] Added request cmpl-a20c6fb05d874c50b000c35aa96db0a5-0.
INFO 03-02 01:20:29 [logger.py:42] Received request cmpl-d46c98d372ed4aea9cb6bf43ae49decd-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:20:29 [async_llm.py:261] Added request cmpl-d46c98d372ed4aea9cb6bf43ae49decd-0.
INFO 03-02 01:20:30 [logger.py:42] Received request cmpl-f26a208086d24279b2fe2da4ff1d820b-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:20:30 [async_llm.py:261] Added request cmpl-f26a208086d24279b2fe2da4ff1d820b-0.
INFO 03-02 01:20:31 [logger.py:42] Received request cmpl-492c317ffe5a41929a9b036c90be8ff6-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:20:31 [async_llm.py:261] Added request cmpl-492c317ffe5a41929a9b036c90be8ff6-0.
INFO 03-02 01:20:32 [logger.py:42] Received request cmpl-332c326951f54f40b73409cae7c0163b-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:20:32 [async_llm.py:261] Added request cmpl-332c326951f54f40b73409cae7c0163b-0.
INFO 03-02 01:20:34 [logger.py:42] Received request cmpl-20b1ef9c65ca417580e43ffb61a8e166-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:20:34 [async_llm.py:261] Added request cmpl-20b1ef9c65ca417580e43ffb61a8e166-0.
INFO 03-02 01:20:35 [logger.py:42] Received request cmpl-2fc7862fa0ad49b0a209f695db267630-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:20:35 [async_llm.py:261] Added request cmpl-2fc7862fa0ad49b0a209f695db267630-0.
INFO 03-02 01:20:36 [logger.py:42] Received request cmpl-31ad124e16aa44119ebd8c8a388277a8-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:20:36 [async_llm.py:261] Added request cmpl-31ad124e16aa44119ebd8c8a388277a8-0.
INFO 03-02 01:20:37 [logger.py:42] Received request cmpl-ed155543609141c19965bc578a95241f-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:20:37 [async_llm.py:261] Added request cmpl-ed155543609141c19965bc578a95241f-0.
INFO 03-02 01:20:38 [loggers.py:116] Engine 000: Avg prompt throughput: 27.9 tokens/s, Avg generation throughput: 4.5 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.6%
INFO 03-02 01:20:38 [logger.py:42] Received request cmpl-b690ef0455bc469cbe3434f024f6ef43-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:20:38 [async_llm.py:261] Added request cmpl-b690ef0455bc469cbe3434f024f6ef43-0.
INFO 03-02 01:20:39 [logger.py:42] Received request cmpl-0fa5ccdd0eb74e13944c9d027237c798-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:20:39 [async_llm.py:261] Added request cmpl-0fa5ccdd0eb74e13944c9d027237c798-0.
INFO 03-02 01:20:40 [logger.py:42] Received request cmpl-55f2edb59bd644e8a88f45cfe6f6dc72-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:20:40 [async_llm.py:261] Added request cmpl-55f2edb59bd644e8a88f45cfe6f6dc72-0.
INFO 03-02 01:20:42 [logger.py:42] Received request cmpl-c0fd56f613724b12864abab3389decc7-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:20:42 [async_llm.py:261] Added request cmpl-c0fd56f613724b12864abab3389decc7-0.
INFO 03-02 01:20:43 [logger.py:42] Received request cmpl-cb0cff9bd12743068cf5a8436ecd25bd-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:20:43 [async_llm.py:261] Added request cmpl-cb0cff9bd12743068cf5a8436ecd25bd-0.
INFO 03-02 01:20:44 [logger.py:42] Received request cmpl-4bd8d6e8976842609a6cb1436989f59a-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:20:44 [async_llm.py:261] Added request cmpl-4bd8d6e8976842609a6cb1436989f59a-0.
INFO 03-02 01:20:45 [logger.py:42] Received request cmpl-ae1c4c9db0ca47cf9749f0968fa6e012-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:20:45 [async_llm.py:261] Added request cmpl-ae1c4c9db0ca47cf9749f0968fa6e012-0.
INFO 03-02 01:20:46 [logger.py:42] Received request cmpl-0f3739392e2e49d58c44c73bb56f6124-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:20:46 [async_llm.py:261] Added request cmpl-0f3739392e2e49d58c44c73bb56f6124-0.
INFO 03-02 01:20:47 [logger.py:42] Received request cmpl-1d87a405bf1e41d5a3afc23175a40944-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:20:47 [async_llm.py:261] Added request cmpl-1d87a405bf1e41d5a3afc23175a40944-0.
INFO 03-02 01:20:48 [loggers.py:116] Engine 000: Avg prompt throughput: 27.9 tokens/s, Avg generation throughput: 4.5 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.6%
INFO 03-02 01:20:49 [logger.py:42] Received request cmpl-75fd897bff314908932028580ee98a23-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:20:49 [async_llm.py:261] Added request cmpl-75fd897bff314908932028580ee98a23-0.
INFO 03-02 01:20:50 [logger.py:42] Received request cmpl-82c392dbffd74b639001398543f5c74a-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:20:50 [async_llm.py:261] Added request cmpl-82c392dbffd74b639001398543f5c74a-0.
INFO 03-02 01:20:51 [logger.py:42] Received request cmpl-4a3004f8151c4c4d95a08723b33e4c88-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:20:51 [async_llm.py:261] Added request cmpl-4a3004f8151c4c4d95a08723b33e4c88-0.
INFO 03-02 01:20:52 [logger.py:42] Received request cmpl-a29254865a384822ae0601a27bf3b1c1-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:20:52 [async_llm.py:261] Added request cmpl-a29254865a384822ae0601a27bf3b1c1-0.
INFO 03-02 01:20:53 [logger.py:42] Received request cmpl-be24db2648c74bbd884128909111ae9a-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:20:53 [async_llm.py:261] Added request cmpl-be24db2648c74bbd884128909111ae9a-0.
INFO 03-02 01:20:54 [logger.py:42] Received request cmpl-464253b28bcc46578a4e73d10e5ca835-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:20:54 [async_llm.py:261] Added request cmpl-464253b28bcc46578a4e73d10e5ca835-0.
INFO 03-02 01:20:55 [logger.py:42] Received request cmpl-7378399d410c412a8fccfbd30d001d4c-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:20:55 [async_llm.py:261] Added request cmpl-7378399d410c412a8fccfbd30d001d4c-0.
INFO 03-02 01:20:57 [logger.py:42] Received request cmpl-6e12217751a643df8791f5e8524e8415-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:20:57 [async_llm.py:261] Added request cmpl-6e12217751a643df8791f5e8524e8415-0.
INFO 03-02 01:20:58 [loggers.py:116] Engine 000: Avg prompt throughput: 24.8 tokens/s, Avg generation throughput: 4.0 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.6%
INFO 03-02 01:20:58 [logger.py:42] Received request cmpl-5516994fdf22488eb2d61e8823b81288-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:20:58 [async_llm.py:261] Added request cmpl-5516994fdf22488eb2d61e8823b81288-0.
INFO 03-02 01:20:59 [logger.py:42] Received request cmpl-8e0e36e8fc634d34be1540db38ee6550-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:20:59 [async_llm.py:261] Added request cmpl-8e0e36e8fc634d34be1540db38ee6550-0.
INFO 03-02 01:21:00 [logger.py:42] Received request cmpl-50b53466fee843d4afd5a4ae4f01c1ef-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:21:00 [async_llm.py:261] Added request cmpl-50b53466fee843d4afd5a4ae4f01c1ef-0.
INFO 03-02 01:21:01 [logger.py:42] Received request cmpl-9bd85acfe3db42328134a290f7f4ece5-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:21:01 [async_llm.py:261] Added request cmpl-9bd85acfe3db42328134a290f7f4ece5-0.
INFO 03-02 01:21:02 [logger.py:42] Received request cmpl-699dcf0a06fb411b8b251076762eb466-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:21:02 [async_llm.py:261] Added request cmpl-699dcf0a06fb411b8b251076762eb466-0.
INFO 03-02 01:21:04 [logger.py:42] Received request cmpl-f088d079a3e248db98102aac3902dad9-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:21:04 [async_llm.py:261] Added request cmpl-f088d079a3e248db98102aac3902dad9-0.
INFO 03-02 01:21:05 [logger.py:42] Received request cmpl-1fb706f9538f4a4baee0a47eeac580b4-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:21:05 [async_llm.py:261] Added request cmpl-1fb706f9538f4a4baee0a47eeac580b4-0.
INFO 03-02 01:21:06 [logger.py:42] Received request cmpl-d575644eced447088fc1d699a3abe583-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:21:06 [async_llm.py:261] Added request cmpl-d575644eced447088fc1d699a3abe583-0.
INFO 03-02 01:21:07 [logger.py:42] Received request cmpl-634f2cacf56c4361b92460db422db597-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:21:07 [async_llm.py:261] Added request cmpl-634f2cacf56c4361b92460db422db597-0.
INFO 03-02 01:21:08 [loggers.py:116] Engine 000: Avg prompt throughput: 27.9 tokens/s, Avg generation throughput: 4.5 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.6%
INFO 03-02 01:21:08 [logger.py:42] Received request cmpl-dfb09292f77e407c8c731fdda3c52bfb-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:21:08 [async_llm.py:261] Added request cmpl-dfb09292f77e407c8c731fdda3c52bfb-0.
INFO 03-02 01:21:09 [logger.py:42] Received request cmpl-1ab2c68e5b0e4242a5453e1294aa71bc-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:21:09 [async_llm.py:261] Added request cmpl-1ab2c68e5b0e4242a5453e1294aa71bc-0.
INFO 03-02 01:21:10 [logger.py:42] Received request cmpl-eed6dcf3dfa945b79c4fa09412f1f4a8-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:21:10 [async_llm.py:261] Added request cmpl-eed6dcf3dfa945b79c4fa09412f1f4a8-0.
INFO 03-02 01:21:12 [logger.py:42] Received request cmpl-2039967de2f0408f82da97e76920e5b6-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:21:12 [async_llm.py:261] Added request cmpl-2039967de2f0408f82da97e76920e5b6-0.
INFO 03-02 01:21:13 [logger.py:42] Received request cmpl-899b730498f241b8865adcdd9a7a66af-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:21:13 [async_llm.py:261] Added request cmpl-899b730498f241b8865adcdd9a7a66af-0.
INFO 03-02 01:21:14 [logger.py:42] Received request cmpl-0ceedc343d0346d192e2063a97593adb-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:21:14 [async_llm.py:261] Added request cmpl-0ceedc343d0346d192e2063a97593adb-0.
INFO 03-02 01:21:15 [logger.py:42] Received request cmpl-3bbf652bf2f642a7827f507826fa77e8-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:21:15 [async_llm.py:261] Added request cmpl-3bbf652bf2f642a7827f507826fa77e8-0.
INFO 03-02 01:21:16 [logger.py:42] Received request cmpl-3d25e18240e34dd29991b213202931eb-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:21:16 [async_llm.py:261] Added request cmpl-3d25e18240e34dd29991b213202931eb-0.
INFO 03-02 01:21:17 [logger.py:42] Received request cmpl-277b05aed96246d2abb7289f36ccfc91-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:21:17 [async_llm.py:261] Added request cmpl-277b05aed96246d2abb7289f36ccfc91-0.
INFO 03-02 01:21:18 [loggers.py:116] Engine 000: Avg prompt throughput: 27.9 tokens/s, Avg generation throughput: 4.5 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.6%
INFO 03-02 01:21:19 [logger.py:42] Received request cmpl-381a73ad05af4242ac9d3d9aa85c9afa-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:21:19 [async_llm.py:261] Added request cmpl-381a73ad05af4242ac9d3d9aa85c9afa-0.
INFO 03-02 01:21:20 [logger.py:42] Received request cmpl-07b88f4c9e1f45aa8b574d5da53b766f-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:21:20 [async_llm.py:261] Added request cmpl-07b88f4c9e1f45aa8b574d5da53b766f-0.
INFO 03-02 01:21:21 [logger.py:42] Received request cmpl-d04bfaa017e74ade950c071d401bdb95-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:21:21 [async_llm.py:261] Added request cmpl-d04bfaa017e74ade950c071d401bdb95-0.
INFO 03-02 01:21:22 [logger.py:42] Received request cmpl-4a3e834a102640388a9247ce4ddcdda5-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:21:22 [async_llm.py:261] Added request cmpl-4a3e834a102640388a9247ce4ddcdda5-0.
INFO 03-02 01:21:23 [logger.py:42] Received request cmpl-3088c8f70faa4a6e9c9ca2c3323ff02a-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:21:23 [async_llm.py:261] Added request cmpl-3088c8f70faa4a6e9c9ca2c3323ff02a-0.
INFO 03-02 01:21:24 [logger.py:42] Received request cmpl-ef516e0dc4c3448e8ccbf51a77a66ee5-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:21:24 [async_llm.py:261] Added request cmpl-ef516e0dc4c3448e8ccbf51a77a66ee5-0.
INFO 03-02 01:21:25 [logger.py:42] Received request cmpl-b3eda0d595904797b36db960adc4ff77-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:21:25 [async_llm.py:261] Added request cmpl-b3eda0d595904797b36db960adc4ff77-0.
INFO 03-02 01:21:27 [logger.py:42] Received request cmpl-01c9a6ff4bc044e48c2edf532fecb9c4-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:21:27 [async_llm.py:261] Added request cmpl-01c9a6ff4bc044e48c2edf532fecb9c4-0.
INFO 03-02 01:21:28 [loggers.py:116] Engine 000: Avg prompt throughput: 24.8 tokens/s, Avg generation throughput: 4.0 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.6%
INFO 03-02 01:21:28 [logger.py:42] Received request cmpl-926f55f0f94441e59c14bc76e9edc363-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:21:28 [async_llm.py:261] Added request cmpl-926f55f0f94441e59c14bc76e9edc363-0.
INFO 03-02 01:21:29 [logger.py:42] Received request cmpl-a15f3dc13b4e4b27b8cffadb112309e1-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:21:29 [async_llm.py:261] Added request cmpl-a15f3dc13b4e4b27b8cffadb112309e1-0.
INFO 03-02 01:21:30 [logger.py:42] Received request cmpl-bfda1f15ec834bdc99b9c3ae0c33c152-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:21:30 [async_llm.py:261] Added request cmpl-bfda1f15ec834bdc99b9c3ae0c33c152-0.
INFO 03-02 01:21:31 [logger.py:42] Received request cmpl-e3a404e01b924497b1c53256395aef7d-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:21:31 [async_llm.py:261] Added request cmpl-e3a404e01b924497b1c53256395aef7d-0.
INFO 03-02 01:21:32 [logger.py:42] Received request cmpl-26c43758516a4856bfdd661df4642f26-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:21:32 [async_llm.py:261] Added request cmpl-26c43758516a4856bfdd661df4642f26-0.
INFO 03-02 01:21:34 [logger.py:42] Received request cmpl-4adcdaaf34614a2cadf16fbb3b857562-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:21:34 [async_llm.py:261] Added request cmpl-4adcdaaf34614a2cadf16fbb3b857562-0.
INFO 03-02 01:21:35 [logger.py:42] Received request cmpl-43f7676aab7d4ef68facd15719fc51af-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:21:35 [async_llm.py:261] Added request cmpl-43f7676aab7d4ef68facd15719fc51af-0.
INFO 03-02 01:21:36 [logger.py:42] Received request cmpl-bb4f6aa09fba4e7e9f7f5ee1214a4a94-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:21:36 [async_llm.py:261] Added request cmpl-bb4f6aa09fba4e7e9f7f5ee1214a4a94-0.
INFO 03-02 01:21:37 [logger.py:42] Received request cmpl-a5e90bd5be89482296b8360130cff2ed-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:21:37 [async_llm.py:261] Added request cmpl-a5e90bd5be89482296b8360130cff2ed-0.
INFO 03-02 01:21:38 [loggers.py:116] Engine 000: Avg prompt throughput: 27.9 tokens/s, Avg generation throughput: 4.5 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.6%
INFO 03-02 01:21:38 [logger.py:42] Received request cmpl-c6833986f8a7407cac9a5775b621b9b9-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:21:38 [async_llm.py:261] Added request cmpl-c6833986f8a7407cac9a5775b621b9b9-0.
INFO 03-02 01:21:39 [logger.py:42] Received request cmpl-0219c3004df74bce80e70ecdaf4ad646-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:21:39 [async_llm.py:261] Added request cmpl-0219c3004df74bce80e70ecdaf4ad646-0.
INFO 03-02 01:21:40 [logger.py:42] Received request cmpl-1bb10bb946b2415c902a98c963251cd1-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:21:40 [async_llm.py:261] Added request cmpl-1bb10bb946b2415c902a98c963251cd1-0.
INFO 03-02 01:21:42 [logger.py:42] Received request cmpl-6cf1f18d120540278b69c54dc0df87b5-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:21:42 [async_llm.py:261] Added request cmpl-6cf1f18d120540278b69c54dc0df87b5-0.
INFO 03-02 01:21:43 [logger.py:42] Received request cmpl-24f6e4b20b7e45c09095a35187ac34c5-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:21:43 [async_llm.py:261] Added request cmpl-24f6e4b20b7e45c09095a35187ac34c5-0.
INFO 03-02 01:21:44 [logger.py:42] Received request cmpl-c59ced56ddd3490eba4d53e0118ee411-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:21:44 [async_llm.py:261] Added request cmpl-c59ced56ddd3490eba4d53e0118ee411-0.
INFO 03-02 01:21:45 [logger.py:42] Received request cmpl-3ae04d61c53946b7be2fb9c38d1d1ad7-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:21:45 [async_llm.py:261] Added request cmpl-3ae04d61c53946b7be2fb9c38d1d1ad7-0.
INFO 03-02 01:21:46 [logger.py:42] Received request cmpl-e8efc8559bba41199c28a00834684900-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:21:46 [async_llm.py:261] Added request cmpl-e8efc8559bba41199c28a00834684900-0.
INFO 03-02 01:21:47 [logger.py:42] Received request cmpl-1b7d7fefe9764f0685e5379532c5736c-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:21:47 [async_llm.py:261] Added request cmpl-1b7d7fefe9764f0685e5379532c5736c-0.
INFO 03-02 01:21:48 [loggers.py:116] Engine 000: Avg prompt throughput: 27.9 tokens/s, Avg generation throughput: 4.5 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.6%
INFO 03-02 01:21:49 [logger.py:42] Received request cmpl-28caf7b3e5e04db285eb5c5d7329ce24-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:21:49 [async_llm.py:261] Added request cmpl-28caf7b3e5e04db285eb5c5d7329ce24-0.
INFO 03-02 01:21:50 [logger.py:42] Received request cmpl-0cbd6956626f48aaa5a4d7d8a639b548-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:21:50 [async_llm.py:261] Added request cmpl-0cbd6956626f48aaa5a4d7d8a639b548-0.
INFO 03-02 01:21:51 [logger.py:42] Received request cmpl-33a88899a2604e16a73a663405148866-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:21:51 [async_llm.py:261] Added request cmpl-33a88899a2604e16a73a663405148866-0.
INFO 03-02 01:21:52 [logger.py:42] Received request cmpl-b3ae1fb52b724cfea1b735fc79edcbfd-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:21:52 [async_llm.py:261] Added request cmpl-b3ae1fb52b724cfea1b735fc79edcbfd-0.
INFO 03-02 01:21:53 [logger.py:42] Received request cmpl-45e12858fb59479ab1151db8bb261f16-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:21:53 [async_llm.py:261] Added request cmpl-45e12858fb59479ab1151db8bb261f16-0.
INFO 03-02 01:21:54 [logger.py:42] Received request cmpl-261ab88ba4b047298dd910776b41de91-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:21:54 [async_llm.py:261] Added request cmpl-261ab88ba4b047298dd910776b41de91-0.
INFO 03-02 01:21:55 [logger.py:42] Received request cmpl-6b4dfdd8dcd64222b22eb6f0fbb692cc-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:21:55 [async_llm.py:261] Added request cmpl-6b4dfdd8dcd64222b22eb6f0fbb692cc-0.
INFO 03-02 01:21:57 [logger.py:42] Received request cmpl-0f52d3a7957b439b9647ab38eb43fb08-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:21:57 [async_llm.py:261] Added request cmpl-0f52d3a7957b439b9647ab38eb43fb08-0.
INFO 03-02 01:21:58 [loggers.py:116] Engine 000: Avg prompt throughput: 24.8 tokens/s, Avg generation throughput: 4.0 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.6%
INFO 03-02 01:21:58 [logger.py:42] Received request cmpl-3d23a62f9ad0499391c44df87e71ef65-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:21:58 [async_llm.py:261] Added request cmpl-3d23a62f9ad0499391c44df87e71ef65-0.
INFO 03-02 01:21:59 [logger.py:42] Received request cmpl-aa2283ead1d940c2a85930ead1358bc0-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:21:59 [async_llm.py:261] Added request cmpl-aa2283ead1d940c2a85930ead1358bc0-0.
INFO 03-02 01:22:00 [logger.py:42] Received request cmpl-f1ecfab46c5e4a88be58a6276ee683df-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:22:00 [async_llm.py:261] Added request cmpl-f1ecfab46c5e4a88be58a6276ee683df-0.
INFO 03-02 01:22:01 [logger.py:42] Received request cmpl-c0c32dfe6c6540c7bd458661248f5679-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:22:01 [async_llm.py:261] Added request cmpl-c0c32dfe6c6540c7bd458661248f5679-0.
INFO 03-02 01:22:02 [logger.py:42] Received request cmpl-b81820032945415d8edbf2c7c4a4c824-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:22:02 [async_llm.py:261] Added request cmpl-b81820032945415d8edbf2c7c4a4c824-0.
INFO 03-02 01:22:03 [logger.py:42] Received request cmpl-f9d8de9a6e914fd59fc266b01aab10ca-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:22:03 [async_llm.py:261] Added request cmpl-f9d8de9a6e914fd59fc266b01aab10ca-0.
INFO 03-02 01:22:05 [logger.py:42] Received request cmpl-daf408516fb449708dab701ed4506d38-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:22:05 [async_llm.py:261] Added request cmpl-daf408516fb449708dab701ed4506d38-0.
INFO 03-02 01:22:06 [logger.py:42] Received request cmpl-e6e34baf97f74114b8301d154b99c5e1-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:22:06 [async_llm.py:261] Added request cmpl-e6e34baf97f74114b8301d154b99c5e1-0.
INFO 03-02 01:22:07 [logger.py:42] Received request cmpl-09e53f2588bb46c4a78ec11641b6d3c6-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:22:07 [async_llm.py:261] Added request cmpl-09e53f2588bb46c4a78ec11641b6d3c6-0.
INFO 03-02 01:22:08 [loggers.py:116] Engine 000: Avg prompt throughput: 27.9 tokens/s, Avg generation throughput: 4.5 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.6%
INFO 03-02 01:22:08 [logger.py:42] Received request cmpl-0818065d8c73455b87b5d09e510d8c9a-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:22:08 [async_llm.py:261] Added request cmpl-0818065d8c73455b87b5d09e510d8c9a-0.
INFO 03-02 01:22:09 [logger.py:42] Received request cmpl-1a52824c8d5144acb09a6ee4fe0883fc-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:22:09 [async_llm.py:261] Added request cmpl-1a52824c8d5144acb09a6ee4fe0883fc-0.
INFO 03-02 01:22:10 [logger.py:42] Received request cmpl-69a3bc757a0d438083b182c66e733e31-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:22:10 [async_llm.py:261] Added request cmpl-69a3bc757a0d438083b182c66e733e31-0.
INFO 03-02 01:22:12 [logger.py:42] Received request cmpl-4f84b6c863de4a3d91a84dae3b17abab-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:22:12 [async_llm.py:261] Added request cmpl-4f84b6c863de4a3d91a84dae3b17abab-0.
INFO 03-02 01:22:13 [logger.py:42] Received request cmpl-f4a6bff840994913a7fb724bfd73b209-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:22:13 [async_llm.py:261] Added request cmpl-f4a6bff840994913a7fb724bfd73b209-0.
INFO 03-02 01:22:14 [logger.py:42] Received request cmpl-e958a0b0ad6b41bfae2aafd980aa0496-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:22:14 [async_llm.py:261] Added request cmpl-e958a0b0ad6b41bfae2aafd980aa0496-0.
INFO 03-02 01:22:15 [logger.py:42] Received request cmpl-e24eb51828d4400ca81ad6d08c2f34b5-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:22:15 [async_llm.py:261] Added request cmpl-e24eb51828d4400ca81ad6d08c2f34b5-0.
INFO 03-02 01:22:16 [logger.py:42] Received request cmpl-31bc8cecb5c044c1837f6922ce598213-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:22:16 [async_llm.py:261] Added request cmpl-31bc8cecb5c044c1837f6922ce598213-0.
INFO 03-02 01:22:17 [logger.py:42] Received request cmpl-e7953c65c06f42619eb0f7b1df810aa6-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:22:17 [async_llm.py:261] Added request cmpl-e7953c65c06f42619eb0f7b1df810aa6-0.
INFO 03-02 01:22:18 [loggers.py:116] Engine 000: Avg prompt throughput: 27.9 tokens/s, Avg generation throughput: 4.5 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.6%
INFO 03-02 01:22:18 [logger.py:42] Received request cmpl-093c965a7da245d2b2becd1df0a4739d-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:22:18 [async_llm.py:261] Added request cmpl-093c965a7da245d2b2becd1df0a4739d-0.
INFO 03-02 01:22:20 [logger.py:42] Received request cmpl-226bad46c7654ad4b686d3e27b489121-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:22:20 [async_llm.py:261] Added request cmpl-226bad46c7654ad4b686d3e27b489121-0.
INFO 03-02 01:22:21 [logger.py:42] Received request cmpl-e6572d7a9c5d4c36847ad72b7481a92e-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:22:21 [async_llm.py:261] Added request cmpl-e6572d7a9c5d4c36847ad72b7481a92e-0.
INFO 03-02 01:22:22 [logger.py:42] Received request cmpl-26866f34775245a4bc994d9ca35d14ab-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:22:22 [async_llm.py:261] Added request cmpl-26866f34775245a4bc994d9ca35d14ab-0.
INFO 03-02 01:22:23 [logger.py:42] Received request cmpl-2e64b58f7ee74115ac774c75f2c3e8bb-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:22:23 [async_llm.py:261] Added request cmpl-2e64b58f7ee74115ac774c75f2c3e8bb-0.
INFO 03-02 01:22:24 [logger.py:42] Received request cmpl-fbeda34cfee24e20badfd28909e24d20-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:22:24 [async_llm.py:261] Added request cmpl-fbeda34cfee24e20badfd28909e24d20-0.
INFO 03-02 01:22:25 [logger.py:42] Received request cmpl-a97896d9a9ad453da2ff53eb495856cb-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:22:25 [async_llm.py:261] Added request cmpl-a97896d9a9ad453da2ff53eb495856cb-0.
INFO 03-02 01:22:27 [logger.py:42] Received request cmpl-92c6632c510448409c7c613b2b46ca7e-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:22:27 [async_llm.py:261] Added request cmpl-92c6632c510448409c7c613b2b46ca7e-0.
INFO 03-02 01:22:28 [loggers.py:116] Engine 000: Avg prompt throughput: 24.8 tokens/s, Avg generation throughput: 4.0 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.6%
INFO 03-02 01:22:28 [logger.py:42] Received request cmpl-4715106fcd03436cb233810cd111c759-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:22:28 [async_llm.py:261] Added request cmpl-4715106fcd03436cb233810cd111c759-0.
INFO 03-02 01:22:29 [logger.py:42] Received request cmpl-96019e395bdb478a83c26991789f727e-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:22:29 [async_llm.py:261] Added request cmpl-96019e395bdb478a83c26991789f727e-0.
INFO 03-02 01:22:30 [logger.py:42] Received request cmpl-ff1c24b0af4449c69fab07395b563634-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:22:30 [async_llm.py:261] Added request cmpl-ff1c24b0af4449c69fab07395b563634-0.
INFO 03-02 01:22:31 [logger.py:42] Received request cmpl-6aff92c5d263408b9faea92660376b06-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:22:31 [async_llm.py:261] Added request cmpl-6aff92c5d263408b9faea92660376b06-0.
INFO 03-02 01:22:32 [logger.py:42] Received request cmpl-aa0da3dd6ad946c994b9ddb0d5fa6fed-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:22:32 [async_llm.py:261] Added request cmpl-aa0da3dd6ad946c994b9ddb0d5fa6fed-0.
INFO 03-02 01:22:33 [logger.py:42] Received request cmpl-d093bef1cd32473094e58ee9cb9d4caa-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:22:33 [async_llm.py:261] Added request cmpl-d093bef1cd32473094e58ee9cb9d4caa-0.
INFO 03-02 01:22:35 [logger.py:42] Received request cmpl-2ea9412220c042d0953a4829639505f1-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:22:35 [async_llm.py:261] Added request cmpl-2ea9412220c042d0953a4829639505f1-0.
INFO 03-02 01:22:36 [logger.py:42] Received request cmpl-f3214e791a2f49cf99476d6449016df3-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:22:36 [async_llm.py:261] Added request cmpl-f3214e791a2f49cf99476d6449016df3-0.
INFO 03-02 01:22:37 [logger.py:42] Received request cmpl-4657a61d0d174cccaf1154728267360b-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:22:37 [async_llm.py:261] Added request cmpl-4657a61d0d174cccaf1154728267360b-0.
INFO 03-02 01:22:38 [loggers.py:116] Engine 000: Avg prompt throughput: 27.9 tokens/s, Avg generation throughput: 4.5 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.6%
INFO 03-02 01:22:38 [logger.py:42] Received request cmpl-47e6b17f005b445d9b725b75907009d4-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:22:38 [async_llm.py:261] Added request cmpl-47e6b17f005b445d9b725b75907009d4-0.
INFO 03-02 01:22:39 [logger.py:42] Received request cmpl-2d46b0d819414664b8809aee6bd8f8f5-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:22:39 [async_llm.py:261] Added request cmpl-2d46b0d819414664b8809aee6bd8f8f5-0.
INFO 03-02 01:22:40 [logger.py:42] Received request cmpl-ff7ac054363e4c368717f435958ee74f-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:22:40 [async_llm.py:261] Added request cmpl-ff7ac054363e4c368717f435958ee74f-0.
INFO 03-02 01:22:42 [logger.py:42] Received request cmpl-45f5bff7e60045fa8ce94504dd49cbd6-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:22:42 [async_llm.py:261] Added request cmpl-45f5bff7e60045fa8ce94504dd49cbd6-0.
INFO 03-02 01:22:43 [logger.py:42] Received request cmpl-a75ef711a84446c3bdaef17c8816eeeb-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:22:43 [async_llm.py:261] Added request cmpl-a75ef711a84446c3bdaef17c8816eeeb-0.
INFO 03-02 01:22:44 [logger.py:42] Received request cmpl-9b1ac1ed6ba447d584e751880306dcfc-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:22:44 [async_llm.py:261] Added request cmpl-9b1ac1ed6ba447d584e751880306dcfc-0.
INFO 03-02 01:22:45 [logger.py:42] Received request cmpl-b04b2d13628c4fe895a95f9725c68384-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:22:45 [async_llm.py:261] Added request cmpl-b04b2d13628c4fe895a95f9725c68384-0.
INFO 03-02 01:22:46 [logger.py:42] Received request cmpl-ed9cfbe8e4114bbf9b1c5c83bf7eed83-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:22:46 [async_llm.py:261] Added request cmpl-ed9cfbe8e4114bbf9b1c5c83bf7eed83-0.
INFO 03-02 01:22:47 [logger.py:42] Received request cmpl-593c7898351140ec97413dc7bf95ddc9-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:22:47 [async_llm.py:261] Added request cmpl-593c7898351140ec97413dc7bf95ddc9-0.
INFO 03-02 01:22:48 [loggers.py:116] Engine 000: Avg prompt throughput: 27.9 tokens/s, Avg generation throughput: 4.5 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.6%
INFO 03-02 01:22:48 [logger.py:42] Received request cmpl-d0a12fbedd83412b8484b5d9d68bf7ec-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:22:48 [async_llm.py:261] Added request cmpl-d0a12fbedd83412b8484b5d9d68bf7ec-0.
INFO 03-02 01:22:50 [logger.py:42] Received request cmpl-eabbd7938fd9491dbde9b9296bd5dfea-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:22:50 [async_llm.py:261] Added request cmpl-eabbd7938fd9491dbde9b9296bd5dfea-0.
INFO 03-02 01:22:51 [logger.py:42] Received request cmpl-7e2bdebcb05f4ecc91265d009e971a33-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:22:51 [async_llm.py:261] Added request cmpl-7e2bdebcb05f4ecc91265d009e971a33-0.
INFO 03-02 01:22:52 [logger.py:42] Received request cmpl-79b50e0bbf1b4e59b31e1429fb434447-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:22:52 [async_llm.py:261] Added request cmpl-79b50e0bbf1b4e59b31e1429fb434447-0.
INFO 03-02 01:22:53 [logger.py:42] Received request cmpl-f94800eff8574a0aa7adaad6ce4c2cda-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:22:53 [async_llm.py:261] Added request cmpl-f94800eff8574a0aa7adaad6ce4c2cda-0.
INFO 03-02 01:22:54 [logger.py:42] Received request cmpl-605c88defeee411ba39ce16e56d7901f-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:22:54 [async_llm.py:261] Added request cmpl-605c88defeee411ba39ce16e56d7901f-0.
INFO 03-02 01:22:55 [logger.py:42] Received request cmpl-7630be0da3544112b6bedacffa6861e0-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:22:55 [async_llm.py:261] Added request cmpl-7630be0da3544112b6bedacffa6861e0-0.
INFO 03-02 01:22:57 [logger.py:42] Received request cmpl-2f8e64ac7bef409cadd44df4266deeba-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:22:57 [async_llm.py:261] Added request cmpl-2f8e64ac7bef409cadd44df4266deeba-0.
INFO 03-02 01:22:58 [loggers.py:116] Engine 000: Avg prompt throughput: 24.8 tokens/s, Avg generation throughput: 4.0 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.6%
INFO 03-02 01:22:58 [logger.py:42] Received request cmpl-cfeb7ef8cf0a423cb1c63dc937c7a50f-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:22:58 [async_llm.py:261] Added request cmpl-cfeb7ef8cf0a423cb1c63dc937c7a50f-0.
INFO 03-02 01:22:59 [logger.py:42] Received request cmpl-c720417279a84161bd710458ffd96561-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:22:59 [async_llm.py:261] Added request cmpl-c720417279a84161bd710458ffd96561-0.
INFO 03-02 01:23:00 [logger.py:42] Received request cmpl-d15a0f90da364295b57cdf6e2153e257-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:23:00 [async_llm.py:261] Added request cmpl-d15a0f90da364295b57cdf6e2153e257-0.
INFO 03-02 01:23:01 [logger.py:42] Received request cmpl-f754011ad8fb41a7b66be985f8a0b9d5-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:23:01 [async_llm.py:261] Added request cmpl-f754011ad8fb41a7b66be985f8a0b9d5-0.
INFO 03-02 01:23:02 [logger.py:42] Received request cmpl-fecd32e5f88443f082e38b5d8bc30750-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:23:02 [async_llm.py:261] Added request cmpl-fecd32e5f88443f082e38b5d8bc30750-0.
INFO 03-02 01:23:03 [logger.py:42] Received request cmpl-baad2d793b554b46bf348f20748ba58c-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:23:03 [async_llm.py:261] Added request cmpl-baad2d793b554b46bf348f20748ba58c-0.
INFO 03-02 01:23:05 [logger.py:42] Received request cmpl-e8c0d41a0214494dbffc35d8a87f0bc0-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:23:05 [async_llm.py:261] Added request cmpl-e8c0d41a0214494dbffc35d8a87f0bc0-0.
INFO 03-02 01:23:06 [logger.py:42] Received request cmpl-251ac2d940294c6baedde81de0a0786f-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:23:06 [async_llm.py:261] Added request cmpl-251ac2d940294c6baedde81de0a0786f-0.
INFO 03-02 01:23:07 [logger.py:42] Received request cmpl-07f6672a80d244e8b737c3bde42b804f-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:23:07 [async_llm.py:261] Added request cmpl-07f6672a80d244e8b737c3bde42b804f-0.
INFO 03-02 01:23:08 [loggers.py:116] Engine 000: Avg prompt throughput: 27.9 tokens/s, Avg generation throughput: 4.5 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.6%
INFO 03-02 01:23:08 [logger.py:42] Received request cmpl-4b41ebbac45445718f3d725fe7c55f5c-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:23:08 [async_llm.py:261] Added request cmpl-4b41ebbac45445718f3d725fe7c55f5c-0.
INFO 03-02 01:23:09 [logger.py:42] Received request cmpl-aaeebc5f8c384ed3b223c5a25a189b3e-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:23:09 [async_llm.py:261] Added request cmpl-aaeebc5f8c384ed3b223c5a25a189b3e-0.
INFO 03-02 01:23:10 [logger.py:42] Received request cmpl-9b24f3371f6e40b08a8731c4f17250cf-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:23:10 [async_llm.py:261] Added request cmpl-9b24f3371f6e40b08a8731c4f17250cf-0.
INFO 03-02 01:23:12 [logger.py:42] Received request cmpl-a55d1c072b9c4d0786694e886e08e1ef-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:23:12 [async_llm.py:261] Added request cmpl-a55d1c072b9c4d0786694e886e08e1ef-0.
INFO 03-02 01:23:13 [logger.py:42] Received request cmpl-f369189acff649a5b41905192fae463c-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:23:13 [async_llm.py:261] Added request cmpl-f369189acff649a5b41905192fae463c-0.
INFO 03-02 01:23:14 [logger.py:42] Received request cmpl-fa6b215a0d894d378ae04f33b44382b2-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:23:14 [async_llm.py:261] Added request cmpl-fa6b215a0d894d378ae04f33b44382b2-0.
INFO 03-02 01:23:15 [logger.py:42] Received request cmpl-18bb390de5954c37887949decf5a0bb2-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:23:15 [async_llm.py:261] Added request cmpl-18bb390de5954c37887949decf5a0bb2-0.
INFO 03-02 01:23:16 [logger.py:42] Received request cmpl-bef93ccb969e4d85af64db65e987d7fb-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:23:16 [async_llm.py:261] Added request cmpl-bef93ccb969e4d85af64db65e987d7fb-0.
INFO 03-02 01:23:17 [logger.py:42] Received request cmpl-f5497d6eeb134d3c88b838df170423dd-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:23:17 [async_llm.py:261] Added request cmpl-f5497d6eeb134d3c88b838df170423dd-0.
INFO 03-02 01:23:18 [loggers.py:116] Engine 000: Avg prompt throughput: 27.9 tokens/s, Avg generation throughput: 4.5 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.6%
INFO 03-02 01:23:18 [logger.py:42] Received request cmpl-341a6e1e95de4ea3ace6c4b34e9b14ee-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:23:18 [async_llm.py:261] Added request cmpl-341a6e1e95de4ea3ace6c4b34e9b14ee-0.
INFO 03-02 01:23:20 [logger.py:42] Received request cmpl-219761f4c0ff49beabf63883fec8661f-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:23:20 [async_llm.py:261] Added request cmpl-219761f4c0ff49beabf63883fec8661f-0.
INFO 03-02 01:23:21 [logger.py:42] Received request cmpl-f94e1f0c740b4c9eb794f80d67d1c6dd-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:23:21 [async_llm.py:261] Added request cmpl-f94e1f0c740b4c9eb794f80d67d1c6dd-0.
INFO 03-02 01:23:22 [logger.py:42] Received request cmpl-06b37e6bc0d4487ab30a65c0eeff43ac-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:23:22 [async_llm.py:261] Added request cmpl-06b37e6bc0d4487ab30a65c0eeff43ac-0.
INFO 03-02 01:23:23 [logger.py:42] Received request cmpl-85bbf2df478b4d138363910ab945de68-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:23:23 [async_llm.py:261] Added request cmpl-85bbf2df478b4d138363910ab945de68-0.
INFO 03-02 01:23:24 [logger.py:42] Received request cmpl-9e9ac822effe45cb8379886f7f0a7ea8-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:23:24 [async_llm.py:261] Added request cmpl-9e9ac822effe45cb8379886f7f0a7ea8-0.
INFO 03-02 01:23:25 [logger.py:42] Received request cmpl-7b64133fe4e744dbaca220d4ed9ddee0-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:23:25 [async_llm.py:261] Added request cmpl-7b64133fe4e744dbaca220d4ed9ddee0-0.
INFO 03-02 01:23:27 [logger.py:42] Received request cmpl-25561eaf3ccf48cd8c4a53f782a60480-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:23:27 [async_llm.py:261] Added request cmpl-25561eaf3ccf48cd8c4a53f782a60480-0.
INFO 03-02 01:23:28 [loggers.py:116] Engine 000: Avg prompt throughput: 24.8 tokens/s, Avg generation throughput: 4.0 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.6%
INFO 03-02 01:23:28 [logger.py:42] Received request cmpl-afb3945744264457bc4b99f981aac803-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:23:28 [async_llm.py:261] Added request cmpl-afb3945744264457bc4b99f981aac803-0.
INFO 03-02 01:23:29 [logger.py:42] Received request cmpl-3201216660df4b34a7dbd9977d807c94-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:23:29 [async_llm.py:261] Added request cmpl-3201216660df4b34a7dbd9977d807c94-0.
INFO 03-02 01:23:30 [logger.py:42] Received request cmpl-f78353132d534d92b2674ebd813a4c49-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:23:30 [async_llm.py:261] Added request cmpl-f78353132d534d92b2674ebd813a4c49-0.
INFO 03-02 01:23:31 [logger.py:42] Received request cmpl-cc00bd0cb10c464aa942d25bf1c527af-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:23:31 [async_llm.py:261] Added request cmpl-cc00bd0cb10c464aa942d25bf1c527af-0.
INFO 03-02 01:23:32 [logger.py:42] Received request cmpl-3a8f68f83c754f5d9c170c63ffb3554f-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:23:32 [async_llm.py:261] Added request cmpl-3a8f68f83c754f5d9c170c63ffb3554f-0.
INFO 03-02 01:23:33 [logger.py:42] Received request cmpl-d61d90a55d3144b987afb7a7bbc0d332-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:23:33 [async_llm.py:261] Added request cmpl-d61d90a55d3144b987afb7a7bbc0d332-0.
INFO 03-02 01:23:35 [logger.py:42] Received request cmpl-0ee2cc7e2f2b490dbe5395240466214a-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:23:35 [async_llm.py:261] Added request cmpl-0ee2cc7e2f2b490dbe5395240466214a-0.
INFO 03-02 01:23:36 [logger.py:42] Received request cmpl-24772b78fd2443b2bdf5baecd9a5d419-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:23:36 [async_llm.py:261] Added request cmpl-24772b78fd2443b2bdf5baecd9a5d419-0.
INFO 03-02 01:23:37 [logger.py:42] Received request cmpl-01b28494ef6d469abb04a0bcf21c47c5-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:23:37 [async_llm.py:261] Added request cmpl-01b28494ef6d469abb04a0bcf21c47c5-0.
INFO 03-02 01:23:38 [loggers.py:116] Engine 000: Avg prompt throughput: 27.9 tokens/s, Avg generation throughput: 4.5 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.6%
INFO 03-02 01:23:38 [logger.py:42] Received request cmpl-61c579a274594e22ac9ce92bc1bfeefa-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:23:38 [async_llm.py:261] Added request cmpl-61c579a274594e22ac9ce92bc1bfeefa-0.
INFO 03-02 01:23:39 [logger.py:42] Received request cmpl-94b86fec36524eddb8cad87c8c4a608a-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:23:39 [async_llm.py:261] Added request cmpl-94b86fec36524eddb8cad87c8c4a608a-0.
INFO 03-02 01:23:40 [logger.py:42] Received request cmpl-4a0891524e7f49f8b5a6b76bca0d60e4-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:23:40 [async_llm.py:261] Added request cmpl-4a0891524e7f49f8b5a6b76bca0d60e4-0.
INFO 03-02 01:23:42 [logger.py:42] Received request cmpl-58f0dfd06390401297adf0aafd89f6f7-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:23:42 [async_llm.py:261] Added request cmpl-58f0dfd06390401297adf0aafd89f6f7-0.
INFO 03-02 01:23:43 [logger.py:42] Received request cmpl-4451765a3b8b4a3d8ea8a13a5465db16-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:23:43 [async_llm.py:261] Added request cmpl-4451765a3b8b4a3d8ea8a13a5465db16-0.
INFO 03-02 01:23:44 [logger.py:42] Received request cmpl-d4cd8a1773ba4246ab51066ccd357304-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:23:44 [async_llm.py:261] Added request cmpl-d4cd8a1773ba4246ab51066ccd357304-0.
INFO 03-02 01:23:45 [logger.py:42] Received request cmpl-0ba098218f2646589ab93cc9c8ce1049-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:23:45 [async_llm.py:261] Added request cmpl-0ba098218f2646589ab93cc9c8ce1049-0.
INFO 03-02 01:23:46 [logger.py:42] Received request cmpl-daf895c589f04c5581f1f26e824e796d-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:23:46 [async_llm.py:261] Added request cmpl-daf895c589f04c5581f1f26e824e796d-0.
INFO 03-02 01:23:47 [logger.py:42] Received request cmpl-d2fb3bc427be4b38af8ccd7f465b656e-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:23:47 [async_llm.py:261] Added request cmpl-d2fb3bc427be4b38af8ccd7f465b656e-0.
INFO 03-02 01:23:48 [loggers.py:116] Engine 000: Avg prompt throughput: 27.9 tokens/s, Avg generation throughput: 4.5 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.6%
INFO 03-02 01:23:48 [logger.py:42] Received request cmpl-40b17bda8b4d47cdb74f27b2e79b2e42-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:23:48 [async_llm.py:261] Added request cmpl-40b17bda8b4d47cdb74f27b2e79b2e42-0.
INFO 03-02 01:23:50 [logger.py:42] Received request cmpl-da57b8dc2939410c81dbeb81458b59d7-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:23:50 [async_llm.py:261] Added request cmpl-da57b8dc2939410c81dbeb81458b59d7-0.
INFO 03-02 01:23:51 [logger.py:42] Received request cmpl-928286290919441cb7005514d06a72c3-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:23:51 [async_llm.py:261] Added request cmpl-928286290919441cb7005514d06a72c3-0.
INFO 03-02 01:23:52 [logger.py:42] Received request cmpl-9e01ae7256064cc283c0082759f2d93a-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:23:52 [async_llm.py:261] Added request cmpl-9e01ae7256064cc283c0082759f2d93a-0.
INFO 03-02 01:23:53 [logger.py:42] Received request cmpl-2c0913511bed4e76af9f384da7ac80d1-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:23:53 [async_llm.py:261] Added request cmpl-2c0913511bed4e76af9f384da7ac80d1-0.
INFO 03-02 01:23:54 [logger.py:42] Received request cmpl-5cf27d4e24e948c79f4d10a39e086d5e-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:23:54 [async_llm.py:261] Added request cmpl-5cf27d4e24e948c79f4d10a39e086d5e-0.
INFO 03-02 01:23:55 [logger.py:42] Received request cmpl-ffc191cd4b6c4065ac693f251c27e05e-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:23:55 [async_llm.py:261] Added request cmpl-ffc191cd4b6c4065ac693f251c27e05e-0.
INFO 03-02 01:23:57 [logger.py:42] Received request cmpl-b30b7a7908a842e88031b7c463d037a6-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:23:57 [async_llm.py:261] Added request cmpl-b30b7a7908a842e88031b7c463d037a6-0.
INFO 03-02 01:23:58 [loggers.py:116] Engine 000: Avg prompt throughput: 24.8 tokens/s, Avg generation throughput: 4.0 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.6%
INFO 03-02 01:23:58 [logger.py:42] Received request cmpl-865b80031d124ec390ee3fa1561f45f4-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:23:58 [async_llm.py:261] Added request cmpl-865b80031d124ec390ee3fa1561f45f4-0.
INFO 03-02 01:23:59 [logger.py:42] Received request cmpl-a42289d7ae9a40a6b3b9d772d68d9d8a-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:23:59 [async_llm.py:261] Added request cmpl-a42289d7ae9a40a6b3b9d772d68d9d8a-0.
INFO 03-02 01:24:00 [logger.py:42] Received request cmpl-a2a29d7a77a8441b8ba0651d8cf51d8a-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:24:00 [async_llm.py:261] Added request cmpl-a2a29d7a77a8441b8ba0651d8cf51d8a-0.
INFO 03-02 01:24:01 [logger.py:42] Received request cmpl-d16749e4ec7c49f3a470d13ae0d7333d-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:24:01 [async_llm.py:261] Added request cmpl-d16749e4ec7c49f3a470d13ae0d7333d-0.
INFO 03-02 01:24:02 [logger.py:42] Received request cmpl-deef4e458a954b0b9de62ad9187a665a-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:24:02 [async_llm.py:261] Added request cmpl-deef4e458a954b0b9de62ad9187a665a-0.
INFO 03-02 01:24:04 [logger.py:42] Received request cmpl-4b5c79bc3d0548e3be453be340b6a50d-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:24:04 [async_llm.py:261] Added request cmpl-4b5c79bc3d0548e3be453be340b6a50d-0.
INFO 03-02 01:24:05 [logger.py:42] Received request cmpl-3d42d453a839412580f07e0e4bce9bb0-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:24:05 [async_llm.py:261] Added request cmpl-3d42d453a839412580f07e0e4bce9bb0-0.
INFO 03-02 01:24:06 [logger.py:42] Received request cmpl-eaa1593f11dd46f6ab0b3c2219642a3e-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:24:06 [async_llm.py:261] Added request cmpl-eaa1593f11dd46f6ab0b3c2219642a3e-0.
INFO 03-02 01:24:07 [logger.py:42] Received request cmpl-9287b21738e54cbdac691ee28ffab84f-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:24:07 [async_llm.py:261] Added request cmpl-9287b21738e54cbdac691ee28ffab84f-0.
INFO 03-02 01:24:08 [loggers.py:116] Engine 000: Avg prompt throughput: 27.9 tokens/s, Avg generation throughput: 4.5 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.6%
INFO 03-02 01:24:08 [logger.py:42] Received request cmpl-9cea0c4d71724177933d867565ecbaf9-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:24:08 [async_llm.py:261] Added request cmpl-9cea0c4d71724177933d867565ecbaf9-0.
INFO 03-02 01:24:09 [logger.py:42] Received request cmpl-18c33e1995a2404e8d1484cde65dab0c-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:24:09 [async_llm.py:261] Added request cmpl-18c33e1995a2404e8d1484cde65dab0c-0.
INFO 03-02 01:24:10 [logger.py:42] Received request cmpl-1a28a9253754435c9d5b12d2e68b2de9-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:24:10 [async_llm.py:261] Added request cmpl-1a28a9253754435c9d5b12d2e68b2de9-0.
INFO 03-02 01:24:12 [logger.py:42] Received request cmpl-ed0a86ed6ea64221b7a433bc2442e28c-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:24:12 [async_llm.py:261] Added request cmpl-ed0a86ed6ea64221b7a433bc2442e28c-0.
INFO 03-02 01:24:13 [logger.py:42] Received request cmpl-f425a1d664034230bca51f8a71f3742c-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:24:13 [async_llm.py:261] Added request cmpl-f425a1d664034230bca51f8a71f3742c-0.
INFO 03-02 01:24:14 [logger.py:42] Received request cmpl-f8c329e20b354e9b82d7f9d570282334-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:24:14 [async_llm.py:261] Added request cmpl-f8c329e20b354e9b82d7f9d570282334-0.
INFO 03-02 01:24:15 [logger.py:42] Received request cmpl-81bb9d41219748268c37a292668d170e-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:24:15 [async_llm.py:261] Added request cmpl-81bb9d41219748268c37a292668d170e-0.
INFO 03-02 01:24:16 [logger.py:42] Received request cmpl-5238b62562ba434d9c1797b1a5a070df-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:24:16 [async_llm.py:261] Added request cmpl-5238b62562ba434d9c1797b1a5a070df-0.
INFO 03-02 01:24:17 [logger.py:42] Received request cmpl-adf987e7d2734528a9bc6d0b0a9aa6c4-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:24:17 [async_llm.py:261] Added request cmpl-adf987e7d2734528a9bc6d0b0a9aa6c4-0.
INFO 03-02 01:24:18 [loggers.py:116] Engine 000: Avg prompt throughput: 27.9 tokens/s, Avg generation throughput: 4.5 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.6%
INFO 03-02 01:24:19 [logger.py:42] Received request cmpl-ab361ffce4644eb4aa0a4fd989e09fd1-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:24:19 [async_llm.py:261] Added request cmpl-ab361ffce4644eb4aa0a4fd989e09fd1-0.
INFO 03-02 01:24:20 [logger.py:42] Received request cmpl-c44203c7baae40a08250e2e6cc92b3f6-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:24:20 [async_llm.py:261] Added request cmpl-c44203c7baae40a08250e2e6cc92b3f6-0.
INFO 03-02 01:24:21 [logger.py:42] Received request cmpl-f9cbcd3b85f84a679df0e0b4b92969a8-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:24:21 [async_llm.py:261] Added request cmpl-f9cbcd3b85f84a679df0e0b4b92969a8-0.
INFO 03-02 01:24:22 [logger.py:42] Received request cmpl-6a38c49d5ae6481baaade10015da8630-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:24:22 [async_llm.py:261] Added request cmpl-6a38c49d5ae6481baaade10015da8630-0.
INFO 03-02 01:24:23 [logger.py:42] Received request cmpl-0514cf78527b490d86c7809929117edb-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:24:23 [async_llm.py:261] Added request cmpl-0514cf78527b490d86c7809929117edb-0.
INFO 03-02 01:24:24 [logger.py:42] Received request cmpl-8680254fbcbe429b8babaa219690ee6e-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:24:24 [async_llm.py:261] Added request cmpl-8680254fbcbe429b8babaa219690ee6e-0.
INFO 03-02 01:24:25 [logger.py:42] Received request cmpl-6c5543f0d94341afaee85508a414b8ac-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:24:25 [async_llm.py:261] Added request cmpl-6c5543f0d94341afaee85508a414b8ac-0.
INFO 03-02 01:24:27 [logger.py:42] Received request cmpl-75e1980e3afc4f75b973259f66aadeed-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:24:27 [async_llm.py:261] Added request cmpl-75e1980e3afc4f75b973259f66aadeed-0.
INFO 03-02 01:24:28 [loggers.py:116] Engine 000: Avg prompt throughput: 24.8 tokens/s, Avg generation throughput: 4.0 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.6%
INFO 03-02 01:24:28 [logger.py:42] Received request cmpl-7dfb252156684e68967ed04f2600b6ea-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:24:28 [async_llm.py:261] Added request cmpl-7dfb252156684e68967ed04f2600b6ea-0.
INFO 03-02 01:24:29 [logger.py:42] Received request cmpl-73c95d780cb847e4948db8a236b20e49-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:24:29 [async_llm.py:261] Added request cmpl-73c95d780cb847e4948db8a236b20e49-0.
INFO 03-02 01:24:30 [logger.py:42] Received request cmpl-656e2e9542584aa4acf1b29f40d1c2fc-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:24:30 [async_llm.py:261] Added request cmpl-656e2e9542584aa4acf1b29f40d1c2fc-0.
INFO 03-02 01:24:31 [logger.py:42] Received request cmpl-0999c44d17c74c53926a7e9af1f967b2-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:24:31 [async_llm.py:261] Added request cmpl-0999c44d17c74c53926a7e9af1f967b2-0.
INFO 03-02 01:24:32 [logger.py:42] Received request cmpl-146d9b27caf24b239dfc37159e0c3d08-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:24:32 [async_llm.py:261] Added request cmpl-146d9b27caf24b239dfc37159e0c3d08-0.
INFO 03-02 01:24:34 [logger.py:42] Received request cmpl-df2067c804184b93bfc62a8200c1c2a9-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:24:34 [async_llm.py:261] Added request cmpl-df2067c804184b93bfc62a8200c1c2a9-0.
INFO 03-02 01:24:35 [logger.py:42] Received request cmpl-69ababd2055e4bdc89420f1fcb1c1c20-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:24:35 [async_llm.py:261] Added request cmpl-69ababd2055e4bdc89420f1fcb1c1c20-0.
INFO 03-02 01:24:36 [logger.py:42] Received request cmpl-e936837dea49436dad8063bac3c78548-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:24:36 [async_llm.py:261] Added request cmpl-e936837dea49436dad8063bac3c78548-0.
INFO 03-02 01:24:37 [logger.py:42] Received request cmpl-40eba03c9e22490181cefbfdd5607083-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:24:37 [async_llm.py:261] Added request cmpl-40eba03c9e22490181cefbfdd5607083-0.
INFO 03-02 01:24:38 [loggers.py:116] Engine 000: Avg prompt throughput: 27.9 tokens/s, Avg generation throughput: 4.5 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.6%
INFO 03-02 01:24:38 [logger.py:42] Received request cmpl-3b31ba1211464a168d83f3159adca2c7-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:24:38 [async_llm.py:261] Added request cmpl-3b31ba1211464a168d83f3159adca2c7-0.
INFO 03-02 01:24:39 [logger.py:42] Received request cmpl-a8aebf601ee24697bc299d4782a10735-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:24:39 [async_llm.py:261] Added request cmpl-a8aebf601ee24697bc299d4782a10735-0.
INFO 03-02 01:24:40 [logger.py:42] Received request cmpl-e2254918cb99467388970289bf51fe7a-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:24:40 [async_llm.py:261] Added request cmpl-e2254918cb99467388970289bf51fe7a-0.
INFO 03-02 01:24:42 [logger.py:42] Received request cmpl-11836bd17ede4ea6a3c03efbc5845823-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:24:42 [async_llm.py:261] Added request cmpl-11836bd17ede4ea6a3c03efbc5845823-0.
INFO 03-02 01:24:43 [logger.py:42] Received request cmpl-494fe3e164384879be641d8b70f68050-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:24:43 [async_llm.py:261] Added request cmpl-494fe3e164384879be641d8b70f68050-0.
INFO 03-02 01:24:44 [logger.py:42] Received request cmpl-d99852951bcd43df9a0424a7871d25c2-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:24:44 [async_llm.py:261] Added request cmpl-d99852951bcd43df9a0424a7871d25c2-0.
INFO 03-02 01:24:45 [logger.py:42] Received request cmpl-5fddd2e4e7fb4c978bdd89c83977fe82-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:24:45 [async_llm.py:261] Added request cmpl-5fddd2e4e7fb4c978bdd89c83977fe82-0.
INFO 03-02 01:24:46 [logger.py:42] Received request cmpl-165d831281b14b6089056843a657ee66-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:24:46 [async_llm.py:261] Added request cmpl-165d831281b14b6089056843a657ee66-0.
INFO 03-02 01:24:47 [logger.py:42] Received request cmpl-08693e6e810a4e9396ec19f2302e16fb-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:24:47 [async_llm.py:261] Added request cmpl-08693e6e810a4e9396ec19f2302e16fb-0.
INFO 03-02 01:24:48 [loggers.py:116] Engine 000: Avg prompt throughput: 27.9 tokens/s, Avg generation throughput: 4.5 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.6%
INFO 03-02 01:24:49 [logger.py:42] Received request cmpl-eaf71233383a4b01951826da1b5e058c-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:24:49 [async_llm.py:261] Added request cmpl-eaf71233383a4b01951826da1b5e058c-0.
INFO 03-02 01:24:50 [logger.py:42] Received request cmpl-ee8d7aea141949199704b41f743d16e6-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:24:50 [async_llm.py:261] Added request cmpl-ee8d7aea141949199704b41f743d16e6-0.
INFO 03-02 01:24:51 [logger.py:42] Received request cmpl-ef15a5a912484b82b24115aef7e650c4-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:24:51 [async_llm.py:261] Added request cmpl-ef15a5a912484b82b24115aef7e650c4-0.
INFO 03-02 01:24:52 [logger.py:42] Received request cmpl-7448c54898274bbf900f33910d82b518-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:24:52 [async_llm.py:261] Added request cmpl-7448c54898274bbf900f33910d82b518-0.
INFO 03-02 01:24:53 [logger.py:42] Received request cmpl-c48f78a771764a3da7c422a7855f5afd-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:24:53 [async_llm.py:261] Added request cmpl-c48f78a771764a3da7c422a7855f5afd-0.
INFO 03-02 01:24:54 [logger.py:42] Received request cmpl-843d1bcb2dbf4f0986da4d87c81e4ecb-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:24:54 [async_llm.py:261] Added request cmpl-843d1bcb2dbf4f0986da4d87c81e4ecb-0.
INFO 03-02 01:24:55 [logger.py:42] Received request cmpl-16387eb06ad14dcdac096f2ab0a61558-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:24:55 [async_llm.py:261] Added request cmpl-16387eb06ad14dcdac096f2ab0a61558-0.
INFO 03-02 01:24:57 [logger.py:42] Received request cmpl-d4491ad5384a405a90d448fe695fec9c-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:24:57 [async_llm.py:261] Added request cmpl-d4491ad5384a405a90d448fe695fec9c-0.
INFO 03-02 01:24:58 [loggers.py:116] Engine 000: Avg prompt throughput: 24.8 tokens/s, Avg generation throughput: 4.0 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.6%
INFO 03-02 01:24:58 [logger.py:42] Received request cmpl-db67dc08503945d4b47db0730b7e6b8b-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:24:58 [async_llm.py:261] Added request cmpl-db67dc08503945d4b47db0730b7e6b8b-0.
INFO 03-02 01:24:59 [logger.py:42] Received request cmpl-96d07861afb24daf9eb46717ffdee40e-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:24:59 [async_llm.py:261] Added request cmpl-96d07861afb24daf9eb46717ffdee40e-0.
INFO 03-02 01:25:00 [logger.py:42] Received request cmpl-63ef079f78534d2fb671b2b4e83a16c5-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:25:00 [async_llm.py:261] Added request cmpl-63ef079f78534d2fb671b2b4e83a16c5-0.
INFO 03-02 01:25:01 [logger.py:42] Received request cmpl-ae964bd0eee24066b2de04cacb75d3cb-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:25:01 [async_llm.py:261] Added request cmpl-ae964bd0eee24066b2de04cacb75d3cb-0.
INFO 03-02 01:25:02 [logger.py:42] Received request cmpl-fe6f881d659f48f18409e0412432495d-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:25:02 [async_llm.py:261] Added request cmpl-fe6f881d659f48f18409e0412432495d-0.
INFO 03-02 01:25:04 [logger.py:42] Received request cmpl-7328da6524494544b163badf9b0920de-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:25:04 [async_llm.py:261] Added request cmpl-7328da6524494544b163badf9b0920de-0.
INFO 03-02 01:25:05 [logger.py:42] Received request cmpl-a4cc388f5b684cff9d49acc2be37d027-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:25:05 [async_llm.py:261] Added request cmpl-a4cc388f5b684cff9d49acc2be37d027-0.
INFO 03-02 01:25:06 [logger.py:42] Received request cmpl-ce42aa67c0d54f3cadbaeca5a02014d5-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:25:06 [async_llm.py:261] Added request cmpl-ce42aa67c0d54f3cadbaeca5a02014d5-0.
INFO 03-02 01:25:07 [logger.py:42] Received request cmpl-d1a91d7f6d7d4effb95e7eb0b57cd426-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:25:07 [async_llm.py:261] Added request cmpl-d1a91d7f6d7d4effb95e7eb0b57cd426-0.
INFO 03-02 01:25:08 [loggers.py:116] Engine 000: Avg prompt throughput: 27.9 tokens/s, Avg generation throughput: 4.5 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.6%
INFO 03-02 01:25:08 [logger.py:42] Received request cmpl-adc0f20ba5f44afd865722401026f9c3-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:25:08 [async_llm.py:261] Added request cmpl-adc0f20ba5f44afd865722401026f9c3-0.
INFO 03-02 01:25:09 [logger.py:42] Received request cmpl-7168b3acc0004ed48e2625e914cd5410-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:25:09 [async_llm.py:261] Added request cmpl-7168b3acc0004ed48e2625e914cd5410-0.
INFO 03-02 01:25:10 [logger.py:42] Received request cmpl-2653e00b076e47d39d9ad445c10bba51-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:25:10 [async_llm.py:261] Added request cmpl-2653e00b076e47d39d9ad445c10bba51-0.
INFO 03-02 01:25:12 [logger.py:42] Received request cmpl-76b4d77f2ca345e58b983c62b94a3842-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:25:12 [async_llm.py:261] Added request cmpl-76b4d77f2ca345e58b983c62b94a3842-0.
INFO 03-02 01:25:13 [logger.py:42] Received request cmpl-811a3db96ff24f889e5ab5645d57c759-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:25:13 [async_llm.py:261] Added request cmpl-811a3db96ff24f889e5ab5645d57c759-0.
INFO 03-02 01:25:14 [logger.py:42] Received request cmpl-e5c25142993047d0a7220728e0a02f27-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:25:14 [async_llm.py:261] Added request cmpl-e5c25142993047d0a7220728e0a02f27-0.
INFO 03-02 01:25:15 [logger.py:42] Received request cmpl-a6080fe15239460da5bbb79cf9fb27aa-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:25:15 [async_llm.py:261] Added request cmpl-a6080fe15239460da5bbb79cf9fb27aa-0.
INFO 03-02 01:25:16 [logger.py:42] Received request cmpl-f9d6d2e26fdb477ca9a1d48f0b2504dc-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:25:16 [async_llm.py:261] Added request cmpl-f9d6d2e26fdb477ca9a1d48f0b2504dc-0.
INFO 03-02 01:25:17 [logger.py:42] Received request cmpl-dce5e4021ffa4aee964d94a4e6c9af7a-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:25:17 [async_llm.py:261] Added request cmpl-dce5e4021ffa4aee964d94a4e6c9af7a-0.
INFO 03-02 01:25:18 [loggers.py:116] Engine 000: Avg prompt throughput: 27.9 tokens/s, Avg generation throughput: 4.5 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.6%
INFO 03-02 01:25:19 [logger.py:42] Received request cmpl-a5c683a0185849baba8813f3760c0245-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:25:19 [async_llm.py:261] Added request cmpl-a5c683a0185849baba8813f3760c0245-0.
INFO 03-02 01:25:20 [logger.py:42] Received request cmpl-b4459157507d46ce9b0c7eedfc544ce9-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:25:20 [async_llm.py:261] Added request cmpl-b4459157507d46ce9b0c7eedfc544ce9-0.
INFO 03-02 01:25:21 [logger.py:42] Received request cmpl-7ac8caca099f4264b17f763f364b9803-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:25:21 [async_llm.py:261] Added request cmpl-7ac8caca099f4264b17f763f364b9803-0.
INFO 03-02 01:25:22 [logger.py:42] Received request cmpl-1e0bff374f664cebb3cb1c3acb236251-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:25:22 [async_llm.py:261] Added request cmpl-1e0bff374f664cebb3cb1c3acb236251-0.
INFO 03-02 01:25:23 [logger.py:42] Received request cmpl-db899f17843b48c1bdf0d073d41c8c3a-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:25:23 [async_llm.py:261] Added request cmpl-db899f17843b48c1bdf0d073d41c8c3a-0.
INFO 03-02 01:25:24 [logger.py:42] Received request cmpl-0ac770020bbd439c9db99a2444f10b37-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:25:24 [async_llm.py:261] Added request cmpl-0ac770020bbd439c9db99a2444f10b37-0.
INFO 03-02 01:25:25 [logger.py:42] Received request cmpl-9631998c1b4944758aa28c65f8c8f215-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:25:25 [async_llm.py:261] Added request cmpl-9631998c1b4944758aa28c65f8c8f215-0.
INFO 03-02 01:25:27 [logger.py:42] Received request cmpl-7c296ef79bc047a1a39d4622f306fd56-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:25:27 [async_llm.py:261] Added request cmpl-7c296ef79bc047a1a39d4622f306fd56-0.
INFO 03-02 01:25:28 [loggers.py:116] Engine 000: Avg prompt throughput: 24.8 tokens/s, Avg generation throughput: 4.0 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.6%
INFO 03-02 01:25:28 [logger.py:42] Received request cmpl-2ea02ff3d8dd42b9b006d94b98be24cf-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:25:28 [async_llm.py:261] Added request cmpl-2ea02ff3d8dd42b9b006d94b98be24cf-0.
INFO 03-02 01:25:29 [logger.py:42] Received request cmpl-7223c2719b034a2cb940c601cb7f3138-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:25:29 [async_llm.py:261] Added request cmpl-7223c2719b034a2cb940c601cb7f3138-0.
INFO 03-02 01:25:30 [logger.py:42] Received request cmpl-2f4f30f7d2ed48ba86bc833408826d09-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:25:30 [async_llm.py:261] Added request cmpl-2f4f30f7d2ed48ba86bc833408826d09-0.
INFO 03-02 01:25:31 [logger.py:42] Received request cmpl-8810894f03004e29a4ec893df8b55741-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:25:31 [async_llm.py:261] Added request cmpl-8810894f03004e29a4ec893df8b55741-0.
INFO 03-02 01:25:32 [logger.py:42] Received request cmpl-2c2c3e348e184e32ab4dc988f82981f9-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:25:32 [async_llm.py:261] Added request cmpl-2c2c3e348e184e32ab4dc988f82981f9-0.
INFO 03-02 01:25:34 [logger.py:42] Received request cmpl-12fc6e8f6c9f4f7e87916cb90a01fc08-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:25:34 [async_llm.py:261] Added request cmpl-12fc6e8f6c9f4f7e87916cb90a01fc08-0.
INFO 03-02 01:25:35 [logger.py:42] Received request cmpl-f305ae87c68647f6bc77e78b801a9387-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:25:35 [async_llm.py:261] Added request cmpl-f305ae87c68647f6bc77e78b801a9387-0.
INFO 03-02 01:25:36 [logger.py:42] Received request cmpl-d72ffd1a9d5a459caf8432d7e994d371-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:25:36 [async_llm.py:261] Added request cmpl-d72ffd1a9d5a459caf8432d7e994d371-0.
INFO 03-02 01:25:37 [logger.py:42] Received request cmpl-149c28d6b59c40769d4e7f43ba22a154-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:25:37 [async_llm.py:261] Added request cmpl-149c28d6b59c40769d4e7f43ba22a154-0.
INFO 03-02 01:25:38 [loggers.py:116] Engine 000: Avg prompt throughput: 27.9 tokens/s, Avg generation throughput: 4.5 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.6%
INFO 03-02 01:25:38 [logger.py:42] Received request cmpl-6c6d20d020a0474eb66bd92f0c57c4b7-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:25:38 [async_llm.py:261] Added request cmpl-6c6d20d020a0474eb66bd92f0c57c4b7-0.
INFO 03-02 01:25:39 [logger.py:42] Received request cmpl-500f0705b0eb4bf89fcae26341cb4216-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:25:39 [async_llm.py:261] Added request cmpl-500f0705b0eb4bf89fcae26341cb4216-0.
INFO 03-02 01:25:40 [logger.py:42] Received request cmpl-afdaf130588b45f099df86d9c721ab75-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:25:40 [async_llm.py:261] Added request cmpl-afdaf130588b45f099df86d9c721ab75-0.
INFO 03-02 01:25:42 [logger.py:42] Received request cmpl-add152a8c27d4d788e733fd5873da476-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:25:42 [async_llm.py:261] Added request cmpl-add152a8c27d4d788e733fd5873da476-0.
INFO 03-02 01:25:43 [logger.py:42] Received request cmpl-6d988dd1888a41739699f59758b72610-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:25:43 [async_llm.py:261] Added request cmpl-6d988dd1888a41739699f59758b72610-0.
INFO 03-02 01:25:44 [logger.py:42] Received request cmpl-af0425871ece4979b8525c6991d85187-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:25:44 [async_llm.py:261] Added request cmpl-af0425871ece4979b8525c6991d85187-0.
INFO 03-02 01:25:45 [logger.py:42] Received request cmpl-e06ec0a51bc148b681e9337493accc17-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:25:45 [async_llm.py:261] Added request cmpl-e06ec0a51bc148b681e9337493accc17-0.
INFO 03-02 01:25:46 [logger.py:42] Received request cmpl-074e1573e9d14f8dae5611686a69ad5c-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:25:46 [async_llm.py:261] Added request cmpl-074e1573e9d14f8dae5611686a69ad5c-0.
INFO 03-02 01:25:47 [logger.py:42] Received request cmpl-64676ce70544472288c40965c44fe6f3-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:25:47 [async_llm.py:261] Added request cmpl-64676ce70544472288c40965c44fe6f3-0.
INFO 03-02 01:25:48 [loggers.py:116] Engine 000: Avg prompt throughput: 27.9 tokens/s, Avg generation throughput: 4.5 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.6%
INFO 03-02 01:25:49 [logger.py:42] Received request cmpl-9116dd35bf494298938da45e1d785b66-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:25:49 [async_llm.py:261] Added request cmpl-9116dd35bf494298938da45e1d785b66-0.
INFO 03-02 01:25:50 [logger.py:42] Received request cmpl-8714ee3c50b94c91a9259c05bc9febf8-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:25:50 [async_llm.py:261] Added request cmpl-8714ee3c50b94c91a9259c05bc9febf8-0.
INFO 03-02 01:25:51 [logger.py:42] Received request cmpl-77ac8fef8ef54cf3b0d9e0e3a5551a62-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:25:51 [async_llm.py:261] Added request cmpl-77ac8fef8ef54cf3b0d9e0e3a5551a62-0.
INFO 03-02 01:25:52 [logger.py:42] Received request cmpl-1d1dc598fa18467ba0fd082e7fa8a249-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:25:52 [async_llm.py:261] Added request cmpl-1d1dc598fa18467ba0fd082e7fa8a249-0.
INFO 03-02 01:25:53 [logger.py:42] Received request cmpl-e568a3795f6c44f5b8f01da9b0ec5393-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:25:53 [async_llm.py:261] Added request cmpl-e568a3795f6c44f5b8f01da9b0ec5393-0.
INFO 03-02 01:25:54 [logger.py:42] Received request cmpl-8fbea648bdf9418cbc69ede617ad4cf4-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:25:54 [async_llm.py:261] Added request cmpl-8fbea648bdf9418cbc69ede617ad4cf4-0.
INFO 03-02 01:25:55 [logger.py:42] Received request cmpl-63909a09858c467897f96bc68e5c1787-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:25:55 [async_llm.py:261] Added request cmpl-63909a09858c467897f96bc68e5c1787-0.
INFO 03-02 01:25:57 [logger.py:42] Received request cmpl-1f2b2a09ef114237a97553324c580e54-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:25:57 [async_llm.py:261] Added request cmpl-1f2b2a09ef114237a97553324c580e54-0.
INFO 03-02 01:25:58 [loggers.py:116] Engine 000: Avg prompt throughput: 24.8 tokens/s, Avg generation throughput: 4.0 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.6%
INFO 03-02 01:25:58 [logger.py:42] Received request cmpl-34ca35259b01477181385283789ed033-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:25:58 [async_llm.py:261] Added request cmpl-34ca35259b01477181385283789ed033-0.
INFO 03-02 01:25:59 [logger.py:42] Received request cmpl-5de3810ff6ea4982ac84985756b323e8-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:25:59 [async_llm.py:261] Added request cmpl-5de3810ff6ea4982ac84985756b323e8-0.
INFO 03-02 01:26:00 [logger.py:42] Received request cmpl-37c92865a88b44cf8f4fa576d43b8468-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:26:00 [async_llm.py:261] Added request cmpl-37c92865a88b44cf8f4fa576d43b8468-0.
INFO 03-02 01:26:01 [logger.py:42] Received request cmpl-916fa2eae3824fcab4143f99ed770c29-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:26:01 [async_llm.py:261] Added request cmpl-916fa2eae3824fcab4143f99ed770c29-0.
INFO 03-02 01:26:02 [logger.py:42] Received request cmpl-7af10ea771c14fe2962f1423a96de227-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:26:02 [async_llm.py:261] Added request cmpl-7af10ea771c14fe2962f1423a96de227-0.
INFO 03-02 01:26:04 [logger.py:42] Received request cmpl-975454f990e74266ae97884b37df1c09-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:26:04 [async_llm.py:261] Added request cmpl-975454f990e74266ae97884b37df1c09-0.
INFO 03-02 01:26:05 [logger.py:42] Received request cmpl-f07203068d7f40e19de42a430d956f35-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:26:05 [async_llm.py:261] Added request cmpl-f07203068d7f40e19de42a430d956f35-0.
INFO 03-02 01:26:06 [logger.py:42] Received request cmpl-224d84f8ac1a40cc91f91a3372bda120-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:26:06 [async_llm.py:261] Added request cmpl-224d84f8ac1a40cc91f91a3372bda120-0.
INFO 03-02 01:26:07 [logger.py:42] Received request cmpl-20babd5542a541718549f744dffe188d-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:26:07 [async_llm.py:261] Added request cmpl-20babd5542a541718549f744dffe188d-0.
INFO 03-02 01:26:08 [loggers.py:116] Engine 000: Avg prompt throughput: 27.9 tokens/s, Avg generation throughput: 4.5 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.6%
INFO 03-02 01:26:08 [logger.py:42] Received request cmpl-3c6a95c126a94b7abc4d55277e49e69e-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:26:08 [async_llm.py:261] Added request cmpl-3c6a95c126a94b7abc4d55277e49e69e-0.
INFO 03-02 01:26:09 [logger.py:42] Received request cmpl-00c5b8ec1397464f9990345fb4d80fa6-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:26:09 [async_llm.py:261] Added request cmpl-00c5b8ec1397464f9990345fb4d80fa6-0.
INFO 03-02 01:26:10 [logger.py:42] Received request cmpl-2f3a8dabb5e94f2f873305be18266d82-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:26:10 [async_llm.py:261] Added request cmpl-2f3a8dabb5e94f2f873305be18266d82-0.
INFO 03-02 01:26:12 [logger.py:42] Received request cmpl-e1b05f72f850471d917e2b39bc98d190-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:26:12 [async_llm.py:261] Added request cmpl-e1b05f72f850471d917e2b39bc98d190-0.
INFO 03-02 01:26:13 [logger.py:42] Received request cmpl-6f92816437fe4ef69492e3578bd3967e-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:26:13 [async_llm.py:261] Added request cmpl-6f92816437fe4ef69492e3578bd3967e-0.
INFO 03-02 01:26:14 [logger.py:42] Received request cmpl-bc1081e9b58e478eb4b67354c3b62e6a-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:26:14 [async_llm.py:261] Added request cmpl-bc1081e9b58e478eb4b67354c3b62e6a-0.
INFO 03-02 01:26:15 [logger.py:42] Received request cmpl-fa0d82b369704e07943353358edf53b4-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:26:15 [async_llm.py:261] Added request cmpl-fa0d82b369704e07943353358edf53b4-0.
INFO 03-02 01:26:16 [logger.py:42] Received request cmpl-a29d147bc10541aeba8cd70042c9dde3-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:26:16 [async_llm.py:261] Added request cmpl-a29d147bc10541aeba8cd70042c9dde3-0.
INFO 03-02 01:26:17 [logger.py:42] Received request cmpl-281d66f6a21a41158ce9299278f0ed76-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:26:17 [async_llm.py:261] Added request cmpl-281d66f6a21a41158ce9299278f0ed76-0.
INFO 03-02 01:26:18 [loggers.py:116] Engine 000: Avg prompt throughput: 27.9 tokens/s, Avg generation throughput: 4.5 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.6%
INFO 03-02 01:26:19 [logger.py:42] Received request cmpl-1fc5cb595eb54c46850ebe156121e5ca-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:26:19 [async_llm.py:261] Added request cmpl-1fc5cb595eb54c46850ebe156121e5ca-0.
INFO 03-02 01:26:20 [logger.py:42] Received request cmpl-9bfbfce191f846878e1220259c7e9e5d-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:26:20 [async_llm.py:261] Added request cmpl-9bfbfce191f846878e1220259c7e9e5d-0.
INFO 03-02 01:26:21 [logger.py:42] Received request cmpl-e3a4cc4935a94068832abfaa7276fc04-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:26:21 [async_llm.py:261] Added request cmpl-e3a4cc4935a94068832abfaa7276fc04-0.
INFO 03-02 01:26:22 [logger.py:42] Received request cmpl-8b620bf2fe744851bad045e857562ef8-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:26:22 [async_llm.py:261] Added request cmpl-8b620bf2fe744851bad045e857562ef8-0.
INFO 03-02 01:26:23 [logger.py:42] Received request cmpl-e35ccc9c7bfa426fb4791ab52b92cf4b-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:26:23 [async_llm.py:261] Added request cmpl-e35ccc9c7bfa426fb4791ab52b92cf4b-0.
INFO 03-02 01:26:24 [logger.py:42] Received request cmpl-9d1b7d062c9a4ec88d7640d36e55fa94-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:26:24 [async_llm.py:261] Added request cmpl-9d1b7d062c9a4ec88d7640d36e55fa94-0.
INFO 03-02 01:26:25 [logger.py:42] Received request cmpl-88ef86cbb0364e829a670c26be77c0b7-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:26:25 [async_llm.py:261] Added request cmpl-88ef86cbb0364e829a670c26be77c0b7-0.
INFO 03-02 01:26:27 [logger.py:42] Received request cmpl-dfa7012197b842d8bd1a21803d0318fb-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:26:27 [async_llm.py:261] Added request cmpl-dfa7012197b842d8bd1a21803d0318fb-0.
INFO 03-02 01:26:28 [loggers.py:116] Engine 000: Avg prompt throughput: 24.8 tokens/s, Avg generation throughput: 4.0 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.6%
INFO 03-02 01:26:28 [logger.py:42] Received request cmpl-f718ca08a2a74142a289a1e0339bde47-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:26:28 [async_llm.py:261] Added request cmpl-f718ca08a2a74142a289a1e0339bde47-0.
INFO 03-02 01:26:29 [logger.py:42] Received request cmpl-556e229c00224876b1873763563c02c7-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:26:29 [async_llm.py:261] Added request cmpl-556e229c00224876b1873763563c02c7-0.
INFO 03-02 01:26:30 [logger.py:42] Received request cmpl-19b8a872020545eca64307f2454ae446-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:26:30 [async_llm.py:261] Added request cmpl-19b8a872020545eca64307f2454ae446-0.
INFO 03-02 01:26:31 [logger.py:42] Received request cmpl-410e52dc419e4bf1aebfed03e322e8aa-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:26:31 [async_llm.py:261] Added request cmpl-410e52dc419e4bf1aebfed03e322e8aa-0.
INFO 03-02 01:26:32 [logger.py:42] Received request cmpl-a7946a0bb8284f34bfefa561b04c2129-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:26:32 [async_llm.py:261] Added request cmpl-a7946a0bb8284f34bfefa561b04c2129-0.
INFO 03-02 01:26:34 [logger.py:42] Received request cmpl-68f4dd711cdc4338abafc9710112431b-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:26:34 [async_llm.py:261] Added request cmpl-68f4dd711cdc4338abafc9710112431b-0.
INFO 03-02 01:26:35 [logger.py:42] Received request cmpl-adefe098efe347ba9a033bf651b5aa01-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:26:35 [async_llm.py:261] Added request cmpl-adefe098efe347ba9a033bf651b5aa01-0.
INFO 03-02 01:26:36 [logger.py:42] Received request cmpl-b9e026822834420f9b221a79a970313b-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:26:36 [async_llm.py:261] Added request cmpl-b9e026822834420f9b221a79a970313b-0.
INFO 03-02 01:26:37 [logger.py:42] Received request cmpl-737b5a87e11a410da1096124fb364313-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:26:37 [async_llm.py:261] Added request cmpl-737b5a87e11a410da1096124fb364313-0.
INFO 03-02 01:26:38 [loggers.py:116] Engine 000: Avg prompt throughput: 27.9 tokens/s, Avg generation throughput: 4.5 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.6%
INFO 03-02 01:26:38 [logger.py:42] Received request cmpl-3d74eda660eb4087ac6b740db90ec96e-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:26:38 [async_llm.py:261] Added request cmpl-3d74eda660eb4087ac6b740db90ec96e-0.
INFO 03-02 01:26:39 [logger.py:42] Received request cmpl-0a5bcc3e5b0b4434bff1bb48c16f4a79-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:26:39 [async_llm.py:261] Added request cmpl-0a5bcc3e5b0b4434bff1bb48c16f4a79-0.
INFO 03-02 01:26:40 [logger.py:42] Received request cmpl-13473431d597493f9a7a9120ce050091-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:26:40 [async_llm.py:261] Added request cmpl-13473431d597493f9a7a9120ce050091-0.
INFO 03-02 01:26:42 [logger.py:42] Received request cmpl-534ff3035b38464190f1ff32f0993208-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:26:42 [async_llm.py:261] Added request cmpl-534ff3035b38464190f1ff32f0993208-0.
INFO 03-02 01:26:43 [logger.py:42] Received request cmpl-767b50f231434b5da64f09ef20d87a3a-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:26:43 [async_llm.py:261] Added request cmpl-767b50f231434b5da64f09ef20d87a3a-0.
INFO 03-02 01:26:44 [logger.py:42] Received request cmpl-46579a3eff964ecb90e3e1c2c56c80ab-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:26:44 [async_llm.py:261] Added request cmpl-46579a3eff964ecb90e3e1c2c56c80ab-0.
INFO 03-02 01:26:45 [logger.py:42] Received request cmpl-0cc0e51f2f96401480c823c956da91fe-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:26:45 [async_llm.py:261] Added request cmpl-0cc0e51f2f96401480c823c956da91fe-0.
INFO 03-02 01:26:46 [logger.py:42] Received request cmpl-323033810c4443c0b8d8de6bb14b1999-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:26:46 [async_llm.py:261] Added request cmpl-323033810c4443c0b8d8de6bb14b1999-0.
INFO 03-02 01:26:47 [logger.py:42] Received request cmpl-b0f49a199c084a1388bc35bad272de4a-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:26:47 [async_llm.py:261] Added request cmpl-b0f49a199c084a1388bc35bad272de4a-0.
INFO 03-02 01:26:48 [loggers.py:116] Engine 000: Avg prompt throughput: 27.9 tokens/s, Avg generation throughput: 4.5 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.6%
INFO 03-02 01:26:49 [logger.py:42] Received request cmpl-dba850a2cd7945a49b855a97bdcc14b9-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:26:49 [async_llm.py:261] Added request cmpl-dba850a2cd7945a49b855a97bdcc14b9-0.
INFO 03-02 01:26:50 [logger.py:42] Received request cmpl-3cdaca3205bc4185a564ebf23c99f237-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:26:50 [async_llm.py:261] Added request cmpl-3cdaca3205bc4185a564ebf23c99f237-0.
INFO 03-02 01:26:51 [logger.py:42] Received request cmpl-066c194719ac414bba2f98b4f3fd1bfe-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:26:51 [async_llm.py:261] Added request cmpl-066c194719ac414bba2f98b4f3fd1bfe-0.
INFO 03-02 01:26:52 [logger.py:42] Received request cmpl-b6c9bf252bb74d138af64f2b0067606a-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:26:52 [async_llm.py:261] Added request cmpl-b6c9bf252bb74d138af64f2b0067606a-0.
INFO 03-02 01:26:53 [logger.py:42] Received request cmpl-3aabe438b487465e87958ef470c6d11a-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:26:53 [async_llm.py:261] Added request cmpl-3aabe438b487465e87958ef470c6d11a-0.
INFO 03-02 01:26:54 [logger.py:42] Received request cmpl-c105fd01d3f14c65af5bf633cb7254af-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:26:54 [async_llm.py:261] Added request cmpl-c105fd01d3f14c65af5bf633cb7254af-0.
INFO 03-02 01:26:55 [logger.py:42] Received request cmpl-a878da647920419c8f0e166c5eca6441-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:26:55 [async_llm.py:261] Added request cmpl-a878da647920419c8f0e166c5eca6441-0.
INFO 03-02 01:26:57 [logger.py:42] Received request cmpl-fdd9e25789ee4c8cb001d587dc1bc4c4-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:26:57 [async_llm.py:261] Added request cmpl-fdd9e25789ee4c8cb001d587dc1bc4c4-0.
INFO 03-02 01:26:58 [loggers.py:116] Engine 000: Avg prompt throughput: 24.8 tokens/s, Avg generation throughput: 4.0 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.6%
INFO 03-02 01:26:58 [logger.py:42] Received request cmpl-cf48c3226cf44ac3b4a729884f709cb8-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:26:58 [async_llm.py:261] Added request cmpl-cf48c3226cf44ac3b4a729884f709cb8-0.
INFO 03-02 01:26:59 [logger.py:42] Received request cmpl-a209e3d7f9764ccc9e7e665772d6bf22-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:26:59 [async_llm.py:261] Added request cmpl-a209e3d7f9764ccc9e7e665772d6bf22-0.
INFO 03-02 01:27:00 [logger.py:42] Received request cmpl-d2f5d662e4de4400b1642ab5096d9f84-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:27:00 [async_llm.py:261] Added request cmpl-d2f5d662e4de4400b1642ab5096d9f84-0.
INFO 03-02 01:27:01 [logger.py:42] Received request cmpl-7073f1bb3d2d4928b424a4eb9247ffa3-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:27:01 [async_llm.py:261] Added request cmpl-7073f1bb3d2d4928b424a4eb9247ffa3-0.
INFO 03-02 01:27:02 [logger.py:42] Received request cmpl-03f8ce8153ed4b009c9388f03b4424ec-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:27:02 [async_llm.py:261] Added request cmpl-03f8ce8153ed4b009c9388f03b4424ec-0.
INFO 03-02 01:27:04 [logger.py:42] Received request cmpl-ef6dff7dd2ed45ca97c6a6dd504a3363-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:27:04 [async_llm.py:261] Added request cmpl-ef6dff7dd2ed45ca97c6a6dd504a3363-0.
INFO 03-02 01:27:05 [logger.py:42] Received request cmpl-3d07dda678cd4d86b658f8b4e1a53f1a-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:27:05 [async_llm.py:261] Added request cmpl-3d07dda678cd4d86b658f8b4e1a53f1a-0.
INFO 03-02 01:27:06 [logger.py:42] Received request cmpl-102e45c1e173466793b6a8774a17c8f5-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:27:06 [async_llm.py:261] Added request cmpl-102e45c1e173466793b6a8774a17c8f5-0.
INFO 03-02 01:27:07 [logger.py:42] Received request cmpl-05734b4ad88f47f9b6523c0606664330-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:27:07 [async_llm.py:261] Added request cmpl-05734b4ad88f47f9b6523c0606664330-0.
INFO 03-02 01:27:08 [loggers.py:116] Engine 000: Avg prompt throughput: 27.9 tokens/s, Avg generation throughput: 4.5 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.6%
INFO 03-02 01:27:08 [logger.py:42] Received request cmpl-78b792b575d6435091a90a7811668997-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:27:08 [async_llm.py:261] Added request cmpl-78b792b575d6435091a90a7811668997-0.
INFO 03-02 01:27:09 [logger.py:42] Received request cmpl-18bd4a7a608d44c98d8bfdce018b9005-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:27:09 [async_llm.py:261] Added request cmpl-18bd4a7a608d44c98d8bfdce018b9005-0.
INFO 03-02 01:27:10 [logger.py:42] Received request cmpl-0abe31518595451791bce95eb3c62fa3-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:27:10 [async_llm.py:261] Added request cmpl-0abe31518595451791bce95eb3c62fa3-0.
INFO 03-02 01:27:12 [logger.py:42] Received request cmpl-7e6c58ea83bc42908db287846b86c7e2-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:27:12 [async_llm.py:261] Added request cmpl-7e6c58ea83bc42908db287846b86c7e2-0.
INFO 03-02 01:27:13 [logger.py:42] Received request cmpl-c0dce6ab4c4d40499db9d0643984fb67-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:27:13 [async_llm.py:261] Added request cmpl-c0dce6ab4c4d40499db9d0643984fb67-0.
INFO 03-02 01:27:14 [logger.py:42] Received request cmpl-85f2df0e52ea435496f1c87a48c8b4bc-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:27:14 [async_llm.py:261] Added request cmpl-85f2df0e52ea435496f1c87a48c8b4bc-0.
INFO 03-02 01:27:15 [logger.py:42] Received request cmpl-9e76da5ab1524c26a548bebff5d89e5b-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:27:15 [async_llm.py:261] Added request cmpl-9e76da5ab1524c26a548bebff5d89e5b-0.
INFO 03-02 01:27:16 [logger.py:42] Received request cmpl-c689535f879748808912eb8665bbdc84-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:27:16 [async_llm.py:261] Added request cmpl-c689535f879748808912eb8665bbdc84-0.
INFO 03-02 01:27:17 [logger.py:42] Received request cmpl-98b5dc6d313045089588984205720676-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:27:17 [async_llm.py:261] Added request cmpl-98b5dc6d313045089588984205720676-0.
INFO 03-02 01:27:18 [loggers.py:116] Engine 000: Avg prompt throughput: 27.9 tokens/s, Avg generation throughput: 4.5 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.6%
INFO 03-02 01:27:19 [logger.py:42] Received request cmpl-7b4d17f742b74bccaa27724d1f46e893-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:27:19 [async_llm.py:261] Added request cmpl-7b4d17f742b74bccaa27724d1f46e893-0.
INFO 03-02 01:27:20 [logger.py:42] Received request cmpl-d6b217eceeb04f0aa32a9d4496942958-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:27:20 [async_llm.py:261] Added request cmpl-d6b217eceeb04f0aa32a9d4496942958-0.
INFO 03-02 01:27:21 [logger.py:42] Received request cmpl-07373f38be674c3183cef24d71045c1d-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:27:21 [async_llm.py:261] Added request cmpl-07373f38be674c3183cef24d71045c1d-0.
INFO 03-02 01:27:22 [logger.py:42] Received request cmpl-e1df0e3a346745799d837c0ae6646aff-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:27:22 [async_llm.py:261] Added request cmpl-e1df0e3a346745799d837c0ae6646aff-0.
INFO 03-02 01:27:23 [logger.py:42] Received request cmpl-fa77d3c6a9ba4c74bd1184c9c0b906d7-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:27:23 [async_llm.py:261] Added request cmpl-fa77d3c6a9ba4c74bd1184c9c0b906d7-0.
INFO 03-02 01:27:24 [logger.py:42] Received request cmpl-32d61fa1d42d441485be70140a4197ad-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:27:24 [async_llm.py:261] Added request cmpl-32d61fa1d42d441485be70140a4197ad-0.
INFO 03-02 01:27:25 [logger.py:42] Received request cmpl-832a7994e00b4c68b3dc2576b17def6b-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:27:25 [async_llm.py:261] Added request cmpl-832a7994e00b4c68b3dc2576b17def6b-0.
INFO 03-02 01:27:27 [logger.py:42] Received request cmpl-393493610ed444c3b27fe7931c7a4caa-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:27:27 [async_llm.py:261] Added request cmpl-393493610ed444c3b27fe7931c7a4caa-0.
INFO 03-02 01:27:28 [loggers.py:116] Engine 000: Avg prompt throughput: 24.8 tokens/s, Avg generation throughput: 4.0 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.6%
INFO 03-02 01:27:28 [logger.py:42] Received request cmpl-e0f43833cd8549a79af7363031ef0f3a-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:27:28 [async_llm.py:261] Added request cmpl-e0f43833cd8549a79af7363031ef0f3a-0.
INFO 03-02 01:27:29 [logger.py:42] Received request cmpl-4ffc651f7cd64ee49db91018338c14b5-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:27:29 [async_llm.py:261] Added request cmpl-4ffc651f7cd64ee49db91018338c14b5-0.
INFO 03-02 01:27:30 [logger.py:42] Received request cmpl-f8575a8c23a2419e90d99e83403f39c8-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:27:30 [async_llm.py:261] Added request cmpl-f8575a8c23a2419e90d99e83403f39c8-0.
INFO 03-02 01:27:31 [logger.py:42] Received request cmpl-0e12793c3a2147bead0d3394132f1e17-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:27:31 [async_llm.py:261] Added request cmpl-0e12793c3a2147bead0d3394132f1e17-0.
INFO 03-02 01:27:32 [logger.py:42] Received request cmpl-a8525dee082a4b8e8608bf8622d1a9d5-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:27:32 [async_llm.py:261] Added request cmpl-a8525dee082a4b8e8608bf8622d1a9d5-0.
INFO 03-02 01:27:34 [logger.py:42] Received request cmpl-84d8d79f6c3c425e8b0e0fc348c84b00-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:27:34 [async_llm.py:261] Added request cmpl-84d8d79f6c3c425e8b0e0fc348c84b00-0.
INFO 03-02 01:27:35 [logger.py:42] Received request cmpl-d91e7f94ab984157b05c6a4df8724730-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:27:35 [async_llm.py:261] Added request cmpl-d91e7f94ab984157b05c6a4df8724730-0.
INFO 03-02 01:27:36 [logger.py:42] Received request cmpl-300e76eda0294c718311b77658ff18b9-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:27:36 [async_llm.py:261] Added request cmpl-300e76eda0294c718311b77658ff18b9-0.
INFO 03-02 01:27:37 [logger.py:42] Received request cmpl-c395095034a04646849708a57900159d-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:27:37 [async_llm.py:261] Added request cmpl-c395095034a04646849708a57900159d-0.
INFO 03-02 01:27:38 [loggers.py:116] Engine 000: Avg prompt throughput: 27.9 tokens/s, Avg generation throughput: 4.5 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.6%
INFO 03-02 01:27:38 [logger.py:42] Received request cmpl-f055745756334a3cba52f1d9c8840dbd-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:27:38 [async_llm.py:261] Added request cmpl-f055745756334a3cba52f1d9c8840dbd-0.
INFO 03-02 01:27:39 [logger.py:42] Received request cmpl-8518ac47cd73444c96d2ca380e2ead6d-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:27:39 [async_llm.py:261] Added request cmpl-8518ac47cd73444c96d2ca380e2ead6d-0.
INFO 03-02 01:27:40 [logger.py:42] Received request cmpl-5f9c08c0e7f64c4489c48fc2d7730031-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:27:40 [async_llm.py:261] Added request cmpl-5f9c08c0e7f64c4489c48fc2d7730031-0.
INFO 03-02 01:27:42 [logger.py:42] Received request cmpl-8bd2a38a8b5546f7b7d4e78089369876-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:27:42 [async_llm.py:261] Added request cmpl-8bd2a38a8b5546f7b7d4e78089369876-0.
INFO 03-02 01:27:43 [logger.py:42] Received request cmpl-396358fcf4404a7486bfce78cfd13c6c-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:27:43 [async_llm.py:261] Added request cmpl-396358fcf4404a7486bfce78cfd13c6c-0.
INFO 03-02 01:27:44 [logger.py:42] Received request cmpl-086eaef2479e474887a742dbddb6f20a-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:27:44 [async_llm.py:261] Added request cmpl-086eaef2479e474887a742dbddb6f20a-0.
INFO 03-02 01:27:45 [logger.py:42] Received request cmpl-4c3366cb27bc4135ae706616ea1e6079-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:27:45 [async_llm.py:261] Added request cmpl-4c3366cb27bc4135ae706616ea1e6079-0.
INFO 03-02 01:27:46 [logger.py:42] Received request cmpl-c013a032812b47839f896e34987cb961-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:27:46 [async_llm.py:261] Added request cmpl-c013a032812b47839f896e34987cb961-0.
INFO 03-02 01:27:47 [logger.py:42] Received request cmpl-9a9b264f8791469e9162d884e57fd302-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:27:47 [async_llm.py:261] Added request cmpl-9a9b264f8791469e9162d884e57fd302-0.
INFO 03-02 01:27:48 [loggers.py:116] Engine 000: Avg prompt throughput: 27.9 tokens/s, Avg generation throughput: 4.5 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.6%
INFO 03-02 01:27:49 [logger.py:42] Received request cmpl-59664f6ac6ef42c7a2fbdcbdb078c713-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:27:49 [async_llm.py:261] Added request cmpl-59664f6ac6ef42c7a2fbdcbdb078c713-0.
INFO 03-02 01:27:50 [logger.py:42] Received request cmpl-2dfd9b59d6ba49ef9607efaad8607151-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:27:50 [async_llm.py:261] Added request cmpl-2dfd9b59d6ba49ef9607efaad8607151-0.
INFO 03-02 01:27:51 [logger.py:42] Received request cmpl-784efb001daf44249eafd46c9255a55f-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:27:51 [async_llm.py:261] Added request cmpl-784efb001daf44249eafd46c9255a55f-0.
INFO 03-02 01:27:52 [logger.py:42] Received request cmpl-7fc63a03c2e24af6a081939a64854caf-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:27:52 [async_llm.py:261] Added request cmpl-7fc63a03c2e24af6a081939a64854caf-0.
INFO 03-02 01:27:53 [logger.py:42] Received request cmpl-b16360deeeec420e9cd27e0fd5df0d5f-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:27:53 [async_llm.py:261] Added request cmpl-b16360deeeec420e9cd27e0fd5df0d5f-0.
INFO 03-02 01:27:54 [logger.py:42] Received request cmpl-39814a560749430c9eb6ccdfdbe4d01c-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:27:54 [async_llm.py:261] Added request cmpl-39814a560749430c9eb6ccdfdbe4d01c-0.
INFO 03-02 01:27:55 [logger.py:42] Received request cmpl-47a3c0c46bd94c6b9b81309cfd83e8f9-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:27:55 [async_llm.py:261] Added request cmpl-47a3c0c46bd94c6b9b81309cfd83e8f9-0.
INFO 03-02 01:27:57 [logger.py:42] Received request cmpl-c1a1f19db18d4d7094f9e97e5a0ebf8e-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:27:57 [async_llm.py:261] Added request cmpl-c1a1f19db18d4d7094f9e97e5a0ebf8e-0.
INFO 03-02 01:27:58 [loggers.py:116] Engine 000: Avg prompt throughput: 24.8 tokens/s, Avg generation throughput: 4.0 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.6%
INFO 03-02 01:27:58 [logger.py:42] Received request cmpl-9b75d07f219a4f1ab84b59f477e07bd9-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:27:58 [async_llm.py:261] Added request cmpl-9b75d07f219a4f1ab84b59f477e07bd9-0.
INFO 03-02 01:27:59 [logger.py:42] Received request cmpl-17166de0af9446adaed151ac471dcefa-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:27:59 [async_llm.py:261] Added request cmpl-17166de0af9446adaed151ac471dcefa-0.
INFO 03-02 01:28:00 [logger.py:42] Received request cmpl-1fee16c304aa4aa884c8746bee57b6cd-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:28:00 [async_llm.py:261] Added request cmpl-1fee16c304aa4aa884c8746bee57b6cd-0.
INFO 03-02 01:28:01 [logger.py:42] Received request cmpl-044d2b3e4cf442db933fde67b274dec2-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:28:01 [async_llm.py:261] Added request cmpl-044d2b3e4cf442db933fde67b274dec2-0.
INFO 03-02 01:28:02 [logger.py:42] Received request cmpl-9a417b06538b445e8871ad0872e6211b-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:28:02 [async_llm.py:261] Added request cmpl-9a417b06538b445e8871ad0872e6211b-0.
INFO 03-02 01:28:04 [logger.py:42] Received request cmpl-9cf0c0c0621b42c7b425a49eaa32a0a4-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:28:04 [async_llm.py:261] Added request cmpl-9cf0c0c0621b42c7b425a49eaa32a0a4-0.
INFO 03-02 01:28:05 [logger.py:42] Received request cmpl-14146a11988d4eaf90ae876001c54897-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:28:05 [async_llm.py:261] Added request cmpl-14146a11988d4eaf90ae876001c54897-0.
INFO 03-02 01:28:06 [logger.py:42] Received request cmpl-83e1dba20db84cb8b58ccfaea55f0a0e-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:28:06 [async_llm.py:261] Added request cmpl-83e1dba20db84cb8b58ccfaea55f0a0e-0.
INFO 03-02 01:28:07 [logger.py:42] Received request cmpl-2b2e9e222280447e98107017e7af696c-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:28:07 [async_llm.py:261] Added request cmpl-2b2e9e222280447e98107017e7af696c-0.
INFO 03-02 01:28:08 [loggers.py:116] Engine 000: Avg prompt throughput: 27.9 tokens/s, Avg generation throughput: 4.5 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.6%
INFO 03-02 01:28:08 [logger.py:42] Received request cmpl-b010c5f488a94f1b97528c8d54547409-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:28:08 [async_llm.py:261] Added request cmpl-b010c5f488a94f1b97528c8d54547409-0.
INFO 03-02 01:28:09 [logger.py:42] Received request cmpl-540047567fc347fc822aa34fbd03934e-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:28:09 [async_llm.py:261] Added request cmpl-540047567fc347fc822aa34fbd03934e-0.
INFO 03-02 01:28:10 [logger.py:42] Received request cmpl-3fda8bcdc80d45c196a809fc6ef78786-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:28:10 [async_llm.py:261] Added request cmpl-3fda8bcdc80d45c196a809fc6ef78786-0.
INFO 03-02 01:28:12 [logger.py:42] Received request cmpl-f1a2ed8a6fce40699d5cd05943ddf789-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:28:12 [async_llm.py:261] Added request cmpl-f1a2ed8a6fce40699d5cd05943ddf789-0.
INFO 03-02 01:28:13 [logger.py:42] Received request cmpl-794c543554a944ce96c477ecc73edbf5-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:28:13 [async_llm.py:261] Added request cmpl-794c543554a944ce96c477ecc73edbf5-0.
INFO 03-02 01:28:14 [logger.py:42] Received request cmpl-e9087756382e4b78a199baccedf67103-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:28:14 [async_llm.py:261] Added request cmpl-e9087756382e4b78a199baccedf67103-0.
INFO 03-02 01:28:15 [logger.py:42] Received request cmpl-53a8c502d78f4168b20a91ecc4fd0321-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:28:15 [async_llm.py:261] Added request cmpl-53a8c502d78f4168b20a91ecc4fd0321-0.
INFO 03-02 01:28:16 [logger.py:42] Received request cmpl-140dde62d528401b8dbb890e50d8bd8f-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:28:16 [async_llm.py:261] Added request cmpl-140dde62d528401b8dbb890e50d8bd8f-0.
INFO 03-02 01:28:17 [logger.py:42] Received request cmpl-85eee071f2d44eb4a1487bff509ff610-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:28:17 [async_llm.py:261] Added request cmpl-85eee071f2d44eb4a1487bff509ff610-0.
INFO 03-02 01:28:18 [loggers.py:116] Engine 000: Avg prompt throughput: 27.9 tokens/s, Avg generation throughput: 4.5 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.6%
INFO 03-02 01:28:19 [logger.py:42] Received request cmpl-fb769796257c498a8b6f0d90baf314e6-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:28:19 [async_llm.py:261] Added request cmpl-fb769796257c498a8b6f0d90baf314e6-0.
INFO 03-02 01:28:20 [logger.py:42] Received request cmpl-9a2b4f6fd6714e30b79f3bd9c452354d-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:28:20 [async_llm.py:261] Added request cmpl-9a2b4f6fd6714e30b79f3bd9c452354d-0.
INFO 03-02 01:28:21 [logger.py:42] Received request cmpl-878c43b8f4c24a28b8d017d187b2842d-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:28:21 [async_llm.py:261] Added request cmpl-878c43b8f4c24a28b8d017d187b2842d-0.
INFO 03-02 01:28:22 [logger.py:42] Received request cmpl-04febe1bf7ba4d9a9f59aa27ea74f0a1-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:28:22 [async_llm.py:261] Added request cmpl-04febe1bf7ba4d9a9f59aa27ea74f0a1-0.
INFO 03-02 01:28:23 [logger.py:42] Received request cmpl-d05b000b521a4902978d712ea941f7c1-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:28:23 [async_llm.py:261] Added request cmpl-d05b000b521a4902978d712ea941f7c1-0.
INFO 03-02 01:28:24 [logger.py:42] Received request cmpl-704d7c64e4d044928047f3bb87ad0d37-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:28:24 [async_llm.py:261] Added request cmpl-704d7c64e4d044928047f3bb87ad0d37-0.
INFO 03-02 01:28:25 [logger.py:42] Received request cmpl-1003f0441d224c54b198ca27073018db-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:28:25 [async_llm.py:261] Added request cmpl-1003f0441d224c54b198ca27073018db-0.
INFO 03-02 01:28:27 [logger.py:42] Received request cmpl-f7767f1d1e244bd89876b5857b976d42-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:28:27 [async_llm.py:261] Added request cmpl-f7767f1d1e244bd89876b5857b976d42-0.
INFO 03-02 01:28:28 [loggers.py:116] Engine 000: Avg prompt throughput: 24.8 tokens/s, Avg generation throughput: 4.0 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.6%
INFO 03-02 01:28:28 [logger.py:42] Received request cmpl-9b361d34d7314005ab95f8275fc32495-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:28:28 [async_llm.py:261] Added request cmpl-9b361d34d7314005ab95f8275fc32495-0.
INFO 03-02 01:28:29 [logger.py:42] Received request cmpl-58eeee77948c44b189906499ec7e9325-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:28:29 [async_llm.py:261] Added request cmpl-58eeee77948c44b189906499ec7e9325-0.
INFO 03-02 01:28:30 [logger.py:42] Received request cmpl-0f45e423b5fb478d8fc71745b74a053b-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:28:30 [async_llm.py:261] Added request cmpl-0f45e423b5fb478d8fc71745b74a053b-0.
INFO 03-02 01:28:31 [logger.py:42] Received request cmpl-cc834c2e25fd4e4fb49218cd1aea9857-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:28:31 [async_llm.py:261] Added request cmpl-cc834c2e25fd4e4fb49218cd1aea9857-0.
INFO 03-02 01:28:32 [logger.py:42] Received request cmpl-d28c08dda9d54e349c500422d8f0f0db-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:28:32 [async_llm.py:261] Added request cmpl-d28c08dda9d54e349c500422d8f0f0db-0.
INFO 03-02 01:28:34 [logger.py:42] Received request cmpl-f6d359a69ccc4ea8b6998227f794a31b-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:28:34 [async_llm.py:261] Added request cmpl-f6d359a69ccc4ea8b6998227f794a31b-0.
INFO 03-02 01:28:35 [logger.py:42] Received request cmpl-676e339482c9403b9200e1b1b2538c1c-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:28:35 [async_llm.py:261] Added request cmpl-676e339482c9403b9200e1b1b2538c1c-0.
INFO 03-02 01:28:36 [logger.py:42] Received request cmpl-84f84a1f89de42acbae61d4436d11a42-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:28:36 [async_llm.py:261] Added request cmpl-84f84a1f89de42acbae61d4436d11a42-0.
INFO 03-02 01:28:37 [logger.py:42] Received request cmpl-e6a22aa5980b46dfb1c94a7e4dc383b4-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:28:37 [async_llm.py:261] Added request cmpl-e6a22aa5980b46dfb1c94a7e4dc383b4-0.
INFO 03-02 01:28:38 [loggers.py:116] Engine 000: Avg prompt throughput: 27.9 tokens/s, Avg generation throughput: 4.5 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.6%
INFO 03-02 01:28:38 [logger.py:42] Received request cmpl-277bd75188e64d3f94480df627ea33d1-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:28:38 [async_llm.py:261] Added request cmpl-277bd75188e64d3f94480df627ea33d1-0.
INFO 03-02 01:28:39 [logger.py:42] Received request cmpl-b648d85c713a4dcb91cafc059a460d29-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:28:39 [async_llm.py:261] Added request cmpl-b648d85c713a4dcb91cafc059a460d29-0.
INFO 03-02 01:28:40 [logger.py:42] Received request cmpl-f70cd916882047f99b51ad1aa0209910-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:28:40 [async_llm.py:261] Added request cmpl-f70cd916882047f99b51ad1aa0209910-0.
INFO 03-02 01:28:42 [logger.py:42] Received request cmpl-d9f36e026ec64dd08f706a3a8a6077a0-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:28:42 [async_llm.py:261] Added request cmpl-d9f36e026ec64dd08f706a3a8a6077a0-0.
INFO 03-02 01:28:43 [logger.py:42] Received request cmpl-534b8923030447ba859c24aa1ccc8129-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:28:43 [async_llm.py:261] Added request cmpl-534b8923030447ba859c24aa1ccc8129-0.
INFO 03-02 01:28:44 [logger.py:42] Received request cmpl-ed54eceb910b4ec89f45ebcdb5340f30-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:28:44 [async_llm.py:261] Added request cmpl-ed54eceb910b4ec89f45ebcdb5340f30-0.
INFO 03-02 01:28:45 [logger.py:42] Received request cmpl-6e008677f1c94eb2ad561c03bea849ff-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:28:45 [async_llm.py:261] Added request cmpl-6e008677f1c94eb2ad561c03bea849ff-0.
INFO 03-02 01:28:46 [logger.py:42] Received request cmpl-4fa37ae0abbf4c389004ba73ee4ea87f-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:28:46 [async_llm.py:261] Added request cmpl-4fa37ae0abbf4c389004ba73ee4ea87f-0.
INFO 03-02 01:28:47 [logger.py:42] Received request cmpl-8dac41b707b14912bf867c0807cf84a3-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:28:47 [async_llm.py:261] Added request cmpl-8dac41b707b14912bf867c0807cf84a3-0.
INFO 03-02 01:28:48 [loggers.py:116] Engine 000: Avg prompt throughput: 27.9 tokens/s, Avg generation throughput: 4.5 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.6%
INFO 03-02 01:28:48 [logger.py:42] Received request cmpl-ec83a1c851974efa8c5694cb7ad098b7-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:28:48 [async_llm.py:261] Added request cmpl-ec83a1c851974efa8c5694cb7ad098b7-0.
INFO 03-02 01:28:50 [logger.py:42] Received request cmpl-dd40bd7ebcdb48da911eb16ee17185ae-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:28:50 [async_llm.py:261] Added request cmpl-dd40bd7ebcdb48da911eb16ee17185ae-0.
INFO 03-02 01:28:51 [logger.py:42] Received request cmpl-21d9fcb473334f759dc4a4d24442205a-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:28:51 [async_llm.py:261] Added request cmpl-21d9fcb473334f759dc4a4d24442205a-0.
INFO 03-02 01:28:52 [logger.py:42] Received request cmpl-9e706896b74a4a3eb616c6aff4ee6758-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:28:52 [async_llm.py:261] Added request cmpl-9e706896b74a4a3eb616c6aff4ee6758-0.
INFO 03-02 01:28:53 [logger.py:42] Received request cmpl-500b169106184d29993c2b8a8efdd723-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:28:53 [async_llm.py:261] Added request cmpl-500b169106184d29993c2b8a8efdd723-0.
INFO 03-02 01:28:54 [logger.py:42] Received request cmpl-c3d922e714544f04a0e2e78fea39eb30-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:28:54 [async_llm.py:261] Added request cmpl-c3d922e714544f04a0e2e78fea39eb30-0.
INFO 03-02 01:28:55 [logger.py:42] Received request cmpl-b1ed8c2aa1244be393b792bedb12a4dd-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:28:55 [async_llm.py:261] Added request cmpl-b1ed8c2aa1244be393b792bedb12a4dd-0.
INFO 03-02 01:28:57 [logger.py:42] Received request cmpl-5ae1626dc4a447c5a4ea6b1558e3a375-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:28:57 [async_llm.py:261] Added request cmpl-5ae1626dc4a447c5a4ea6b1558e3a375-0.
INFO 03-02 01:28:58 [loggers.py:116] Engine 000: Avg prompt throughput: 24.8 tokens/s, Avg generation throughput: 4.0 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.6%
INFO 03-02 01:28:58 [logger.py:42] Received request cmpl-0ee6aa8e2eba4fa0bdd5799efacc2b6a-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:28:58 [async_llm.py:261] Added request cmpl-0ee6aa8e2eba4fa0bdd5799efacc2b6a-0.
INFO 03-02 01:28:59 [logger.py:42] Received request cmpl-651722f2f6a7441ea23431fba302a728-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:28:59 [async_llm.py:261] Added request cmpl-651722f2f6a7441ea23431fba302a728-0.
INFO 03-02 01:29:00 [logger.py:42] Received request cmpl-8d435b45feae4b57993ebd7d1e1ddbc3-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:29:00 [async_llm.py:261] Added request cmpl-8d435b45feae4b57993ebd7d1e1ddbc3-0.
INFO 03-02 01:29:01 [logger.py:42] Received request cmpl-4fa9faab680942ff90fe3dad6fadd2e8-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:29:01 [async_llm.py:261] Added request cmpl-4fa9faab680942ff90fe3dad6fadd2e8-0.
INFO 03-02 01:29:02 [logger.py:42] Received request cmpl-f781ae1580114b7cb41e7bbe7387fb29-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:29:02 [async_llm.py:261] Added request cmpl-f781ae1580114b7cb41e7bbe7387fb29-0.
INFO 03-02 01:29:03 [logger.py:42] Received request cmpl-6b3241adf7174009b8442a84a33b2631-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:29:03 [async_llm.py:261] Added request cmpl-6b3241adf7174009b8442a84a33b2631-0.
INFO 03-02 01:29:05 [logger.py:42] Received request cmpl-9976431e83344853bc835212c796e8e3-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:29:05 [async_llm.py:261] Added request cmpl-9976431e83344853bc835212c796e8e3-0.
INFO 03-02 01:29:06 [logger.py:42] Received request cmpl-cc4372a43a2840bbbc1716429cc32028-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:29:06 [async_llm.py:261] Added request cmpl-cc4372a43a2840bbbc1716429cc32028-0.
INFO 03-02 01:29:07 [logger.py:42] Received request cmpl-ea0607d41fc943f0a268e3ec314e5503-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:29:07 [async_llm.py:261] Added request cmpl-ea0607d41fc943f0a268e3ec314e5503-0.
INFO 03-02 01:29:08 [loggers.py:116] Engine 000: Avg prompt throughput: 27.9 tokens/s, Avg generation throughput: 4.5 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.6%
INFO 03-02 01:29:08 [logger.py:42] Received request cmpl-16ac80f2e104486a812f2387f701c066-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:29:08 [async_llm.py:261] Added request cmpl-16ac80f2e104486a812f2387f701c066-0.
INFO 03-02 01:29:09 [logger.py:42] Received request cmpl-8722f4a7f5ec47a4a90d15eed55adc34-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:29:09 [async_llm.py:261] Added request cmpl-8722f4a7f5ec47a4a90d15eed55adc34-0.
INFO 03-02 01:29:10 [logger.py:42] Received request cmpl-434ebc74037f4d939a53123004322baf-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:29:10 [async_llm.py:261] Added request cmpl-434ebc74037f4d939a53123004322baf-0.
INFO 03-02 01:29:12 [logger.py:42] Received request cmpl-27daa00fe15e43f8b2311a30da5b0363-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:29:12 [async_llm.py:261] Added request cmpl-27daa00fe15e43f8b2311a30da5b0363-0.
INFO 03-02 01:29:13 [logger.py:42] Received request cmpl-a50c4a943fb9425cb0dbacdbf4af1049-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:29:13 [async_llm.py:261] Added request cmpl-a50c4a943fb9425cb0dbacdbf4af1049-0.
INFO 03-02 01:29:14 [logger.py:42] Received request cmpl-9ca7485da7084f9abc24bd58f466457c-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:29:14 [async_llm.py:261] Added request cmpl-9ca7485da7084f9abc24bd58f466457c-0.
INFO 03-02 01:29:15 [logger.py:42] Received request cmpl-7ceec0921d6b4472bea3ce0b3f5129e9-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:29:15 [async_llm.py:261] Added request cmpl-7ceec0921d6b4472bea3ce0b3f5129e9-0.
INFO 03-02 01:29:16 [logger.py:42] Received request cmpl-96b655628b2a434d9e7a526d86dc2538-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:29:16 [async_llm.py:261] Added request cmpl-96b655628b2a434d9e7a526d86dc2538-0.
INFO 03-02 01:29:17 [logger.py:42] Received request cmpl-0ab1653ea15f401f8e0e9ab29d8f99d5-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:29:17 [async_llm.py:261] Added request cmpl-0ab1653ea15f401f8e0e9ab29d8f99d5-0.
INFO 03-02 01:29:18 [loggers.py:116] Engine 000: Avg prompt throughput: 27.9 tokens/s, Avg generation throughput: 4.5 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.6%
INFO 03-02 01:29:18 [logger.py:42] Received request cmpl-3b07712cf6c04dc1ba6c9cec9983244a-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:29:18 [async_llm.py:261] Added request cmpl-3b07712cf6c04dc1ba6c9cec9983244a-0.
INFO 03-02 01:29:20 [logger.py:42] Received request cmpl-17cac272175140bdb9522eb2c9555016-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:29:20 [async_llm.py:261] Added request cmpl-17cac272175140bdb9522eb2c9555016-0.
INFO 03-02 01:29:21 [logger.py:42] Received request cmpl-576866ba12a4497885456465cba36018-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:29:21 [async_llm.py:261] Added request cmpl-576866ba12a4497885456465cba36018-0.
INFO 03-02 01:29:22 [logger.py:42] Received request cmpl-44c376e35a0741e7aa2700b9d8ccd77d-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:29:22 [async_llm.py:261] Added request cmpl-44c376e35a0741e7aa2700b9d8ccd77d-0.
INFO 03-02 01:29:23 [logger.py:42] Received request cmpl-f85c34e972594b91bbc3b1615ed3890d-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:29:23 [async_llm.py:261] Added request cmpl-f85c34e972594b91bbc3b1615ed3890d-0.
INFO 03-02 01:29:24 [logger.py:42] Received request cmpl-b551eacf314d48248b0fc4410b89a112-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:29:24 [async_llm.py:261] Added request cmpl-b551eacf314d48248b0fc4410b89a112-0.
INFO 03-02 01:29:25 [logger.py:42] Received request cmpl-ca3ddc7707f1424f8360ce625a28694c-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:29:25 [async_llm.py:261] Added request cmpl-ca3ddc7707f1424f8360ce625a28694c-0.
INFO 03-02 01:29:27 [logger.py:42] Received request cmpl-7c6d8de15309461996848568a12534b1-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:29:27 [async_llm.py:261] Added request cmpl-7c6d8de15309461996848568a12534b1-0.
INFO 03-02 01:29:28 [loggers.py:116] Engine 000: Avg prompt throughput: 24.8 tokens/s, Avg generation throughput: 4.0 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.6%
INFO 03-02 01:29:28 [logger.py:42] Received request cmpl-f7a5bfa2d6f64629a377063aec9c3111-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:29:28 [async_llm.py:261] Added request cmpl-f7a5bfa2d6f64629a377063aec9c3111-0.
INFO 03-02 01:29:29 [logger.py:42] Received request cmpl-2cf5669f066c45d08203ddfe5e2bbfba-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:29:29 [async_llm.py:261] Added request cmpl-2cf5669f066c45d08203ddfe5e2bbfba-0.
INFO 03-02 01:29:30 [logger.py:42] Received request cmpl-21040b2a7b23406da060e3c424bce7a3-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:29:30 [async_llm.py:261] Added request cmpl-21040b2a7b23406da060e3c424bce7a3-0.
INFO 03-02 01:29:31 [logger.py:42] Received request cmpl-fbc864562cd9484fba180b51cefc4f2e-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:29:31 [async_llm.py:261] Added request cmpl-fbc864562cd9484fba180b51cefc4f2e-0.
INFO 03-02 01:29:32 [logger.py:42] Received request cmpl-8af9d96183a64022805fe88302a9204f-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:29:32 [async_llm.py:261] Added request cmpl-8af9d96183a64022805fe88302a9204f-0.
INFO 03-02 01:29:34 [logger.py:42] Received request cmpl-19e4ff04673440fabb72e9078be64828-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:29:34 [async_llm.py:261] Added request cmpl-19e4ff04673440fabb72e9078be64828-0.
INFO 03-02 01:29:35 [logger.py:42] Received request cmpl-3589ffe347b34093b36cff75dbe457de-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:29:35 [async_llm.py:261] Added request cmpl-3589ffe347b34093b36cff75dbe457de-0.
INFO 03-02 01:29:36 [logger.py:42] Received request cmpl-0901eecd14b147d3bef36131cea6d818-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:29:36 [async_llm.py:261] Added request cmpl-0901eecd14b147d3bef36131cea6d818-0.
INFO 03-02 01:29:37 [logger.py:42] Received request cmpl-1262af6ae6a340f1a90a70d246a81dc6-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:29:37 [async_llm.py:261] Added request cmpl-1262af6ae6a340f1a90a70d246a81dc6-0.
INFO 03-02 01:29:38 [loggers.py:116] Engine 000: Avg prompt throughput: 27.9 tokens/s, Avg generation throughput: 4.5 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.6%
INFO 03-02 01:29:38 [logger.py:42] Received request cmpl-2a86d016c90a41b7a4c4fea73561c45f-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:29:38 [async_llm.py:261] Added request cmpl-2a86d016c90a41b7a4c4fea73561c45f-0.
INFO 03-02 01:29:39 [logger.py:42] Received request cmpl-08287a1ba47043a8900dddbe52b37d10-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:29:39 [async_llm.py:261] Added request cmpl-08287a1ba47043a8900dddbe52b37d10-0.
INFO 03-02 01:29:40 [logger.py:42] Received request cmpl-0cb7c3c695c844998060ecb9a98b8ccb-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:29:40 [async_llm.py:261] Added request cmpl-0cb7c3c695c844998060ecb9a98b8ccb-0.
INFO 03-02 01:29:42 [logger.py:42] Received request cmpl-88aa44d201ef4b23be0eed49f9f8ce9b-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:29:42 [async_llm.py:261] Added request cmpl-88aa44d201ef4b23be0eed49f9f8ce9b-0.
INFO 03-02 01:29:43 [logger.py:42] Received request cmpl-6fb1b2ae52dc4f37a0a0dee6ebfeb92a-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:29:43 [async_llm.py:261] Added request cmpl-6fb1b2ae52dc4f37a0a0dee6ebfeb92a-0.
INFO 03-02 01:29:44 [logger.py:42] Received request cmpl-c1142fc528a14da99ca16d8fc2eef04b-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:29:44 [async_llm.py:261] Added request cmpl-c1142fc528a14da99ca16d8fc2eef04b-0.
INFO 03-02 01:29:45 [logger.py:42] Received request cmpl-557ea2fa472843a993367771fbfe0c4f-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:29:45 [async_llm.py:261] Added request cmpl-557ea2fa472843a993367771fbfe0c4f-0.
INFO 03-02 01:29:46 [logger.py:42] Received request cmpl-0b0b7dead52c468dac7d2145e945aa24-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:29:46 [async_llm.py:261] Added request cmpl-0b0b7dead52c468dac7d2145e945aa24-0.
INFO 03-02 01:29:47 [logger.py:42] Received request cmpl-441dcc37d91543fda35c673bf43d2229-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:29:47 [async_llm.py:261] Added request cmpl-441dcc37d91543fda35c673bf43d2229-0.
INFO 03-02 01:29:48 [loggers.py:116] Engine 000: Avg prompt throughput: 27.9 tokens/s, Avg generation throughput: 4.5 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.6%
INFO 03-02 01:29:49 [logger.py:42] Received request cmpl-12344ef33e374a4ba84f5e7531e0204f-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:29:49 [async_llm.py:261] Added request cmpl-12344ef33e374a4ba84f5e7531e0204f-0.
INFO 03-02 01:29:50 [logger.py:42] Received request cmpl-f0d7c7a99608433eba4ee790b3a45ba6-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:29:50 [async_llm.py:261] Added request cmpl-f0d7c7a99608433eba4ee790b3a45ba6-0.
INFO 03-02 01:29:51 [logger.py:42] Received request cmpl-dea9b113c3764f7284a12ef024145bb6-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:29:51 [async_llm.py:261] Added request cmpl-dea9b113c3764f7284a12ef024145bb6-0.
INFO 03-02 01:29:52 [logger.py:42] Received request cmpl-30028343308041b58f62aa030e8c46d2-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:29:52 [async_llm.py:261] Added request cmpl-30028343308041b58f62aa030e8c46d2-0.
INFO 03-02 01:29:53 [logger.py:42] Received request cmpl-f4476f8c86414871bcfc6700a6663812-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:29:53 [async_llm.py:261] Added request cmpl-f4476f8c86414871bcfc6700a6663812-0.
INFO 03-02 01:29:54 [logger.py:42] Received request cmpl-2cf3cce518c84d8baa6f30b576572c24-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:29:54 [async_llm.py:261] Added request cmpl-2cf3cce518c84d8baa6f30b576572c24-0.
INFO 03-02 01:29:55 [logger.py:42] Received request cmpl-d5226cfdc0c048cebc7b9c1d55c4ffbb-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:29:55 [async_llm.py:261] Added request cmpl-d5226cfdc0c048cebc7b9c1d55c4ffbb-0.
INFO 03-02 01:29:57 [logger.py:42] Received request cmpl-a701dbe0c63a4b67851d5a64a868ef84-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:29:57 [async_llm.py:261] Added request cmpl-a701dbe0c63a4b67851d5a64a868ef84-0.
INFO 03-02 01:29:58 [loggers.py:116] Engine 000: Avg prompt throughput: 24.8 tokens/s, Avg generation throughput: 4.0 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.6%
INFO 03-02 01:29:58 [logger.py:42] Received request cmpl-5867df933745477e919fad1e5f32c42f-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:29:58 [async_llm.py:261] Added request cmpl-5867df933745477e919fad1e5f32c42f-0.
INFO 03-02 01:29:59 [logger.py:42] Received request cmpl-7331098757d942999cd6ca17c137e17a-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:29:59 [async_llm.py:261] Added request cmpl-7331098757d942999cd6ca17c137e17a-0.
INFO 03-02 01:30:00 [logger.py:42] Received request cmpl-ebf26c262f594ac190f211f041527593-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:30:00 [async_llm.py:261] Added request cmpl-ebf26c262f594ac190f211f041527593-0.
INFO 03-02 01:30:01 [logger.py:42] Received request cmpl-8c5b09fcc8974bd7bb5d2ee0dbddbdc4-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:30:01 [async_llm.py:261] Added request cmpl-8c5b09fcc8974bd7bb5d2ee0dbddbdc4-0.
INFO 03-02 01:30:02 [logger.py:42] Received request cmpl-af16fa802ae147d29b88de6b375c6692-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:30:02 [async_llm.py:261] Added request cmpl-af16fa802ae147d29b88de6b375c6692-0.
INFO 03-02 01:30:03 [logger.py:42] Received request cmpl-92599588fa4340e196d7c7d41c18f6b6-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:30:03 [async_llm.py:261] Added request cmpl-92599588fa4340e196d7c7d41c18f6b6-0.
INFO 03-02 01:30:05 [logger.py:42] Received request cmpl-735e1c1dd3aa47978d06334218a8418f-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:30:05 [async_llm.py:261] Added request cmpl-735e1c1dd3aa47978d06334218a8418f-0.
INFO 03-02 01:30:06 [logger.py:42] Received request cmpl-59fc6c98f02b42dfb0fbcd6b5b429ebd-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:30:06 [async_llm.py:261] Added request cmpl-59fc6c98f02b42dfb0fbcd6b5b429ebd-0.
INFO 03-02 01:30:07 [logger.py:42] Received request cmpl-768c95876eb2472db4cada441e251a92-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:30:07 [async_llm.py:261] Added request cmpl-768c95876eb2472db4cada441e251a92-0.
INFO 03-02 01:30:08 [loggers.py:116] Engine 000: Avg prompt throughput: 27.9 tokens/s, Avg generation throughput: 4.5 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.6%
INFO 03-02 01:30:08 [logger.py:42] Received request cmpl-4c07ec84d87a48dbaa505ef7192093ec-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:30:08 [async_llm.py:261] Added request cmpl-4c07ec84d87a48dbaa505ef7192093ec-0.
INFO 03-02 01:30:09 [logger.py:42] Received request cmpl-d9bd906bb1414535afb0a5e1870260dc-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:30:09 [async_llm.py:261] Added request cmpl-d9bd906bb1414535afb0a5e1870260dc-0.
INFO 03-02 01:30:10 [logger.py:42] Received request cmpl-28936e847f924767aa3f6c40efc69542-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:30:10 [async_llm.py:261] Added request cmpl-28936e847f924767aa3f6c40efc69542-0.
INFO 03-02 01:30:12 [logger.py:42] Received request cmpl-e43fde1fe7ee49de8aac41ec28dd9a45-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:30:12 [async_llm.py:261] Added request cmpl-e43fde1fe7ee49de8aac41ec28dd9a45-0.
INFO 03-02 01:30:13 [logger.py:42] Received request cmpl-7adbec9462b6476db6026d2b8fc0ee5a-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:30:13 [async_llm.py:261] Added request cmpl-7adbec9462b6476db6026d2b8fc0ee5a-0.
INFO 03-02 01:30:14 [logger.py:42] Received request cmpl-f067e69420624a399eb3b614d77fe5b5-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:30:14 [async_llm.py:261] Added request cmpl-f067e69420624a399eb3b614d77fe5b5-0.
INFO 03-02 01:30:15 [logger.py:42] Received request cmpl-a841864495484016b182e443cd6d5d0f-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:30:15 [async_llm.py:261] Added request cmpl-a841864495484016b182e443cd6d5d0f-0.
INFO 03-02 01:30:16 [logger.py:42] Received request cmpl-9d7eac1af745482da35e0d6fc01d2bea-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:30:16 [async_llm.py:261] Added request cmpl-9d7eac1af745482da35e0d6fc01d2bea-0.
INFO 03-02 01:30:17 [logger.py:42] Received request cmpl-a97df2483b7b4a78888b3dc1b28db5ef-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:30:17 [async_llm.py:261] Added request cmpl-a97df2483b7b4a78888b3dc1b28db5ef-0.
INFO 03-02 01:30:18 [loggers.py:116] Engine 000: Avg prompt throughput: 27.9 tokens/s, Avg generation throughput: 4.5 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.6%
INFO 03-02 01:30:18 [logger.py:42] Received request cmpl-b86c9861af5047c0b17c8ea9e72482e2-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:30:18 [async_llm.py:261] Added request cmpl-b86c9861af5047c0b17c8ea9e72482e2-0.
INFO 03-02 01:30:20 [logger.py:42] Received request cmpl-94fa384a9e7b4d2db52f47873c592a47-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:30:20 [async_llm.py:261] Added request cmpl-94fa384a9e7b4d2db52f47873c592a47-0.
INFO 03-02 01:30:21 [logger.py:42] Received request cmpl-d3651ea93ab0426f9d108f440da58658-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:30:21 [async_llm.py:261] Added request cmpl-d3651ea93ab0426f9d108f440da58658-0.
INFO 03-02 01:30:22 [logger.py:42] Received request cmpl-1b1cfe97576a4443a451c46f1a1b162b-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:30:22 [async_llm.py:261] Added request cmpl-1b1cfe97576a4443a451c46f1a1b162b-0.
INFO 03-02 01:30:23 [logger.py:42] Received request cmpl-5a5a2078497d416faec00e99568a70a7-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:30:23 [async_llm.py:261] Added request cmpl-5a5a2078497d416faec00e99568a70a7-0.
INFO 03-02 01:30:24 [logger.py:42] Received request cmpl-35c9ffb06946420d9d2ae09fdfd9f779-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:30:24 [async_llm.py:261] Added request cmpl-35c9ffb06946420d9d2ae09fdfd9f779-0.
INFO 03-02 01:30:25 [logger.py:42] Received request cmpl-1c923115e6cd42afb1893de59adbd344-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:30:25 [async_llm.py:261] Added request cmpl-1c923115e6cd42afb1893de59adbd344-0.
INFO 03-02 01:30:27 [logger.py:42] Received request cmpl-f4d7918da5764a779f32854e2984fbd3-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:30:27 [async_llm.py:261] Added request cmpl-f4d7918da5764a779f32854e2984fbd3-0.
INFO 03-02 01:30:28 [loggers.py:116] Engine 000: Avg prompt throughput: 24.8 tokens/s, Avg generation throughput: 4.0 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.6%
INFO 03-02 01:30:28 [logger.py:42] Received request cmpl-5717f888947940f8a52f47be9a818a0d-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:30:28 [async_llm.py:261] Added request cmpl-5717f888947940f8a52f47be9a818a0d-0.
INFO 03-02 01:30:29 [logger.py:42] Received request cmpl-aa9d19c3eea343ec98e9676cb6f5be9b-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:30:29 [async_llm.py:261] Added request cmpl-aa9d19c3eea343ec98e9676cb6f5be9b-0.
INFO 03-02 01:30:30 [logger.py:42] Received request cmpl-3adfae64689940d9bc0dda8da9de4a80-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:30:30 [async_llm.py:261] Added request cmpl-3adfae64689940d9bc0dda8da9de4a80-0.
INFO 03-02 01:30:31 [logger.py:42] Received request cmpl-2ada476afebe4c5cb5154b9e8faeb11b-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:30:31 [async_llm.py:261] Added request cmpl-2ada476afebe4c5cb5154b9e8faeb11b-0.
INFO 03-02 01:30:32 [logger.py:42] Received request cmpl-27fe6d528f0f4a74b58e55edc3a32424-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:30:32 [async_llm.py:261] Added request cmpl-27fe6d528f0f4a74b58e55edc3a32424-0.
INFO 03-02 01:30:33 [logger.py:42] Received request cmpl-96f0c110d8f741bca6b11d90d221f707-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:30:33 [async_llm.py:261] Added request cmpl-96f0c110d8f741bca6b11d90d221f707-0.
INFO 03-02 01:30:35 [logger.py:42] Received request cmpl-4b768abd871944248e907b0c9a8cf210-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:30:35 [async_llm.py:261] Added request cmpl-4b768abd871944248e907b0c9a8cf210-0.
INFO 03-02 01:30:36 [logger.py:42] Received request cmpl-f012e8035b534aa0a38a64b52d328ee6-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:30:36 [async_llm.py:261] Added request cmpl-f012e8035b534aa0a38a64b52d328ee6-0.
INFO 03-02 01:30:37 [logger.py:42] Received request cmpl-b13d41d87b0f43babd1942f46f5e4d8e-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:30:37 [async_llm.py:261] Added request cmpl-b13d41d87b0f43babd1942f46f5e4d8e-0.
INFO 03-02 01:30:38 [loggers.py:116] Engine 000: Avg prompt throughput: 27.9 tokens/s, Avg generation throughput: 4.5 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.6%
INFO 03-02 01:30:38 [logger.py:42] Received request cmpl-ad64b578d05d45358e374d2baa1201c3-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:30:38 [async_llm.py:261] Added request cmpl-ad64b578d05d45358e374d2baa1201c3-0.
INFO 03-02 01:30:39 [logger.py:42] Received request cmpl-cf5280a41f7a4cd9bfa87307d5f845f0-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:30:39 [async_llm.py:261] Added request cmpl-cf5280a41f7a4cd9bfa87307d5f845f0-0.
INFO 03-02 01:30:40 [logger.py:42] Received request cmpl-d6caa69285a54d03b135e6e90e91505f-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:30:40 [async_llm.py:261] Added request cmpl-d6caa69285a54d03b135e6e90e91505f-0.
INFO 03-02 01:30:42 [logger.py:42] Received request cmpl-e16539c285e345caab2fdf6fcc5c2628-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:30:42 [async_llm.py:261] Added request cmpl-e16539c285e345caab2fdf6fcc5c2628-0.
INFO 03-02 01:30:43 [logger.py:42] Received request cmpl-f56320b6a28a46a486902e1fa2335696-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:30:43 [async_llm.py:261] Added request cmpl-f56320b6a28a46a486902e1fa2335696-0.
INFO 03-02 01:30:44 [logger.py:42] Received request cmpl-9fe6087f7b084be9b740e5464fd27c1c-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:30:44 [async_llm.py:261] Added request cmpl-9fe6087f7b084be9b740e5464fd27c1c-0.
INFO 03-02 01:30:45 [logger.py:42] Received request cmpl-f297e2df59f645079f4bce4181f87a1a-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:30:45 [async_llm.py:261] Added request cmpl-f297e2df59f645079f4bce4181f87a1a-0.
INFO 03-02 01:30:46 [logger.py:42] Received request cmpl-e05f16c79295429fa35e1a5408c35783-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:30:46 [async_llm.py:261] Added request cmpl-e05f16c79295429fa35e1a5408c35783-0.
INFO 03-02 01:30:47 [logger.py:42] Received request cmpl-f43e2c0051a049bea9a8e818f577b884-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:30:47 [async_llm.py:261] Added request cmpl-f43e2c0051a049bea9a8e818f577b884-0.
INFO 03-02 01:30:48 [loggers.py:116] Engine 000: Avg prompt throughput: 27.9 tokens/s, Avg generation throughput: 4.5 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.6%
INFO 03-02 01:30:48 [logger.py:42] Received request cmpl-14ed1f2688bf4319ba0e7b9516067b7c-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:30:48 [async_llm.py:261] Added request cmpl-14ed1f2688bf4319ba0e7b9516067b7c-0.
INFO 03-02 01:30:50 [logger.py:42] Received request cmpl-4200cd5774e04cb3a0fa7d2891f4998d-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:30:50 [async_llm.py:261] Added request cmpl-4200cd5774e04cb3a0fa7d2891f4998d-0.
INFO 03-02 01:30:51 [logger.py:42] Received request cmpl-e1e7cc9811c548179f5799cdd559adc0-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:30:51 [async_llm.py:261] Added request cmpl-e1e7cc9811c548179f5799cdd559adc0-0.
INFO 03-02 01:30:52 [logger.py:42] Received request cmpl-a3b5dd6632b5444991234488230c15bb-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:30:52 [async_llm.py:261] Added request cmpl-a3b5dd6632b5444991234488230c15bb-0.
INFO 03-02 01:30:53 [logger.py:42] Received request cmpl-9dfc6252181d4ffc87e3fd962255b12d-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:30:53 [async_llm.py:261] Added request cmpl-9dfc6252181d4ffc87e3fd962255b12d-0.
INFO 03-02 01:30:54 [logger.py:42] Received request cmpl-6194862464a14e549950d5c22d48915b-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:30:54 [async_llm.py:261] Added request cmpl-6194862464a14e549950d5c22d48915b-0.
INFO 03-02 01:30:55 [logger.py:42] Received request cmpl-53d0fb0d24864b4e8073d6ff7f793696-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:30:55 [async_llm.py:261] Added request cmpl-53d0fb0d24864b4e8073d6ff7f793696-0.
INFO 03-02 01:30:57 [logger.py:42] Received request cmpl-d17eae46ad874927b4ea56feb5488c07-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:30:57 [async_llm.py:261] Added request cmpl-d17eae46ad874927b4ea56feb5488c07-0.
INFO 03-02 01:30:58 [loggers.py:116] Engine 000: Avg prompt throughput: 24.8 tokens/s, Avg generation throughput: 4.0 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.6%
INFO 03-02 01:30:58 [logger.py:42] Received request cmpl-ce63cf4929ca4d088a622a058b6c2e3e-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:30:58 [async_llm.py:261] Added request cmpl-ce63cf4929ca4d088a622a058b6c2e3e-0.
INFO 03-02 01:30:59 [logger.py:42] Received request cmpl-d6c50b68c985459aad9d6d42217c13af-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:30:59 [async_llm.py:261] Added request cmpl-d6c50b68c985459aad9d6d42217c13af-0.
INFO 03-02 01:31:00 [logger.py:42] Received request cmpl-991abea2c83540039e0b22fc3a78f747-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:31:00 [async_llm.py:261] Added request cmpl-991abea2c83540039e0b22fc3a78f747-0.
INFO 03-02 01:31:01 [logger.py:42] Received request cmpl-14e5acff0eb84aef8e6b09cc4bc2c152-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:31:01 [async_llm.py:261] Added request cmpl-14e5acff0eb84aef8e6b09cc4bc2c152-0.
INFO 03-02 01:31:02 [logger.py:42] Received request cmpl-f8dfedc1ab9c43d39bd44be226b00ebf-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:31:02 [async_llm.py:261] Added request cmpl-f8dfedc1ab9c43d39bd44be226b00ebf-0.
INFO 03-02 01:31:03 [logger.py:42] Received request cmpl-56f113e8738345518ef602003402fd5e-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:31:03 [async_llm.py:261] Added request cmpl-56f113e8738345518ef602003402fd5e-0.
INFO 03-02 01:31:05 [logger.py:42] Received request cmpl-7cecf9814d63446eb02eb39dbd80aee3-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:31:05 [async_llm.py:261] Added request cmpl-7cecf9814d63446eb02eb39dbd80aee3-0.
INFO 03-02 01:31:06 [logger.py:42] Received request cmpl-6a65e87dc2344c23afc24c9866194fed-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:31:06 [async_llm.py:261] Added request cmpl-6a65e87dc2344c23afc24c9866194fed-0.
INFO 03-02 01:31:07 [logger.py:42] Received request cmpl-7232ebdf660b446b8041a503356270f5-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:31:07 [async_llm.py:261] Added request cmpl-7232ebdf660b446b8041a503356270f5-0.
INFO 03-02 01:31:08 [loggers.py:116] Engine 000: Avg prompt throughput: 27.9 tokens/s, Avg generation throughput: 4.5 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.6%
INFO 03-02 01:31:08 [logger.py:42] Received request cmpl-eed66e05c2284854b06fd9397c482c6b-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:31:08 [async_llm.py:261] Added request cmpl-eed66e05c2284854b06fd9397c482c6b-0.
INFO 03-02 01:31:09 [logger.py:42] Received request cmpl-2e5598249117415bb5be26bdf8ee590b-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:31:09 [async_llm.py:261] Added request cmpl-2e5598249117415bb5be26bdf8ee590b-0.
INFO 03-02 01:31:10 [logger.py:42] Received request cmpl-b29c8d1ffa2a42dfaaa4fd47b2ba5d07-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:31:10 [async_llm.py:261] Added request cmpl-b29c8d1ffa2a42dfaaa4fd47b2ba5d07-0.
INFO 03-02 01:31:12 [logger.py:42] Received request cmpl-fc7a3493dfe94180816d41b67674660d-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:31:12 [async_llm.py:261] Added request cmpl-fc7a3493dfe94180816d41b67674660d-0.
INFO 03-02 01:31:13 [logger.py:42] Received request cmpl-47b4d52fdb3e4e098a45239e7a305093-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:31:13 [async_llm.py:261] Added request cmpl-47b4d52fdb3e4e098a45239e7a305093-0.
INFO 03-02 01:31:14 [logger.py:42] Received request cmpl-19fe7fe006b3440e9e26a05f547e4c1f-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:31:14 [async_llm.py:261] Added request cmpl-19fe7fe006b3440e9e26a05f547e4c1f-0.
INFO 03-02 01:31:15 [logger.py:42] Received request cmpl-af7eed1d88b84a52b6e97a3c4c4c592a-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:31:15 [async_llm.py:261] Added request cmpl-af7eed1d88b84a52b6e97a3c4c4c592a-0.
INFO 03-02 01:31:16 [logger.py:42] Received request cmpl-ad46ae8ccf724ba9b74b7a0cec309a57-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:31:16 [async_llm.py:261] Added request cmpl-ad46ae8ccf724ba9b74b7a0cec309a57-0.
INFO 03-02 01:31:17 [logger.py:42] Received request cmpl-f3a7f50df349407aaa5a926bf7e29d99-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:31:17 [async_llm.py:261] Added request cmpl-f3a7f50df349407aaa5a926bf7e29d99-0.
INFO 03-02 01:31:18 [loggers.py:116] Engine 000: Avg prompt throughput: 27.9 tokens/s, Avg generation throughput: 4.5 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.6%
INFO 03-02 01:31:18 [logger.py:42] Received request cmpl-381462d43f2c4c6dbe4a0987b2aa45df-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:31:18 [async_llm.py:261] Added request cmpl-381462d43f2c4c6dbe4a0987b2aa45df-0.
INFO 03-02 01:31:20 [logger.py:42] Received request cmpl-253e391d67ae403692708bdf326e81a3-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:31:20 [async_llm.py:261] Added request cmpl-253e391d67ae403692708bdf326e81a3-0.
INFO 03-02 01:31:21 [logger.py:42] Received request cmpl-3e1394987fc647009b921fedc7f7db8c-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:31:21 [async_llm.py:261] Added request cmpl-3e1394987fc647009b921fedc7f7db8c-0.
INFO 03-02 01:31:22 [logger.py:42] Received request cmpl-739b58549e9c4c779f1ec0ecbdca8af6-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:31:22 [async_llm.py:261] Added request cmpl-739b58549e9c4c779f1ec0ecbdca8af6-0.
INFO 03-02 01:31:23 [logger.py:42] Received request cmpl-03675d81f9354e96b6bd587ea4b0c0fc-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:31:23 [async_llm.py:261] Added request cmpl-03675d81f9354e96b6bd587ea4b0c0fc-0.
INFO 03-02 01:31:24 [logger.py:42] Received request cmpl-9f356b608c614e4aaeda2ab605b1b012-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:31:24 [async_llm.py:261] Added request cmpl-9f356b608c614e4aaeda2ab605b1b012-0.
INFO 03-02 01:31:25 [logger.py:42] Received request cmpl-68eaf5e0cce34b45be18c3c852b29454-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:31:25 [async_llm.py:261] Added request cmpl-68eaf5e0cce34b45be18c3c852b29454-0.
INFO 03-02 01:31:27 [logger.py:42] Received request cmpl-502d020ae3154d25a01d30ed5adf7372-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:31:27 [async_llm.py:261] Added request cmpl-502d020ae3154d25a01d30ed5adf7372-0.
INFO 03-02 01:31:28 [loggers.py:116] Engine 000: Avg prompt throughput: 24.8 tokens/s, Avg generation throughput: 4.0 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.6%
INFO 03-02 01:31:28 [logger.py:42] Received request cmpl-2a2d6f5cb2fa44aba8d8f16b80fd74f2-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:31:28 [async_llm.py:261] Added request cmpl-2a2d6f5cb2fa44aba8d8f16b80fd74f2-0.
INFO 03-02 01:31:29 [logger.py:42] Received request cmpl-dc79e4ca60094b5f88d14fbdc5b6d5a5-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:31:29 [async_llm.py:261] Added request cmpl-dc79e4ca60094b5f88d14fbdc5b6d5a5-0.
INFO 03-02 01:31:30 [logger.py:42] Received request cmpl-460bd0eefbce44beb8dacf2cca0acb2d-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:31:30 [async_llm.py:261] Added request cmpl-460bd0eefbce44beb8dacf2cca0acb2d-0.
INFO 03-02 01:31:31 [logger.py:42] Received request cmpl-52f0dad7adcd4d01b3121c48039e9675-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:31:31 [async_llm.py:261] Added request cmpl-52f0dad7adcd4d01b3121c48039e9675-0.
INFO 03-02 01:31:32 [logger.py:42] Received request cmpl-f7096a7032924959b0aa4fc61be53d77-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:31:32 [async_llm.py:261] Added request cmpl-f7096a7032924959b0aa4fc61be53d77-0.
INFO 03-02 01:31:34 [logger.py:42] Received request cmpl-fc98a06f1c9c4ff4b758837e8b385230-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:31:34 [async_llm.py:261] Added request cmpl-fc98a06f1c9c4ff4b758837e8b385230-0.
INFO 03-02 01:31:35 [logger.py:42] Received request cmpl-bd23ae1124294db4925360b2a3efb60a-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:31:35 [async_llm.py:261] Added request cmpl-bd23ae1124294db4925360b2a3efb60a-0.
INFO 03-02 01:31:36 [logger.py:42] Received request cmpl-d72ec6c25d934182bbb44bdebbdcb678-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:31:36 [async_llm.py:261] Added request cmpl-d72ec6c25d934182bbb44bdebbdcb678-0.
INFO 03-02 01:31:37 [logger.py:42] Received request cmpl-a476f7bbd6b144e3ba5253e24efedbfe-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:31:37 [async_llm.py:261] Added request cmpl-a476f7bbd6b144e3ba5253e24efedbfe-0.
INFO 03-02 01:31:38 [loggers.py:116] Engine 000: Avg prompt throughput: 27.9 tokens/s, Avg generation throughput: 4.5 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.6%
INFO 03-02 01:31:38 [logger.py:42] Received request cmpl-63d59f2fe09544ffb01e432d8b6c985b-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:31:38 [async_llm.py:261] Added request cmpl-63d59f2fe09544ffb01e432d8b6c985b-0.
INFO 03-02 01:31:39 [logger.py:42] Received request cmpl-0bc52d908d2f42eb926f12309e32d576-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:31:39 [async_llm.py:261] Added request cmpl-0bc52d908d2f42eb926f12309e32d576-0.
INFO 03-02 01:31:40 [logger.py:42] Received request cmpl-d00a0495532a48bfa4376500e4c85880-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:31:40 [async_llm.py:261] Added request cmpl-d00a0495532a48bfa4376500e4c85880-0.
INFO 03-02 01:31:42 [logger.py:42] Received request cmpl-7cf05a4ff6a7415b868de5f1d6caed63-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:31:42 [async_llm.py:261] Added request cmpl-7cf05a4ff6a7415b868de5f1d6caed63-0.
INFO 03-02 01:31:43 [logger.py:42] Received request cmpl-51b8e91ef04c420b9030eb517bc095bb-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:31:43 [async_llm.py:261] Added request cmpl-51b8e91ef04c420b9030eb517bc095bb-0.
INFO 03-02 01:31:44 [logger.py:42] Received request cmpl-0c7f05fb8de84a6fb9dea7cbc628fcd9-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:31:44 [async_llm.py:261] Added request cmpl-0c7f05fb8de84a6fb9dea7cbc628fcd9-0.
INFO 03-02 01:31:45 [logger.py:42] Received request cmpl-54fabd6b2c064f9c992e1da30ebbf885-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:31:45 [async_llm.py:261] Added request cmpl-54fabd6b2c064f9c992e1da30ebbf885-0.
INFO 03-02 01:31:46 [logger.py:42] Received request cmpl-95ffad19f7b84dd4a8938e35e51c6f95-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:31:46 [async_llm.py:261] Added request cmpl-95ffad19f7b84dd4a8938e35e51c6f95-0.
INFO 03-02 01:31:47 [logger.py:42] Received request cmpl-2afdfd8f9236471ca8deb38ddaf9206d-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:31:47 [async_llm.py:261] Added request cmpl-2afdfd8f9236471ca8deb38ddaf9206d-0.
INFO 03-02 01:31:48 [loggers.py:116] Engine 000: Avg prompt throughput: 27.9 tokens/s, Avg generation throughput: 4.5 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.6%
INFO 03-02 01:31:49 [logger.py:42] Received request cmpl-951a621a1da94ca0bd4862b16f7c5e61-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:31:49 [async_llm.py:261] Added request cmpl-951a621a1da94ca0bd4862b16f7c5e61-0.
INFO 03-02 01:31:50 [logger.py:42] Received request cmpl-3dbe738835634fc98b9403a3d8cb2cf6-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:31:50 [async_llm.py:261] Added request cmpl-3dbe738835634fc98b9403a3d8cb2cf6-0.
INFO 03-02 01:31:51 [logger.py:42] Received request cmpl-82339526681d4130945ee16e8c9ed4e6-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:31:51 [async_llm.py:261] Added request cmpl-82339526681d4130945ee16e8c9ed4e6-0.
INFO 03-02 01:31:52 [logger.py:42] Received request cmpl-b47caac95834441284724db955eb5d59-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:31:52 [async_llm.py:261] Added request cmpl-b47caac95834441284724db955eb5d59-0.
INFO 03-02 01:31:53 [logger.py:42] Received request cmpl-56f2960f7edc4475b0f2104d34da2168-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:31:53 [async_llm.py:261] Added request cmpl-56f2960f7edc4475b0f2104d34da2168-0.
INFO 03-02 01:31:54 [logger.py:42] Received request cmpl-942b1219b80145f48578049f524e5175-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:31:54 [async_llm.py:261] Added request cmpl-942b1219b80145f48578049f524e5175-0.
INFO 03-02 01:31:55 [logger.py:42] Received request cmpl-efa79aefae05432b99740b0496315d6a-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:31:55 [async_llm.py:261] Added request cmpl-efa79aefae05432b99740b0496315d6a-0.
INFO 03-02 01:31:57 [logger.py:42] Received request cmpl-66a8ca3c8782426ca404660f68b1cc73-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:31:57 [async_llm.py:261] Added request cmpl-66a8ca3c8782426ca404660f68b1cc73-0.
INFO 03-02 01:31:58 [loggers.py:116] Engine 000: Avg prompt throughput: 24.8 tokens/s, Avg generation throughput: 4.0 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.6%
INFO 03-02 01:31:58 [logger.py:42] Received request cmpl-84f7b9f8723348eb8ae18602d151def8-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:31:58 [async_llm.py:261] Added request cmpl-84f7b9f8723348eb8ae18602d151def8-0.
INFO 03-02 01:31:59 [logger.py:42] Received request cmpl-ac2796ed293a46f693e2292e76359973-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:31:59 [async_llm.py:261] Added request cmpl-ac2796ed293a46f693e2292e76359973-0.
INFO 03-02 01:32:00 [logger.py:42] Received request cmpl-d49436c411ee4dcfab2b65aa4b80143d-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:32:00 [async_llm.py:261] Added request cmpl-d49436c411ee4dcfab2b65aa4b80143d-0.
INFO 03-02 01:32:01 [logger.py:42] Received request cmpl-dca4bf07086d48a7a0136d877041161e-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:32:01 [async_llm.py:261] Added request cmpl-dca4bf07086d48a7a0136d877041161e-0.
INFO 03-02 01:32:02 [logger.py:42] Received request cmpl-cf232edb06f34202a48ce0754b6e58ab-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:32:02 [async_llm.py:261] Added request cmpl-cf232edb06f34202a48ce0754b6e58ab-0.
INFO 03-02 01:32:04 [logger.py:42] Received request cmpl-e670ef226b424f2fac8f9dc231c8a97c-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:32:04 [async_llm.py:261] Added request cmpl-e670ef226b424f2fac8f9dc231c8a97c-0.
INFO 03-02 01:32:05 [logger.py:42] Received request cmpl-45cdb11f00164c0ca0c7861ace6c606c-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:32:05 [async_llm.py:261] Added request cmpl-45cdb11f00164c0ca0c7861ace6c606c-0.
INFO 03-02 01:32:06 [logger.py:42] Received request cmpl-2a5227ee35324b10a149c2269bafb9db-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:32:06 [async_llm.py:261] Added request cmpl-2a5227ee35324b10a149c2269bafb9db-0.
INFO 03-02 01:32:07 [logger.py:42] Received request cmpl-5fb0da88589c4592a19f438bc2cb7f00-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:32:07 [async_llm.py:261] Added request cmpl-5fb0da88589c4592a19f438bc2cb7f00-0.
INFO 03-02 01:32:08 [loggers.py:116] Engine 000: Avg prompt throughput: 27.9 tokens/s, Avg generation throughput: 4.5 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.6%
INFO 03-02 01:32:08 [logger.py:42] Received request cmpl-47429732302648aa94737b2986f79482-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:32:08 [async_llm.py:261] Added request cmpl-47429732302648aa94737b2986f79482-0.
INFO 03-02 01:32:09 [logger.py:42] Received request cmpl-c9c2bc6982ee44bbacce1339fc34ab28-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:32:09 [async_llm.py:261] Added request cmpl-c9c2bc6982ee44bbacce1339fc34ab28-0.
INFO 03-02 01:32:10 [logger.py:42] Received request cmpl-8aab59648fe6418ea283f462013de81b-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:32:10 [async_llm.py:261] Added request cmpl-8aab59648fe6418ea283f462013de81b-0.
INFO 03-02 01:32:12 [logger.py:42] Received request cmpl-4fa5c50a423a4043a8dde0bcbf1dda01-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:32:12 [async_llm.py:261] Added request cmpl-4fa5c50a423a4043a8dde0bcbf1dda01-0.
INFO 03-02 01:32:13 [logger.py:42] Received request cmpl-8e5f4b00111c4e83b3f8b089f16ce543-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:32:13 [async_llm.py:261] Added request cmpl-8e5f4b00111c4e83b3f8b089f16ce543-0.
INFO 03-02 01:32:14 [logger.py:42] Received request cmpl-4f6fb27d67c94987a6524d3bf469dc58-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:32:14 [async_llm.py:261] Added request cmpl-4f6fb27d67c94987a6524d3bf469dc58-0.
INFO 03-02 01:32:15 [logger.py:42] Received request cmpl-dffea7d80a6348979f9e705199d06ab2-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:32:15 [async_llm.py:261] Added request cmpl-dffea7d80a6348979f9e705199d06ab2-0.
INFO 03-02 01:32:16 [logger.py:42] Received request cmpl-a5e154f7ef04439cbaff6e3226f7a0d1-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:32:16 [async_llm.py:261] Added request cmpl-a5e154f7ef04439cbaff6e3226f7a0d1-0.
INFO 03-02 01:32:17 [logger.py:42] Received request cmpl-976cdde820e540b79678c0e9bf0e0c80-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:32:17 [async_llm.py:261] Added request cmpl-976cdde820e540b79678c0e9bf0e0c80-0.
INFO 03-02 01:32:18 [loggers.py:116] Engine 000: Avg prompt throughput: 27.9 tokens/s, Avg generation throughput: 4.5 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.6%
INFO 03-02 01:32:19 [logger.py:42] Received request cmpl-43448bb9a29a45c79b5aeb79176c5e29-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:32:19 [async_llm.py:261] Added request cmpl-43448bb9a29a45c79b5aeb79176c5e29-0.
INFO 03-02 01:32:20 [logger.py:42] Received request cmpl-605148114cdd4621adf036ca8a0e4ea1-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:32:20 [async_llm.py:261] Added request cmpl-605148114cdd4621adf036ca8a0e4ea1-0.
INFO 03-02 01:32:21 [logger.py:42] Received request cmpl-0ddee560d1d0413ba8ae108e0bf70a10-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:32:21 [async_llm.py:261] Added request cmpl-0ddee560d1d0413ba8ae108e0bf70a10-0.
INFO 03-02 01:32:22 [logger.py:42] Received request cmpl-c785a93e21c54684b18c99ee3364dd8b-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:32:22 [async_llm.py:261] Added request cmpl-c785a93e21c54684b18c99ee3364dd8b-0.
INFO 03-02 01:32:23 [logger.py:42] Received request cmpl-93cf340cea8644979377c8fed61f49fb-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:32:23 [async_llm.py:261] Added request cmpl-93cf340cea8644979377c8fed61f49fb-0.
INFO 03-02 01:32:24 [logger.py:42] Received request cmpl-446960caf0314583bf882ea9eaf70e2e-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:32:24 [async_llm.py:261] Added request cmpl-446960caf0314583bf882ea9eaf70e2e-0.
INFO 03-02 01:32:25 [logger.py:42] Received request cmpl-82297e2088f24fd6800df71b53e034c0-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:32:25 [async_llm.py:261] Added request cmpl-82297e2088f24fd6800df71b53e034c0-0.
INFO 03-02 01:32:27 [logger.py:42] Received request cmpl-560abf6491a1434e9521bd15b464b4da-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:32:27 [async_llm.py:261] Added request cmpl-560abf6491a1434e9521bd15b464b4da-0.
INFO 03-02 01:32:28 [loggers.py:116] Engine 000: Avg prompt throughput: 24.8 tokens/s, Avg generation throughput: 4.0 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.6%
INFO 03-02 01:32:28 [logger.py:42] Received request cmpl-16ec9d80dcc4487a944acebf75237f53-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:32:28 [async_llm.py:261] Added request cmpl-16ec9d80dcc4487a944acebf75237f53-0.
INFO 03-02 01:32:29 [logger.py:42] Received request cmpl-0cdf4d07b87a46b6b215db156f5e6488-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:32:29 [async_llm.py:261] Added request cmpl-0cdf4d07b87a46b6b215db156f5e6488-0.
INFO 03-02 01:32:30 [logger.py:42] Received request cmpl-b762ccd679434c6191615ec268170a4e-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:32:30 [async_llm.py:261] Added request cmpl-b762ccd679434c6191615ec268170a4e-0.
INFO 03-02 01:32:31 [logger.py:42] Received request cmpl-aff9a1c66f7f44979cbcd2cf75ed5421-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:32:31 [async_llm.py:261] Added request cmpl-aff9a1c66f7f44979cbcd2cf75ed5421-0.
INFO 03-02 01:32:32 [logger.py:42] Received request cmpl-e68ba382a97747bcb49efac399e65f54-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:32:32 [async_llm.py:261] Added request cmpl-e68ba382a97747bcb49efac399e65f54-0.
INFO 03-02 01:32:34 [logger.py:42] Received request cmpl-b5b44c1f1187451fa79aa1b3e154be0b-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:32:34 [async_llm.py:261] Added request cmpl-b5b44c1f1187451fa79aa1b3e154be0b-0.
INFO 03-02 01:32:35 [logger.py:42] Received request cmpl-d83206baec364ff3a5d2d1b7b6a32875-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:32:35 [async_llm.py:261] Added request cmpl-d83206baec364ff3a5d2d1b7b6a32875-0.
INFO 03-02 01:32:36 [logger.py:42] Received request cmpl-1908bfb45b6b481ba15bb5805051da4d-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:32:36 [async_llm.py:261] Added request cmpl-1908bfb45b6b481ba15bb5805051da4d-0.
INFO 03-02 01:32:37 [logger.py:42] Received request cmpl-a44794ee24004827b739d6a20dcfb001-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:32:37 [async_llm.py:261] Added request cmpl-a44794ee24004827b739d6a20dcfb001-0.
INFO 03-02 01:32:38 [loggers.py:116] Engine 000: Avg prompt throughput: 27.9 tokens/s, Avg generation throughput: 4.5 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.6%
INFO 03-02 01:32:38 [logger.py:42] Received request cmpl-1405bb25eb494e73a585390923fa3abf-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:32:38 [async_llm.py:261] Added request cmpl-1405bb25eb494e73a585390923fa3abf-0.
INFO 03-02 01:32:39 [logger.py:42] Received request cmpl-551571d512e04626b5767a555ad143f5-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:32:39 [async_llm.py:261] Added request cmpl-551571d512e04626b5767a555ad143f5-0.
INFO 03-02 01:32:41 [logger.py:42] Received request cmpl-0052bc055b4e46f0863ee53d01eef28f-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:32:41 [async_llm.py:261] Added request cmpl-0052bc055b4e46f0863ee53d01eef28f-0.
INFO 03-02 01:32:42 [logger.py:42] Received request cmpl-2dfb7c6f16dc491293c16bf36afea8b4-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:32:42 [async_llm.py:261] Added request cmpl-2dfb7c6f16dc491293c16bf36afea8b4-0.
INFO 03-02 01:32:43 [logger.py:42] Received request cmpl-fd25993a5f3d468bb8488cd60b32efa2-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:32:43 [async_llm.py:261] Added request cmpl-fd25993a5f3d468bb8488cd60b32efa2-0.
INFO 03-02 01:32:44 [logger.py:42] Received request cmpl-b0487c8cd8c548989d4f16eca14a3d35-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:32:44 [async_llm.py:261] Added request cmpl-b0487c8cd8c548989d4f16eca14a3d35-0.
INFO 03-02 01:32:45 [logger.py:42] Received request cmpl-5ed91c25efe941e796e8ca26956516b8-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:32:45 [async_llm.py:261] Added request cmpl-5ed91c25efe941e796e8ca26956516b8-0.
INFO 03-02 01:32:46 [logger.py:42] Received request cmpl-31ff506edcf34a4e9994ebe12cdc1871-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:32:46 [async_llm.py:261] Added request cmpl-31ff506edcf34a4e9994ebe12cdc1871-0.
INFO 03-02 01:32:47 [logger.py:42] Received request cmpl-7a3588550a5f40538b8e39ecebca6262-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:32:47 [async_llm.py:261] Added request cmpl-7a3588550a5f40538b8e39ecebca6262-0.
INFO 03-02 01:32:48 [loggers.py:116] Engine 000: Avg prompt throughput: 27.9 tokens/s, Avg generation throughput: 4.5 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.6%
INFO 03-02 01:32:49 [logger.py:42] Received request cmpl-2e92b18ab0684bde8fcd28eaf46db5f1-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:32:49 [async_llm.py:261] Added request cmpl-2e92b18ab0684bde8fcd28eaf46db5f1-0.
INFO 03-02 01:32:50 [logger.py:42] Received request cmpl-b994399a917f4c23a335eba5f1f1a24b-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:32:50 [async_llm.py:261] Added request cmpl-b994399a917f4c23a335eba5f1f1a24b-0.
INFO 03-02 01:32:51 [logger.py:42] Received request cmpl-b7baaacde8d642608bb4593b61551f16-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:32:51 [async_llm.py:261] Added request cmpl-b7baaacde8d642608bb4593b61551f16-0.
INFO 03-02 01:32:52 [logger.py:42] Received request cmpl-6b3933f00de64c24bd2ca472c84ffe6c-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:32:52 [async_llm.py:261] Added request cmpl-6b3933f00de64c24bd2ca472c84ffe6c-0.
INFO 03-02 01:32:53 [logger.py:42] Received request cmpl-f56aa08d20da47e89a8de9a3677fef8b-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:32:53 [async_llm.py:261] Added request cmpl-f56aa08d20da47e89a8de9a3677fef8b-0.
INFO 03-02 01:32:54 [logger.py:42] Received request cmpl-f474c0d890cb4e6192c1fdb41e9c2ce1-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:32:54 [async_llm.py:261] Added request cmpl-f474c0d890cb4e6192c1fdb41e9c2ce1-0.
INFO 03-02 01:32:56 [logger.py:42] Received request cmpl-ffabaab01c304827b664a3995138ae7f-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:32:56 [async_llm.py:261] Added request cmpl-ffabaab01c304827b664a3995138ae7f-0.
INFO 03-02 01:32:57 [logger.py:42] Received request cmpl-39621a5f0fb5439d9ac09f48f5772ebd-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:32:57 [async_llm.py:261] Added request cmpl-39621a5f0fb5439d9ac09f48f5772ebd-0.
INFO 03-02 01:32:58 [loggers.py:116] Engine 000: Avg prompt throughput: 24.8 tokens/s, Avg generation throughput: 4.0 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.6%
INFO 03-02 01:32:58 [logger.py:42] Received request cmpl-602dcd6cd19244e1b3ea1bd0a4960c3b-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:32:58 [async_llm.py:261] Added request cmpl-602dcd6cd19244e1b3ea1bd0a4960c3b-0.
INFO 03-02 01:32:59 [logger.py:42] Received request cmpl-014fb4b5c39341f08a6d2526b29f3215-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:32:59 [async_llm.py:261] Added request cmpl-014fb4b5c39341f08a6d2526b29f3215-0.
INFO 03-02 01:33:00 [logger.py:42] Received request cmpl-dee9aca23ef34dd2b96952987904e6a3-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:33:00 [async_llm.py:261] Added request cmpl-dee9aca23ef34dd2b96952987904e6a3-0.
INFO 03-02 01:33:01 [logger.py:42] Received request cmpl-33c5b41246454dd285c0f2337727ab41-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:33:01 [async_llm.py:261] Added request cmpl-33c5b41246454dd285c0f2337727ab41-0.
INFO 03-02 01:33:02 [logger.py:42] Received request cmpl-3077e9b3ed2a4c818ab9a3734205cc29-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:33:02 [async_llm.py:261] Added request cmpl-3077e9b3ed2a4c818ab9a3734205cc29-0.
INFO 03-02 01:33:04 [logger.py:42] Received request cmpl-a657875d73b0486bb2e46f1aba84bcad-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:33:04 [async_llm.py:261] Added request cmpl-a657875d73b0486bb2e46f1aba84bcad-0.
INFO 03-02 01:33:05 [logger.py:42] Received request cmpl-2bc662622240419e965061fbf523b5d1-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:33:05 [async_llm.py:261] Added request cmpl-2bc662622240419e965061fbf523b5d1-0.
INFO 03-02 01:33:06 [logger.py:42] Received request cmpl-cad421a47db6455dbb5c9c40ab573180-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:33:06 [async_llm.py:261] Added request cmpl-cad421a47db6455dbb5c9c40ab573180-0.
INFO 03-02 01:33:07 [logger.py:42] Received request cmpl-0d1a887e76664f71b034df620b5dafe6-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:33:07 [async_llm.py:261] Added request cmpl-0d1a887e76664f71b034df620b5dafe6-0.
INFO 03-02 01:33:08 [loggers.py:116] Engine 000: Avg prompt throughput: 27.9 tokens/s, Avg generation throughput: 4.5 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.6%
INFO 03-02 01:33:08 [logger.py:42] Received request cmpl-21e3ac46ba9340cb953688436e5c6569-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:33:08 [async_llm.py:261] Added request cmpl-21e3ac46ba9340cb953688436e5c6569-0.
INFO 03-02 01:33:09 [logger.py:42] Received request cmpl-97d4cb471a484a7facec4bd80adee727-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:33:09 [async_llm.py:261] Added request cmpl-97d4cb471a484a7facec4bd80adee727-0.
INFO 03-02 01:33:10 [logger.py:42] Received request cmpl-321aa1b9f3874d298aa9a74eeb3862dd-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:33:10 [async_llm.py:261] Added request cmpl-321aa1b9f3874d298aa9a74eeb3862dd-0.
INFO 03-02 01:33:12 [logger.py:42] Received request cmpl-fd0a4831c7f14f0bb5f8314e46ea32b6-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:33:12 [async_llm.py:261] Added request cmpl-fd0a4831c7f14f0bb5f8314e46ea32b6-0.
INFO 03-02 01:33:13 [logger.py:42] Received request cmpl-435a02b8842c4ab9b296555a68473a39-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:33:13 [async_llm.py:261] Added request cmpl-435a02b8842c4ab9b296555a68473a39-0.
INFO 03-02 01:33:14 [logger.py:42] Received request cmpl-12dc3549781b4a94a8bb7646313f435b-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:33:14 [async_llm.py:261] Added request cmpl-12dc3549781b4a94a8bb7646313f435b-0.
INFO 03-02 01:33:15 [logger.py:42] Received request cmpl-603f1121fb104d41817966b7844a80a9-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:33:15 [async_llm.py:261] Added request cmpl-603f1121fb104d41817966b7844a80a9-0.
INFO 03-02 01:33:16 [logger.py:42] Received request cmpl-5a4d5e16e18242d09777589eab2cfe7f-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:33:16 [async_llm.py:261] Added request cmpl-5a4d5e16e18242d09777589eab2cfe7f-0.
INFO 03-02 01:33:17 [logger.py:42] Received request cmpl-42f56fc9cdba425bb7809a73918e7931-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:33:17 [async_llm.py:261] Added request cmpl-42f56fc9cdba425bb7809a73918e7931-0.
INFO 03-02 01:33:18 [loggers.py:116] Engine 000: Avg prompt throughput: 27.9 tokens/s, Avg generation throughput: 4.5 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.6%
INFO 03-02 01:33:19 [logger.py:42] Received request cmpl-9b86982542e2427d928778bbe7d819d9-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:33:19 [async_llm.py:261] Added request cmpl-9b86982542e2427d928778bbe7d819d9-0.
INFO 03-02 01:33:20 [logger.py:42] Received request cmpl-657534871970448b8d13916edd99dd71-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:33:20 [async_llm.py:261] Added request cmpl-657534871970448b8d13916edd99dd71-0.
INFO 03-02 01:33:21 [logger.py:42] Received request cmpl-a496b8abb12947928545d10c19df0bca-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:33:21 [async_llm.py:261] Added request cmpl-a496b8abb12947928545d10c19df0bca-0.
INFO 03-02 01:33:22 [logger.py:42] Received request cmpl-7680f3f58af64543babf9ba682ab61b5-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:33:22 [async_llm.py:261] Added request cmpl-7680f3f58af64543babf9ba682ab61b5-0.
INFO 03-02 01:33:23 [logger.py:42] Received request cmpl-472683cb6333435f962c0d7383613e2b-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:33:23 [async_llm.py:261] Added request cmpl-472683cb6333435f962c0d7383613e2b-0.
INFO 03-02 01:33:24 [logger.py:42] Received request cmpl-cbfe456baef54228b259193e1fe49f23-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:33:24 [async_llm.py:261] Added request cmpl-cbfe456baef54228b259193e1fe49f23-0.
INFO 03-02 01:33:26 [logger.py:42] Received request cmpl-48f523bf73ea426291364de27b8dc8a9-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:33:26 [async_llm.py:261] Added request cmpl-48f523bf73ea426291364de27b8dc8a9-0.
INFO 03-02 01:33:27 [logger.py:42] Received request cmpl-30df5ae49350498285a1a2da3cf9eff1-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:33:27 [async_llm.py:261] Added request cmpl-30df5ae49350498285a1a2da3cf9eff1-0.
INFO 03-02 01:33:28 [loggers.py:116] Engine 000: Avg prompt throughput: 24.8 tokens/s, Avg generation throughput: 4.0 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.6%
INFO 03-02 01:33:28 [logger.py:42] Received request cmpl-91b781e8267b4fc79d4c0e0339b71d9e-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:33:28 [async_llm.py:261] Added request cmpl-91b781e8267b4fc79d4c0e0339b71d9e-0.
INFO 03-02 01:33:29 [logger.py:42] Received request cmpl-85ab3917583f4e8d8a0fb433110e5110-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:33:29 [async_llm.py:261] Added request cmpl-85ab3917583f4e8d8a0fb433110e5110-0.
INFO 03-02 01:33:30 [logger.py:42] Received request cmpl-1552322e56e34355b311f22c4b70770c-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:33:30 [async_llm.py:261] Added request cmpl-1552322e56e34355b311f22c4b70770c-0.
INFO 03-02 01:33:31 [logger.py:42] Received request cmpl-29fd565911f24b56bd00b5eab57d026a-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:33:31 [async_llm.py:261] Added request cmpl-29fd565911f24b56bd00b5eab57d026a-0.
INFO 03-02 01:33:32 [logger.py:42] Received request cmpl-d2d50ef0f7c14fc6af2e47335dd548d4-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:33:32 [async_llm.py:261] Added request cmpl-d2d50ef0f7c14fc6af2e47335dd548d4-0.
INFO 03-02 01:33:34 [logger.py:42] Received request cmpl-39e0a26bc9644fba9a7342fa12c2bda1-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:33:34 [async_llm.py:261] Added request cmpl-39e0a26bc9644fba9a7342fa12c2bda1-0.
INFO 03-02 01:33:35 [logger.py:42] Received request cmpl-f2ea760971d740b194956f0909520fee-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:33:35 [async_llm.py:261] Added request cmpl-f2ea760971d740b194956f0909520fee-0.
INFO 03-02 01:33:36 [logger.py:42] Received request cmpl-07aedeee7dd2421c9f3f13ba2761b03b-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:33:36 [async_llm.py:261] Added request cmpl-07aedeee7dd2421c9f3f13ba2761b03b-0.
INFO 03-02 01:33:37 [logger.py:42] Received request cmpl-d7d970a6f66c43759928488acd50af56-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:33:37 [async_llm.py:261] Added request cmpl-d7d970a6f66c43759928488acd50af56-0.
INFO 03-02 01:33:38 [loggers.py:116] Engine 000: Avg prompt throughput: 27.9 tokens/s, Avg generation throughput: 4.5 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.6%
INFO 03-02 01:33:38 [logger.py:42] Received request cmpl-884019dd79f041868924ee94065ccaaf-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:33:38 [async_llm.py:261] Added request cmpl-884019dd79f041868924ee94065ccaaf-0.
INFO 03-02 01:33:39 [logger.py:42] Received request cmpl-b6fe7e23a0494c7bab847f40eb429e2b-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:33:39 [async_llm.py:261] Added request cmpl-b6fe7e23a0494c7bab847f40eb429e2b-0.
INFO 03-02 01:33:40 [logger.py:42] Received request cmpl-a96541a958524eccac73fb1ee41002cb-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:33:40 [async_llm.py:261] Added request cmpl-a96541a958524eccac73fb1ee41002cb-0.
INFO 03-02 01:33:42 [logger.py:42] Received request cmpl-b49c8a68019f4ca7845084fa246ae402-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:33:42 [async_llm.py:261] Added request cmpl-b49c8a68019f4ca7845084fa246ae402-0.
INFO 03-02 01:33:43 [logger.py:42] Received request cmpl-895fed5593f544638cc9f9363c621f5a-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:33:43 [async_llm.py:261] Added request cmpl-895fed5593f544638cc9f9363c621f5a-0.
INFO 03-02 01:33:44 [logger.py:42] Received request cmpl-a31408deab574cba9b8cccbedd5b15de-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:33:44 [async_llm.py:261] Added request cmpl-a31408deab574cba9b8cccbedd5b15de-0.
INFO 03-02 01:33:45 [logger.py:42] Received request cmpl-acacc6ecb815426db1ee1f5a5327bf6a-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:33:45 [async_llm.py:261] Added request cmpl-acacc6ecb815426db1ee1f5a5327bf6a-0.
INFO 03-02 01:33:46 [logger.py:42] Received request cmpl-822a1c5e7de349cf8b1ca02d2e336b3f-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:33:46 [async_llm.py:261] Added request cmpl-822a1c5e7de349cf8b1ca02d2e336b3f-0.
INFO 03-02 01:33:47 [logger.py:42] Received request cmpl-7dbd179c624b442e9f980981ba0c734e-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:33:47 [async_llm.py:261] Added request cmpl-7dbd179c624b442e9f980981ba0c734e-0.
INFO 03-02 01:33:48 [loggers.py:116] Engine 000: Avg prompt throughput: 27.9 tokens/s, Avg generation throughput: 4.5 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.6%
INFO 03-02 01:33:49 [logger.py:42] Received request cmpl-25628c769b154937aa4d0ed70a1ff589-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:33:49 [async_llm.py:261] Added request cmpl-25628c769b154937aa4d0ed70a1ff589-0.
INFO 03-02 01:33:50 [logger.py:42] Received request cmpl-2fb34f46f4ea40acb8b20a42ff58446b-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:33:50 [async_llm.py:261] Added request cmpl-2fb34f46f4ea40acb8b20a42ff58446b-0.
INFO 03-02 01:33:51 [logger.py:42] Received request cmpl-a6316be068eb4c99bdb705e670c1e89f-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:33:51 [async_llm.py:261] Added request cmpl-a6316be068eb4c99bdb705e670c1e89f-0.
INFO 03-02 01:33:52 [logger.py:42] Received request cmpl-b252ac236a234c62854b0c557c14805b-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:33:52 [async_llm.py:261] Added request cmpl-b252ac236a234c62854b0c557c14805b-0.
INFO 03-02 01:33:53 [logger.py:42] Received request cmpl-0ae7259efccd4761b8183ba97769c5fc-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:33:53 [async_llm.py:261] Added request cmpl-0ae7259efccd4761b8183ba97769c5fc-0.
INFO 03-02 01:33:54 [logger.py:42] Received request cmpl-d92e799055fa442d897e54a47721c00b-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:33:54 [async_llm.py:261] Added request cmpl-d92e799055fa442d897e54a47721c00b-0.
INFO 03-02 01:33:55 [logger.py:42] Received request cmpl-4d5e6c6580254961a6ff9c8e9b81a3a5-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:33:55 [async_llm.py:261] Added request cmpl-4d5e6c6580254961a6ff9c8e9b81a3a5-0.
INFO 03-02 01:33:57 [logger.py:42] Received request cmpl-0ade8dea1f5e4f1fbc5cc23af4e87e9c-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:33:57 [async_llm.py:261] Added request cmpl-0ade8dea1f5e4f1fbc5cc23af4e87e9c-0.
INFO 03-02 01:33:58 [loggers.py:116] Engine 000: Avg prompt throughput: 24.8 tokens/s, Avg generation throughput: 4.0 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.6%
INFO 03-02 01:33:58 [logger.py:42] Received request cmpl-733ea060399945b1934b5a4db3a52936-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:33:58 [async_llm.py:261] Added request cmpl-733ea060399945b1934b5a4db3a52936-0.
INFO 03-02 01:33:59 [logger.py:42] Received request cmpl-41dda04b06c044eab87b343e82f6e102-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:33:59 [async_llm.py:261] Added request cmpl-41dda04b06c044eab87b343e82f6e102-0.
INFO 03-02 01:34:00 [logger.py:42] Received request cmpl-ae06ac36717042e18efa0c44c442f92f-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:34:00 [async_llm.py:261] Added request cmpl-ae06ac36717042e18efa0c44c442f92f-0.
INFO 03-02 01:34:01 [logger.py:42] Received request cmpl-c506b08835d0499fa11f5a39cf82eef8-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:34:01 [async_llm.py:261] Added request cmpl-c506b08835d0499fa11f5a39cf82eef8-0.
INFO 03-02 01:34:02 [logger.py:42] Received request cmpl-2be5ae90b438499d94d288bf8feb5981-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:34:02 [async_llm.py:261] Added request cmpl-2be5ae90b438499d94d288bf8feb5981-0.
INFO 03-02 01:34:04 [logger.py:42] Received request cmpl-f7efca79bd574a0fa6ab6e34cf9a9d32-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:34:04 [async_llm.py:261] Added request cmpl-f7efca79bd574a0fa6ab6e34cf9a9d32-0.
INFO 03-02 01:34:05 [logger.py:42] Received request cmpl-0c2de3fd9d284cab92287de86f9adfa2-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:34:05 [async_llm.py:261] Added request cmpl-0c2de3fd9d284cab92287de86f9adfa2-0.
INFO 03-02 01:34:06 [logger.py:42] Received request cmpl-2f8f98ba1cad4a8cbe77f50e97ff55c9-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:34:06 [async_llm.py:261] Added request cmpl-2f8f98ba1cad4a8cbe77f50e97ff55c9-0.
INFO 03-02 01:34:07 [logger.py:42] Received request cmpl-f94193f89ee04e2db62bc1c7f43e1ff3-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:34:07 [async_llm.py:261] Added request cmpl-f94193f89ee04e2db62bc1c7f43e1ff3-0.
INFO 03-02 01:34:08 [loggers.py:116] Engine 000: Avg prompt throughput: 27.9 tokens/s, Avg generation throughput: 4.5 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.6%
INFO 03-02 01:34:08 [logger.py:42] Received request cmpl-51f47e26c152401792e21f1ec2e60ccf-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:34:08 [async_llm.py:261] Added request cmpl-51f47e26c152401792e21f1ec2e60ccf-0.
INFO 03-02 01:34:09 [logger.py:42] Received request cmpl-f76c7be6106c4a97ad0c9d07978418ed-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:34:09 [async_llm.py:261] Added request cmpl-f76c7be6106c4a97ad0c9d07978418ed-0.
INFO 03-02 01:34:10 [logger.py:42] Received request cmpl-f46871a2b1fc4e879893f2efed5a4c14-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:34:10 [async_llm.py:261] Added request cmpl-f46871a2b1fc4e879893f2efed5a4c14-0.
INFO 03-02 01:34:12 [logger.py:42] Received request cmpl-ca1a117fafdd45a390d3d88994d18397-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:34:12 [async_llm.py:261] Added request cmpl-ca1a117fafdd45a390d3d88994d18397-0.
INFO 03-02 01:34:13 [logger.py:42] Received request cmpl-24afef0ba8fb48d89ea545f0f059bc71-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:34:13 [async_llm.py:261] Added request cmpl-24afef0ba8fb48d89ea545f0f059bc71-0.
INFO 03-02 01:34:14 [logger.py:42] Received request cmpl-9dffa05164874157b03f01b74b99b69a-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:34:14 [async_llm.py:261] Added request cmpl-9dffa05164874157b03f01b74b99b69a-0.
INFO 03-02 01:34:15 [logger.py:42] Received request cmpl-91ea8a8590834d4899055eaf753eca1d-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:34:15 [async_llm.py:261] Added request cmpl-91ea8a8590834d4899055eaf753eca1d-0.
INFO 03-02 01:34:16 [logger.py:42] Received request cmpl-0f5d9abc712149df8616415ca7dacd8d-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:34:16 [async_llm.py:261] Added request cmpl-0f5d9abc712149df8616415ca7dacd8d-0.
INFO 03-02 01:34:17 [logger.py:42] Received request cmpl-8b3407d493e2417193dc29307c2cad7e-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:34:17 [async_llm.py:261] Added request cmpl-8b3407d493e2417193dc29307c2cad7e-0.
INFO 03-02 01:34:18 [loggers.py:116] Engine 000: Avg prompt throughput: 27.9 tokens/s, Avg generation throughput: 4.5 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.6%
INFO 03-02 01:34:19 [logger.py:42] Received request cmpl-02b0f22cdee04c10994ad65a68f5bcc7-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:34:19 [async_llm.py:261] Added request cmpl-02b0f22cdee04c10994ad65a68f5bcc7-0.
INFO 03-02 01:34:20 [logger.py:42] Received request cmpl-32b855a36acc457081072050814e0bb8-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:34:20 [async_llm.py:261] Added request cmpl-32b855a36acc457081072050814e0bb8-0.
INFO 03-02 01:34:21 [logger.py:42] Received request cmpl-8d5691d880d14923af92cabcb7356bc1-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:34:21 [async_llm.py:261] Added request cmpl-8d5691d880d14923af92cabcb7356bc1-0.
INFO 03-02 01:34:22 [logger.py:42] Received request cmpl-33497417bea74441b2ef423be06257ca-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:34:22 [async_llm.py:261] Added request cmpl-33497417bea74441b2ef423be06257ca-0.
INFO 03-02 01:34:23 [logger.py:42] Received request cmpl-d864f72b57464777b2789b18cd9efd30-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:34:23 [async_llm.py:261] Added request cmpl-d864f72b57464777b2789b18cd9efd30-0.
INFO 03-02 01:34:24 [logger.py:42] Received request cmpl-d2f38eb178ce436791549cf44fe7e66e-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:34:24 [async_llm.py:261] Added request cmpl-d2f38eb178ce436791549cf44fe7e66e-0.
INFO 03-02 01:34:25 [logger.py:42] Received request cmpl-0318f2610e7145d0985664db294b42b2-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:34:25 [async_llm.py:261] Added request cmpl-0318f2610e7145d0985664db294b42b2-0.
INFO 03-02 01:34:27 [logger.py:42] Received request cmpl-a700cb3844974e14a5eec64849f7920f-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:34:27 [async_llm.py:261] Added request cmpl-a700cb3844974e14a5eec64849f7920f-0.
INFO 03-02 01:34:28 [loggers.py:116] Engine 000: Avg prompt throughput: 24.8 tokens/s, Avg generation throughput: 4.0 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.6%
INFO 03-02 01:34:28 [logger.py:42] Received request cmpl-a59f1167f8ed463583ebe7d6eed99282-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:34:28 [async_llm.py:261] Added request cmpl-a59f1167f8ed463583ebe7d6eed99282-0.
INFO 03-02 01:34:29 [logger.py:42] Received request cmpl-3672608aa6c74ddeacfe4f3ac324c789-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:34:29 [async_llm.py:261] Added request cmpl-3672608aa6c74ddeacfe4f3ac324c789-0.
INFO 03-02 01:34:30 [logger.py:42] Received request cmpl-4b37f8f7991d4727852f3f2127bd275e-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:34:30 [async_llm.py:261] Added request cmpl-4b37f8f7991d4727852f3f2127bd275e-0.
INFO 03-02 01:34:31 [logger.py:42] Received request cmpl-8f911b5a92b84c059dc6bfafcfbcd9d8-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:34:31 [async_llm.py:261] Added request cmpl-8f911b5a92b84c059dc6bfafcfbcd9d8-0.
INFO 03-02 01:34:32 [logger.py:42] Received request cmpl-099568a86472422a9827e14dd3b9da9a-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:34:32 [async_llm.py:261] Added request cmpl-099568a86472422a9827e14dd3b9da9a-0.
INFO 03-02 01:34:34 [logger.py:42] Received request cmpl-a66256ae8c9f4a1f9c4b1fb4e8c3c743-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:34:34 [async_llm.py:261] Added request cmpl-a66256ae8c9f4a1f9c4b1fb4e8c3c743-0.
INFO 03-02 01:34:35 [logger.py:42] Received request cmpl-2458329e43174d76bdd3dfc14ae9b305-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:34:35 [async_llm.py:261] Added request cmpl-2458329e43174d76bdd3dfc14ae9b305-0.
INFO 03-02 01:34:36 [logger.py:42] Received request cmpl-232ce558a45d47f49688406cb6cf3ec1-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:34:36 [async_llm.py:261] Added request cmpl-232ce558a45d47f49688406cb6cf3ec1-0.
INFO 03-02 01:34:37 [logger.py:42] Received request cmpl-9769b2db5101422ea98651ec908377d5-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:34:37 [async_llm.py:261] Added request cmpl-9769b2db5101422ea98651ec908377d5-0.
INFO 03-02 01:34:38 [loggers.py:116] Engine 000: Avg prompt throughput: 27.9 tokens/s, Avg generation throughput: 4.5 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.6%
INFO 03-02 01:34:38 [logger.py:42] Received request cmpl-4afe14abbc194ef8a80239fbed351b72-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:34:38 [async_llm.py:261] Added request cmpl-4afe14abbc194ef8a80239fbed351b72-0.
INFO 03-02 01:34:39 [logger.py:42] Received request cmpl-d739dada77634f17ad2ac7e52a3c6f9d-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:34:39 [async_llm.py:261] Added request cmpl-d739dada77634f17ad2ac7e52a3c6f9d-0.
INFO 03-02 01:34:40 [logger.py:42] Received request cmpl-b3b7b4a4ad734ff8872b27a24423834c-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:34:40 [async_llm.py:261] Added request cmpl-b3b7b4a4ad734ff8872b27a24423834c-0.
INFO 03-02 01:34:42 [logger.py:42] Received request cmpl-3e379d1f35c948489e755413005cc55f-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:34:42 [async_llm.py:261] Added request cmpl-3e379d1f35c948489e755413005cc55f-0.
INFO 03-02 01:34:43 [logger.py:42] Received request cmpl-c3562ed2f9db41909c4da98de2d8598c-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:34:43 [async_llm.py:261] Added request cmpl-c3562ed2f9db41909c4da98de2d8598c-0.
INFO 03-02 01:34:44 [logger.py:42] Received request cmpl-2c6c6413b7744e049ad1c7748da285dd-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:34:44 [async_llm.py:261] Added request cmpl-2c6c6413b7744e049ad1c7748da285dd-0.
INFO 03-02 01:34:45 [logger.py:42] Received request cmpl-9707dbe35623405eadb6e212d2be1d07-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:34:45 [async_llm.py:261] Added request cmpl-9707dbe35623405eadb6e212d2be1d07-0.
INFO 03-02 01:34:46 [logger.py:42] Received request cmpl-bb57f8f4b7c1487a90ba52dd28bbc0fa-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:34:46 [async_llm.py:261] Added request cmpl-bb57f8f4b7c1487a90ba52dd28bbc0fa-0.
INFO 03-02 01:34:47 [logger.py:42] Received request cmpl-ff0ca5e8a4eb435ba1fcb53ecf3bddc2-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:34:47 [async_llm.py:261] Added request cmpl-ff0ca5e8a4eb435ba1fcb53ecf3bddc2-0.
INFO 03-02 01:34:48 [loggers.py:116] Engine 000: Avg prompt throughput: 27.9 tokens/s, Avg generation throughput: 4.5 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.6%
INFO 03-02 01:34:49 [logger.py:42] Received request cmpl-e451ad75d675492eb625467e4fdbae0c-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:34:49 [async_llm.py:261] Added request cmpl-e451ad75d675492eb625467e4fdbae0c-0.
INFO 03-02 01:34:50 [logger.py:42] Received request cmpl-7a8afcb296b1459ea8346641b150a0fb-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:34:50 [async_llm.py:261] Added request cmpl-7a8afcb296b1459ea8346641b150a0fb-0.
INFO 03-02 01:34:51 [logger.py:42] Received request cmpl-4e773aeb87c74d6f8cf8363838b2db5a-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:34:51 [async_llm.py:261] Added request cmpl-4e773aeb87c74d6f8cf8363838b2db5a-0.
INFO 03-02 01:34:52 [logger.py:42] Received request cmpl-c460366a27044d55b16eebf3bfa65207-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:34:52 [async_llm.py:261] Added request cmpl-c460366a27044d55b16eebf3bfa65207-0.
INFO 03-02 01:34:53 [logger.py:42] Received request cmpl-a00e7925f5474620b052012363e40b89-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:34:53 [async_llm.py:261] Added request cmpl-a00e7925f5474620b052012363e40b89-0.
INFO 03-02 01:34:54 [logger.py:42] Received request cmpl-a96723150252489ba893bd8ad5d5076d-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:34:54 [async_llm.py:261] Added request cmpl-a96723150252489ba893bd8ad5d5076d-0.
INFO 03-02 01:34:55 [logger.py:42] Received request cmpl-b41ffdc65e8d4c71b3421416c0dec54a-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:34:55 [async_llm.py:261] Added request cmpl-b41ffdc65e8d4c71b3421416c0dec54a-0.
INFO 03-02 01:34:57 [logger.py:42] Received request cmpl-64c4e70b3dab473a96855b9a0088ad3f-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:34:57 [async_llm.py:261] Added request cmpl-64c4e70b3dab473a96855b9a0088ad3f-0.
INFO 03-02 01:34:58 [loggers.py:116] Engine 000: Avg prompt throughput: 24.8 tokens/s, Avg generation throughput: 4.0 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.6%
INFO 03-02 01:34:58 [logger.py:42] Received request cmpl-d4a825d872ea40a7926d9eea65275885-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:34:58 [async_llm.py:261] Added request cmpl-d4a825d872ea40a7926d9eea65275885-0.
INFO 03-02 01:34:59 [logger.py:42] Received request cmpl-4b176565fec44dd3911b1dedb768c39e-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:34:59 [async_llm.py:261] Added request cmpl-4b176565fec44dd3911b1dedb768c39e-0.
INFO 03-02 01:35:00 [logger.py:42] Received request cmpl-ba8b3c78b42040bbaf229024b9b19af2-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:35:00 [async_llm.py:261] Added request cmpl-ba8b3c78b42040bbaf229024b9b19af2-0.
INFO 03-02 01:35:01 [logger.py:42] Received request cmpl-b2975be0a7224a788509ea627793e05b-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:35:01 [async_llm.py:261] Added request cmpl-b2975be0a7224a788509ea627793e05b-0.
INFO 03-02 01:35:02 [logger.py:42] Received request cmpl-5ef9949788cf4f40a88554025b4edbfc-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:35:02 [async_llm.py:261] Added request cmpl-5ef9949788cf4f40a88554025b4edbfc-0.
INFO 03-02 01:35:04 [logger.py:42] Received request cmpl-64abe7d5aee645a49d515d2cc7794b40-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:35:04 [async_llm.py:261] Added request cmpl-64abe7d5aee645a49d515d2cc7794b40-0.
INFO 03-02 01:35:05 [logger.py:42] Received request cmpl-18625c23756240bc8b985d3b6e88f606-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:35:05 [async_llm.py:261] Added request cmpl-18625c23756240bc8b985d3b6e88f606-0.
INFO 03-02 01:35:06 [logger.py:42] Received request cmpl-00ffb7dd7194472f9d6053e91084d2e2-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:35:06 [async_llm.py:261] Added request cmpl-00ffb7dd7194472f9d6053e91084d2e2-0.
INFO 03-02 01:35:07 [logger.py:42] Received request cmpl-14177ef5d9624318a769dc395ff77468-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:35:07 [async_llm.py:261] Added request cmpl-14177ef5d9624318a769dc395ff77468-0.
INFO 03-02 01:35:08 [loggers.py:116] Engine 000: Avg prompt throughput: 27.9 tokens/s, Avg generation throughput: 4.5 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.6%
INFO 03-02 01:35:08 [logger.py:42] Received request cmpl-9a6aa6e9dcf54851bc33f0392e934827-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:35:08 [async_llm.py:261] Added request cmpl-9a6aa6e9dcf54851bc33f0392e934827-0.
INFO 03-02 01:35:09 [logger.py:42] Received request cmpl-144434708b8744c0a67b7dc8357311f5-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:35:09 [async_llm.py:261] Added request cmpl-144434708b8744c0a67b7dc8357311f5-0.
INFO 03-02 01:35:10 [logger.py:42] Received request cmpl-7ab6598e061b4c52b38d890fd94c2ac5-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:35:10 [async_llm.py:261] Added request cmpl-7ab6598e061b4c52b38d890fd94c2ac5-0.
INFO 03-02 01:35:12 [logger.py:42] Received request cmpl-d6326d86e93843ffbda678d60f4e8762-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:35:12 [async_llm.py:261] Added request cmpl-d6326d86e93843ffbda678d60f4e8762-0.
INFO 03-02 01:35:13 [logger.py:42] Received request cmpl-1288745745a5458cb4d698a5232630fe-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:35:13 [async_llm.py:261] Added request cmpl-1288745745a5458cb4d698a5232630fe-0.
INFO 03-02 01:35:14 [logger.py:42] Received request cmpl-6515827f509543b0bcee98008b548318-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:35:14 [async_llm.py:261] Added request cmpl-6515827f509543b0bcee98008b548318-0.
INFO 03-02 01:35:15 [logger.py:42] Received request cmpl-b7f3299fc95746099c3ee6e54a5f6d64-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:35:15 [async_llm.py:261] Added request cmpl-b7f3299fc95746099c3ee6e54a5f6d64-0.
INFO 03-02 01:35:16 [logger.py:42] Received request cmpl-43e769345b2b452ab7134965d5ada650-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:35:16 [async_llm.py:261] Added request cmpl-43e769345b2b452ab7134965d5ada650-0.
INFO 03-02 01:35:17 [logger.py:42] Received request cmpl-936be318dbbe4a9487afcd22492fe6cb-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:35:17 [async_llm.py:261] Added request cmpl-936be318dbbe4a9487afcd22492fe6cb-0.
INFO 03-02 01:35:18 [loggers.py:116] Engine 000: Avg prompt throughput: 27.9 tokens/s, Avg generation throughput: 4.5 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.6%
INFO 03-02 01:35:19 [logger.py:42] Received request cmpl-81776f7398ce4e27b9ca9b825cc4e19a-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:35:19 [async_llm.py:261] Added request cmpl-81776f7398ce4e27b9ca9b825cc4e19a-0.
INFO 03-02 01:35:20 [logger.py:42] Received request cmpl-a41964e7df6e40aa8d41520dcdc8a919-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:35:20 [async_llm.py:261] Added request cmpl-a41964e7df6e40aa8d41520dcdc8a919-0.
INFO 03-02 01:35:21 [logger.py:42] Received request cmpl-2ff34ff90dec494fad97fbdc645106e1-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:35:21 [async_llm.py:261] Added request cmpl-2ff34ff90dec494fad97fbdc645106e1-0.
INFO 03-02 01:35:22 [logger.py:42] Received request cmpl-1d8b251c94a44b76adc31890789cc838-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:35:22 [async_llm.py:261] Added request cmpl-1d8b251c94a44b76adc31890789cc838-0.
INFO 03-02 01:35:23 [logger.py:42] Received request cmpl-1b4645c1dc754d8d8b272e3ffac2f392-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:35:23 [async_llm.py:261] Added request cmpl-1b4645c1dc754d8d8b272e3ffac2f392-0.
INFO 03-02 01:35:24 [logger.py:42] Received request cmpl-6cce0075a7064a3b9d05689578c976c8-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:35:24 [async_llm.py:261] Added request cmpl-6cce0075a7064a3b9d05689578c976c8-0.
INFO 03-02 01:35:25 [logger.py:42] Received request cmpl-a438f025389c4d999a9b12010c6a5440-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:35:25 [async_llm.py:261] Added request cmpl-a438f025389c4d999a9b12010c6a5440-0.
INFO 03-02 01:35:27 [logger.py:42] Received request cmpl-c5ad97b41ca94c37b41a5739b49c4adc-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:35:27 [async_llm.py:261] Added request cmpl-c5ad97b41ca94c37b41a5739b49c4adc-0.
INFO 03-02 01:35:28 [loggers.py:116] Engine 000: Avg prompt throughput: 24.8 tokens/s, Avg generation throughput: 4.0 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.6%
INFO 03-02 01:35:28 [logger.py:42] Received request cmpl-422cfa3bae184913bab4f4d660464919-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:35:28 [async_llm.py:261] Added request cmpl-422cfa3bae184913bab4f4d660464919-0.
INFO 03-02 01:35:29 [logger.py:42] Received request cmpl-d16a17b7a1774f75a18eacf7e6d9cb62-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:35:29 [async_llm.py:261] Added request cmpl-d16a17b7a1774f75a18eacf7e6d9cb62-0.
INFO 03-02 01:35:30 [logger.py:42] Received request cmpl-d5d584dda8f040a89478140892593d6f-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:35:30 [async_llm.py:261] Added request cmpl-d5d584dda8f040a89478140892593d6f-0.
INFO 03-02 01:35:31 [logger.py:42] Received request cmpl-3a8b895049a048e89844ee6df8a6c49f-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:35:31 [async_llm.py:261] Added request cmpl-3a8b895049a048e89844ee6df8a6c49f-0.
INFO 03-02 01:35:32 [logger.py:42] Received request cmpl-dc89c510609f4699bc08178aef2ca137-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:35:32 [async_llm.py:261] Added request cmpl-dc89c510609f4699bc08178aef2ca137-0.
INFO 03-02 01:35:34 [logger.py:42] Received request cmpl-87142c5ec34c40c6988c2e7341b69f74-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:35:34 [async_llm.py:261] Added request cmpl-87142c5ec34c40c6988c2e7341b69f74-0.
INFO 03-02 01:35:35 [logger.py:42] Received request cmpl-c7e1a3a14de549deaf9bc2231fa4be44-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:35:35 [async_llm.py:261] Added request cmpl-c7e1a3a14de549deaf9bc2231fa4be44-0.
INFO 03-02 01:35:36 [logger.py:42] Received request cmpl-39d1c204fe17404c973cadb9d78c7f49-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:35:36 [async_llm.py:261] Added request cmpl-39d1c204fe17404c973cadb9d78c7f49-0.
INFO 03-02 01:35:37 [logger.py:42] Received request cmpl-ced2e90d65414b98bc8f6aaa3545d575-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:35:37 [async_llm.py:261] Added request cmpl-ced2e90d65414b98bc8f6aaa3545d575-0.
INFO 03-02 01:35:38 [loggers.py:116] Engine 000: Avg prompt throughput: 27.9 tokens/s, Avg generation throughput: 4.5 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.6%
INFO 03-02 01:35:38 [logger.py:42] Received request cmpl-531eadcb1a3d4f0faaeece4d60acef54-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:35:38 [async_llm.py:261] Added request cmpl-531eadcb1a3d4f0faaeece4d60acef54-0.
INFO 03-02 01:35:39 [logger.py:42] Received request cmpl-6ea9346c8eb445708093339bcc4a12fc-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:35:39 [async_llm.py:261] Added request cmpl-6ea9346c8eb445708093339bcc4a12fc-0.
INFO 03-02 01:35:40 [logger.py:42] Received request cmpl-35f9364306e84d1b8c8565a78f8d3d32-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:35:40 [async_llm.py:261] Added request cmpl-35f9364306e84d1b8c8565a78f8d3d32-0.
INFO 03-02 01:35:42 [logger.py:42] Received request cmpl-445ebcf4ab864677a6b5f278507bd948-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:35:42 [async_llm.py:261] Added request cmpl-445ebcf4ab864677a6b5f278507bd948-0.
INFO 03-02 01:35:43 [logger.py:42] Received request cmpl-5da12d0d046941fb8d0b6c5c0fbb533b-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:35:43 [async_llm.py:261] Added request cmpl-5da12d0d046941fb8d0b6c5c0fbb533b-0.
INFO 03-02 01:35:44 [logger.py:42] Received request cmpl-cff2d7297e544bd297355e3ed7164665-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:35:44 [async_llm.py:261] Added request cmpl-cff2d7297e544bd297355e3ed7164665-0.
INFO 03-02 01:35:45 [logger.py:42] Received request cmpl-e0fc30dc1e7d49b78704b33a9fde2ccb-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:35:45 [async_llm.py:261] Added request cmpl-e0fc30dc1e7d49b78704b33a9fde2ccb-0.
INFO 03-02 01:35:46 [logger.py:42] Received request cmpl-f8b4a275bac841cd9191a633cc8cd421-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:35:46 [async_llm.py:261] Added request cmpl-f8b4a275bac841cd9191a633cc8cd421-0.
INFO 03-02 01:35:47 [logger.py:42] Received request cmpl-07d4245c3db54645a6a38d2e917febee-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:35:47 [async_llm.py:261] Added request cmpl-07d4245c3db54645a6a38d2e917febee-0.
INFO 03-02 01:35:48 [loggers.py:116] Engine 000: Avg prompt throughput: 27.9 tokens/s, Avg generation throughput: 4.5 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.6%
INFO 03-02 01:35:49 [logger.py:42] Received request cmpl-ed1d70b0766d41d4982aef157afaf2d1-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:35:49 [async_llm.py:261] Added request cmpl-ed1d70b0766d41d4982aef157afaf2d1-0.
INFO 03-02 01:35:50 [logger.py:42] Received request cmpl-e6e21e9ac7e14a989894238911540f76-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:35:50 [async_llm.py:261] Added request cmpl-e6e21e9ac7e14a989894238911540f76-0.
INFO 03-02 01:35:51 [logger.py:42] Received request cmpl-d980b939e49d471cb8eaa23d9d7acc9b-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:35:51 [async_llm.py:261] Added request cmpl-d980b939e49d471cb8eaa23d9d7acc9b-0.
INFO 03-02 01:35:52 [logger.py:42] Received request cmpl-5559e37e3d09493987ef8df30a8b6b4b-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:35:52 [async_llm.py:261] Added request cmpl-5559e37e3d09493987ef8df30a8b6b4b-0.
INFO 03-02 01:35:53 [logger.py:42] Received request cmpl-6877dca48b374744a8bb32da00bc9c60-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:35:53 [async_llm.py:261] Added request cmpl-6877dca48b374744a8bb32da00bc9c60-0.
INFO 03-02 01:35:54 [logger.py:42] Received request cmpl-f9dc1801d92e4e4a874631da3ca36aa4-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:35:54 [async_llm.py:261] Added request cmpl-f9dc1801d92e4e4a874631da3ca36aa4-0.
INFO 03-02 01:35:55 [logger.py:42] Received request cmpl-b6638449089b470983a0cb65e0a230a0-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:35:55 [async_llm.py:261] Added request cmpl-b6638449089b470983a0cb65e0a230a0-0.
INFO 03-02 01:35:57 [logger.py:42] Received request cmpl-95c4b2a92f7b4b0482c8b0ef7f72ebf8-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:35:57 [async_llm.py:261] Added request cmpl-95c4b2a92f7b4b0482c8b0ef7f72ebf8-0.
INFO 03-02 01:35:58 [loggers.py:116] Engine 000: Avg prompt throughput: 24.8 tokens/s, Avg generation throughput: 4.0 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.6%
INFO 03-02 01:35:58 [logger.py:42] Received request cmpl-bad94ee93941443b8490310b930d5877-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:35:58 [async_llm.py:261] Added request cmpl-bad94ee93941443b8490310b930d5877-0.
INFO 03-02 01:35:59 [logger.py:42] Received request cmpl-2b1e795040f94caaa9c29ef875b2136e-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:35:59 [async_llm.py:261] Added request cmpl-2b1e795040f94caaa9c29ef875b2136e-0.
INFO 03-02 01:36:00 [logger.py:42] Received request cmpl-9188df37717344b0aa7eac872ef880d2-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:36:00 [async_llm.py:261] Added request cmpl-9188df37717344b0aa7eac872ef880d2-0.
INFO 03-02 01:36:01 [logger.py:42] Received request cmpl-6a2028f53ed14da99a654d1a9f7519d9-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:36:01 [async_llm.py:261] Added request cmpl-6a2028f53ed14da99a654d1a9f7519d9-0.
INFO 03-02 01:36:02 [logger.py:42] Received request cmpl-c8e4987db0844356bb1074ab211cfab5-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:36:02 [async_llm.py:261] Added request cmpl-c8e4987db0844356bb1074ab211cfab5-0.
INFO 03-02 01:36:04 [logger.py:42] Received request cmpl-7af7bf8e06814f3d872a2a1b11838753-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:36:04 [async_llm.py:261] Added request cmpl-7af7bf8e06814f3d872a2a1b11838753-0.
INFO 03-02 01:36:05 [logger.py:42] Received request cmpl-02e0a0efa16a41e8a6c38a4fef2fa202-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:36:05 [async_llm.py:261] Added request cmpl-02e0a0efa16a41e8a6c38a4fef2fa202-0.
INFO 03-02 01:36:06 [logger.py:42] Received request cmpl-61049437d70a4d00b0582e1cc1557369-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:36:06 [async_llm.py:261] Added request cmpl-61049437d70a4d00b0582e1cc1557369-0.
INFO 03-02 01:36:07 [logger.py:42] Received request cmpl-b9496681018c4f8d977f953ba02bea12-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:36:07 [async_llm.py:261] Added request cmpl-b9496681018c4f8d977f953ba02bea12-0.
INFO 03-02 01:36:08 [loggers.py:116] Engine 000: Avg prompt throughput: 27.9 tokens/s, Avg generation throughput: 4.5 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.6%
INFO 03-02 01:36:08 [logger.py:42] Received request cmpl-ef0b63ff80304f828ca289e98b2e04dc-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:36:08 [async_llm.py:261] Added request cmpl-ef0b63ff80304f828ca289e98b2e04dc-0.
INFO 03-02 01:36:09 [logger.py:42] Received request cmpl-2a8cbdfb9ff549bfaf50d5b6333f08bc-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:36:09 [async_llm.py:261] Added request cmpl-2a8cbdfb9ff549bfaf50d5b6333f08bc-0.
INFO 03-02 01:36:10 [logger.py:42] Received request cmpl-175fb30bd69841ecb0392f05920b0334-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:36:10 [async_llm.py:261] Added request cmpl-175fb30bd69841ecb0392f05920b0334-0.
INFO 03-02 01:36:12 [logger.py:42] Received request cmpl-030446d865924812a23fb02da7a91417-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:36:12 [async_llm.py:261] Added request cmpl-030446d865924812a23fb02da7a91417-0.
INFO 03-02 01:36:13 [logger.py:42] Received request cmpl-ef94b26890fd40b8bb9560278f165009-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:36:13 [async_llm.py:261] Added request cmpl-ef94b26890fd40b8bb9560278f165009-0.
INFO 03-02 01:36:14 [logger.py:42] Received request cmpl-45a7df4a6c8c4bf0aac64c6ec44746c2-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:36:14 [async_llm.py:261] Added request cmpl-45a7df4a6c8c4bf0aac64c6ec44746c2-0.
INFO 03-02 01:36:15 [logger.py:42] Received request cmpl-fce56177e5844c37b299284d552427e6-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:36:15 [async_llm.py:261] Added request cmpl-fce56177e5844c37b299284d552427e6-0.
INFO 03-02 01:36:16 [logger.py:42] Received request cmpl-273793c9489d4b33b433404e0ae46f4d-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:36:16 [async_llm.py:261] Added request cmpl-273793c9489d4b33b433404e0ae46f4d-0.
INFO 03-02 01:36:17 [logger.py:42] Received request cmpl-edc650f9ca4d4740b27c53ca3e5ef130-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:36:17 [async_llm.py:261] Added request cmpl-edc650f9ca4d4740b27c53ca3e5ef130-0.
INFO 03-02 01:36:18 [loggers.py:116] Engine 000: Avg prompt throughput: 27.9 tokens/s, Avg generation throughput: 4.5 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.6%
INFO 03-02 01:36:19 [logger.py:42] Received request cmpl-46cff059a7cc465ba72fa588adf4545f-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:36:19 [async_llm.py:261] Added request cmpl-46cff059a7cc465ba72fa588adf4545f-0.
INFO 03-02 01:36:20 [logger.py:42] Received request cmpl-b37e9acd77704fed9543743d7358fdf8-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:36:20 [async_llm.py:261] Added request cmpl-b37e9acd77704fed9543743d7358fdf8-0.
INFO 03-02 01:36:21 [logger.py:42] Received request cmpl-af1b19310f3b407eb5534ae2d8c24eb7-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:36:21 [async_llm.py:261] Added request cmpl-af1b19310f3b407eb5534ae2d8c24eb7-0.
INFO 03-02 01:36:22 [logger.py:42] Received request cmpl-0fbacad696dc4dc390723ae69e99fe7b-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:36:22 [async_llm.py:261] Added request cmpl-0fbacad696dc4dc390723ae69e99fe7b-0.
INFO 03-02 01:36:23 [logger.py:42] Received request cmpl-6169038a1b8d47669135bd5214d63be1-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:36:23 [async_llm.py:261] Added request cmpl-6169038a1b8d47669135bd5214d63be1-0.
INFO 03-02 01:36:24 [logger.py:42] Received request cmpl-ec1ba75dd1f545dfb3fabfeba4a873f3-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:36:24 [async_llm.py:261] Added request cmpl-ec1ba75dd1f545dfb3fabfeba4a873f3-0.
INFO 03-02 01:36:25 [logger.py:42] Received request cmpl-bc6e9fcaa09f4db59358da72d9121a4a-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:36:25 [async_llm.py:261] Added request cmpl-bc6e9fcaa09f4db59358da72d9121a4a-0.
INFO 03-02 01:36:27 [logger.py:42] Received request cmpl-e02a8495512c4ba1b1414d151a58cf32-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:36:27 [async_llm.py:261] Added request cmpl-e02a8495512c4ba1b1414d151a58cf32-0.
INFO 03-02 01:36:28 [loggers.py:116] Engine 000: Avg prompt throughput: 24.8 tokens/s, Avg generation throughput: 4.0 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.6%
INFO 03-02 01:36:28 [logger.py:42] Received request cmpl-aed80c5b1a2b440aab6eaa57f8b7863f-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:36:28 [async_llm.py:261] Added request cmpl-aed80c5b1a2b440aab6eaa57f8b7863f-0.
INFO 03-02 01:36:29 [logger.py:42] Received request cmpl-036454af3c324fe8830d80d9521f1333-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:36:29 [async_llm.py:261] Added request cmpl-036454af3c324fe8830d80d9521f1333-0.
INFO 03-02 01:36:30 [logger.py:42] Received request cmpl-46bda4bec1b148c7bb5975d6d9648926-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:36:30 [async_llm.py:261] Added request cmpl-46bda4bec1b148c7bb5975d6d9648926-0.
INFO 03-02 01:36:31 [logger.py:42] Received request cmpl-c08270f134bf46baa5d81acf0ff137bc-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:36:31 [async_llm.py:261] Added request cmpl-c08270f134bf46baa5d81acf0ff137bc-0.
INFO 03-02 01:36:32 [logger.py:42] Received request cmpl-cf628d449f14442e9d00f1fee6356889-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:36:32 [async_llm.py:261] Added request cmpl-cf628d449f14442e9d00f1fee6356889-0.
INFO 03-02 01:36:34 [logger.py:42] Received request cmpl-59a5dc55e60d47a1a8027cdf67591593-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:36:34 [async_llm.py:261] Added request cmpl-59a5dc55e60d47a1a8027cdf67591593-0.
INFO 03-02 01:36:35 [logger.py:42] Received request cmpl-d18bbba92f644e1db20bd6d1dde82195-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:36:35 [async_llm.py:261] Added request cmpl-d18bbba92f644e1db20bd6d1dde82195-0.
INFO 03-02 01:36:36 [logger.py:42] Received request cmpl-f460afa545c949c1a5ab7b49d567cf97-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:36:36 [async_llm.py:261] Added request cmpl-f460afa545c949c1a5ab7b49d567cf97-0.
INFO 03-02 01:36:37 [logger.py:42] Received request cmpl-bca4b7112017453c8f1db409e9d8cc5d-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:36:37 [async_llm.py:261] Added request cmpl-bca4b7112017453c8f1db409e9d8cc5d-0.
INFO 03-02 01:36:38 [loggers.py:116] Engine 000: Avg prompt throughput: 27.9 tokens/s, Avg generation throughput: 4.5 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.6%
INFO 03-02 01:36:38 [logger.py:42] Received request cmpl-7048f781236e448380721e6374101df3-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:36:38 [async_llm.py:261] Added request cmpl-7048f781236e448380721e6374101df3-0.
INFO 03-02 01:36:39 [logger.py:42] Received request cmpl-a0662cfa7ba4420e98ef558671c13218-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:36:39 [async_llm.py:261] Added request cmpl-a0662cfa7ba4420e98ef558671c13218-0.
INFO 03-02 01:36:40 [logger.py:42] Received request cmpl-2bdbf9eb54b642fab54e61f1834ed8ed-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:36:40 [async_llm.py:261] Added request cmpl-2bdbf9eb54b642fab54e61f1834ed8ed-0.
INFO 03-02 01:36:42 [logger.py:42] Received request cmpl-83f92121f7eb48e5a2bd3a0631266c40-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:36:42 [async_llm.py:261] Added request cmpl-83f92121f7eb48e5a2bd3a0631266c40-0.
INFO 03-02 01:36:43 [logger.py:42] Received request cmpl-d9c8180115b141b7b81120c8a469587f-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:36:43 [async_llm.py:261] Added request cmpl-d9c8180115b141b7b81120c8a469587f-0.
INFO 03-02 01:36:44 [logger.py:42] Received request cmpl-09e3181c519e4e428a06b82a3649844a-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:36:44 [async_llm.py:261] Added request cmpl-09e3181c519e4e428a06b82a3649844a-0.
INFO 03-02 01:36:45 [logger.py:42] Received request cmpl-cd9e5fd1717d4daaae35eb51315928b8-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:36:45 [async_llm.py:261] Added request cmpl-cd9e5fd1717d4daaae35eb51315928b8-0.
INFO 03-02 01:36:46 [logger.py:42] Received request cmpl-98b2a7817c9f49e78d697c1c6aef5652-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:36:46 [async_llm.py:261] Added request cmpl-98b2a7817c9f49e78d697c1c6aef5652-0.
INFO 03-02 01:36:47 [logger.py:42] Received request cmpl-c1a97aafeb3b4babb3458f113e34faf4-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:36:47 [async_llm.py:261] Added request cmpl-c1a97aafeb3b4babb3458f113e34faf4-0.
INFO 03-02 01:36:48 [loggers.py:116] Engine 000: Avg prompt throughput: 27.9 tokens/s, Avg generation throughput: 4.5 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.6%
INFO 03-02 01:36:49 [logger.py:42] Received request cmpl-707116e0fdf046aa82db7dc08c6482ec-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:36:49 [async_llm.py:261] Added request cmpl-707116e0fdf046aa82db7dc08c6482ec-0.
INFO 03-02 01:36:50 [logger.py:42] Received request cmpl-33c6f87817c14a5791aaa67a50fdd05b-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:36:50 [async_llm.py:261] Added request cmpl-33c6f87817c14a5791aaa67a50fdd05b-0.
INFO 03-02 01:36:51 [logger.py:42] Received request cmpl-73fad3e620a6443f86c6cee98d99ae15-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:36:51 [async_llm.py:261] Added request cmpl-73fad3e620a6443f86c6cee98d99ae15-0.
INFO 03-02 01:36:52 [logger.py:42] Received request cmpl-af0c112dc15b475d9a1e40a94b48fa6a-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:36:52 [async_llm.py:261] Added request cmpl-af0c112dc15b475d9a1e40a94b48fa6a-0.
INFO 03-02 01:36:53 [logger.py:42] Received request cmpl-6f78fa7c46c64b268b5a3194f3f1a9ec-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:36:53 [async_llm.py:261] Added request cmpl-6f78fa7c46c64b268b5a3194f3f1a9ec-0.
INFO 03-02 01:36:54 [logger.py:42] Received request cmpl-29aed164fd1048138231591c8bd0bf2a-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:36:54 [async_llm.py:261] Added request cmpl-29aed164fd1048138231591c8bd0bf2a-0.
INFO 03-02 01:36:55 [logger.py:42] Received request cmpl-1c3ec5bf09cd4f219186c286d3eeaef6-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:36:55 [async_llm.py:261] Added request cmpl-1c3ec5bf09cd4f219186c286d3eeaef6-0.
INFO 03-02 01:36:57 [logger.py:42] Received request cmpl-7947a7040f964eb0bb10a4c8cd10d70f-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:36:57 [async_llm.py:261] Added request cmpl-7947a7040f964eb0bb10a4c8cd10d70f-0.
INFO 03-02 01:36:58 [loggers.py:116] Engine 000: Avg prompt throughput: 24.8 tokens/s, Avg generation throughput: 4.0 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.6%
INFO 03-02 01:36:58 [logger.py:42] Received request cmpl-21e94b271bb44b9aa0e0fffe2a0e81e3-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:36:58 [async_llm.py:261] Added request cmpl-21e94b271bb44b9aa0e0fffe2a0e81e3-0.
INFO 03-02 01:36:59 [logger.py:42] Received request cmpl-7d55d6e1719e427da288a5e1669e13d3-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:36:59 [async_llm.py:261] Added request cmpl-7d55d6e1719e427da288a5e1669e13d3-0.
INFO 03-02 01:37:00 [logger.py:42] Received request cmpl-d83fc184f7a54177bf49721d96db6ebd-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:37:00 [async_llm.py:261] Added request cmpl-d83fc184f7a54177bf49721d96db6ebd-0.
INFO 03-02 01:37:01 [logger.py:42] Received request cmpl-b0ef6c6e889047f19d02b4b19d8d93de-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:37:01 [async_llm.py:261] Added request cmpl-b0ef6c6e889047f19d02b4b19d8d93de-0.
INFO 03-02 01:37:02 [logger.py:42] Received request cmpl-03113d87080a4847982246b31cf7f69c-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:37:02 [async_llm.py:261] Added request cmpl-03113d87080a4847982246b31cf7f69c-0.
INFO 03-02 01:37:04 [logger.py:42] Received request cmpl-79861068c18c42ab966cec9689371e65-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:37:04 [async_llm.py:261] Added request cmpl-79861068c18c42ab966cec9689371e65-0.
INFO 03-02 01:37:05 [logger.py:42] Received request cmpl-16d0a927d2db42418b0bc05ee95ffb18-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:37:05 [async_llm.py:261] Added request cmpl-16d0a927d2db42418b0bc05ee95ffb18-0.
INFO 03-02 01:37:06 [logger.py:42] Received request cmpl-84b02c533caa40078738d848f12320d7-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:37:06 [async_llm.py:261] Added request cmpl-84b02c533caa40078738d848f12320d7-0.
INFO 03-02 01:37:07 [logger.py:42] Received request cmpl-6d4d3f8567474c388052c0741fbdd8e3-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:37:07 [async_llm.py:261] Added request cmpl-6d4d3f8567474c388052c0741fbdd8e3-0.
INFO 03-02 01:37:08 [loggers.py:116] Engine 000: Avg prompt throughput: 27.9 tokens/s, Avg generation throughput: 4.5 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.6%
INFO 03-02 01:37:08 [logger.py:42] Received request cmpl-9ef8fb64725b461e9ea18a9800e676ac-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:37:08 [async_llm.py:261] Added request cmpl-9ef8fb64725b461e9ea18a9800e676ac-0.
INFO 03-02 01:37:09 [logger.py:42] Received request cmpl-bff51f89efa349a68703f794b8dac5f2-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:37:09 [async_llm.py:261] Added request cmpl-bff51f89efa349a68703f794b8dac5f2-0.
INFO 03-02 01:37:10 [logger.py:42] Received request cmpl-cb53a7072d904fb2b6382446b9ed6635-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:37:10 [async_llm.py:261] Added request cmpl-cb53a7072d904fb2b6382446b9ed6635-0.
INFO 03-02 01:37:12 [logger.py:42] Received request cmpl-4e358b415308468ab399a31dc55005c6-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:37:12 [async_llm.py:261] Added request cmpl-4e358b415308468ab399a31dc55005c6-0.
INFO 03-02 01:37:13 [logger.py:42] Received request cmpl-0c372917668f44258b0d2384773346f7-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:37:13 [async_llm.py:261] Added request cmpl-0c372917668f44258b0d2384773346f7-0.
INFO 03-02 01:37:14 [logger.py:42] Received request cmpl-80924522716f4ce9889d652fac3edf89-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:37:14 [async_llm.py:261] Added request cmpl-80924522716f4ce9889d652fac3edf89-0.
INFO 03-02 01:37:15 [logger.py:42] Received request cmpl-01d4cb3b6a614de48834c1d5dbee9de1-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:37:15 [async_llm.py:261] Added request cmpl-01d4cb3b6a614de48834c1d5dbee9de1-0.
INFO 03-02 01:37:16 [logger.py:42] Received request cmpl-8f8f208775084044b7fc7bdf676dd1c5-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:37:16 [async_llm.py:261] Added request cmpl-8f8f208775084044b7fc7bdf676dd1c5-0.
INFO 03-02 01:37:17 [logger.py:42] Received request cmpl-f24731111be449198cb01a0e7a0435a9-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:37:17 [async_llm.py:261] Added request cmpl-f24731111be449198cb01a0e7a0435a9-0.
INFO 03-02 01:37:18 [loggers.py:116] Engine 000: Avg prompt throughput: 27.9 tokens/s, Avg generation throughput: 4.5 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.6%
INFO 03-02 01:37:19 [logger.py:42] Received request cmpl-1414859ae3544524a61f2d4b7620a0ee-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:37:19 [async_llm.py:261] Added request cmpl-1414859ae3544524a61f2d4b7620a0ee-0.
INFO 03-02 01:37:20 [logger.py:42] Received request cmpl-7f64537f21204634abcc0a360be4ffc0-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:37:20 [async_llm.py:261] Added request cmpl-7f64537f21204634abcc0a360be4ffc0-0.
INFO 03-02 01:37:21 [logger.py:42] Received request cmpl-1239a3ef7ec748a0a4e9c34821000fd5-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:37:21 [async_llm.py:261] Added request cmpl-1239a3ef7ec748a0a4e9c34821000fd5-0.
INFO 03-02 01:37:22 [logger.py:42] Received request cmpl-0a50a5c3dfa841c49904b660fd638469-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:37:22 [async_llm.py:261] Added request cmpl-0a50a5c3dfa841c49904b660fd638469-0.
INFO 03-02 01:37:23 [logger.py:42] Received request cmpl-b29fc3e6630d4b0581bbb4a8ef716805-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:37:23 [async_llm.py:261] Added request cmpl-b29fc3e6630d4b0581bbb4a8ef716805-0.
INFO 03-02 01:37:24 [logger.py:42] Received request cmpl-6459886f3687492dadcbe2357bdbd6b5-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:37:24 [async_llm.py:261] Added request cmpl-6459886f3687492dadcbe2357bdbd6b5-0.
INFO 03-02 01:37:25 [logger.py:42] Received request cmpl-1ed226cdf21d406489efe949865c2f37-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:37:25 [async_llm.py:261] Added request cmpl-1ed226cdf21d406489efe949865c2f37-0.
INFO 03-02 01:37:27 [logger.py:42] Received request cmpl-cf21c6390043402aaff59775c24713d9-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:37:27 [async_llm.py:261] Added request cmpl-cf21c6390043402aaff59775c24713d9-0.
INFO 03-02 01:37:28 [loggers.py:116] Engine 000: Avg prompt throughput: 24.8 tokens/s, Avg generation throughput: 4.0 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.6%
INFO 03-02 01:37:28 [logger.py:42] Received request cmpl-32776fb8244f4c849633b58a2d85de5c-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:37:28 [async_llm.py:261] Added request cmpl-32776fb8244f4c849633b58a2d85de5c-0.
INFO 03-02 01:37:29 [logger.py:42] Received request cmpl-a204b3d8ec184ea794fd3c7424014a13-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:37:29 [async_llm.py:261] Added request cmpl-a204b3d8ec184ea794fd3c7424014a13-0.
INFO 03-02 01:37:30 [logger.py:42] Received request cmpl-89ff9888a48545d294e5dcf51ff308d5-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:37:30 [async_llm.py:261] Added request cmpl-89ff9888a48545d294e5dcf51ff308d5-0.
INFO 03-02 01:37:31 [logger.py:42] Received request cmpl-e0d5b75369fd46bab212215f39606cad-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:37:31 [async_llm.py:261] Added request cmpl-e0d5b75369fd46bab212215f39606cad-0.
INFO 03-02 01:37:32 [logger.py:42] Received request cmpl-32d46addde474e5baea3cd91765f1c01-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:37:32 [async_llm.py:261] Added request cmpl-32d46addde474e5baea3cd91765f1c01-0.
INFO 03-02 01:37:34 [logger.py:42] Received request cmpl-49b97b41f71b4ff6af6face1df203088-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:37:34 [async_llm.py:261] Added request cmpl-49b97b41f71b4ff6af6face1df203088-0.
INFO 03-02 01:37:35 [logger.py:42] Received request cmpl-b10de7381b034059a859d60b4b5d0330-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:37:35 [async_llm.py:261] Added request cmpl-b10de7381b034059a859d60b4b5d0330-0.
INFO 03-02 01:37:36 [logger.py:42] Received request cmpl-a6188fe82ca745c3b69c19a3894b3831-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:37:36 [async_llm.py:261] Added request cmpl-a6188fe82ca745c3b69c19a3894b3831-0.
INFO 03-02 01:37:37 [logger.py:42] Received request cmpl-6869adcb74d647ba8d1de6aab739f316-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:37:37 [async_llm.py:261] Added request cmpl-6869adcb74d647ba8d1de6aab739f316-0.
INFO 03-02 01:37:38 [loggers.py:116] Engine 000: Avg prompt throughput: 27.9 tokens/s, Avg generation throughput: 4.5 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.6%
INFO 03-02 01:37:38 [logger.py:42] Received request cmpl-4b12bce998c94380a51b0972e5c04df7-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:37:38 [async_llm.py:261] Added request cmpl-4b12bce998c94380a51b0972e5c04df7-0.
INFO 03-02 01:37:39 [logger.py:42] Received request cmpl-927a6c3111be4c1a95c22ea682699964-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:37:39 [async_llm.py:261] Added request cmpl-927a6c3111be4c1a95c22ea682699964-0.
INFO 03-02 01:37:40 [logger.py:42] Received request cmpl-ed148e5fda0a4fe6a9c55129f2c03eef-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:37:40 [async_llm.py:261] Added request cmpl-ed148e5fda0a4fe6a9c55129f2c03eef-0.
INFO 03-02 01:37:42 [logger.py:42] Received request cmpl-cc22de6edc2547159184594ca4e82d2f-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:37:42 [async_llm.py:261] Added request cmpl-cc22de6edc2547159184594ca4e82d2f-0.
INFO 03-02 01:37:43 [logger.py:42] Received request cmpl-435d3c38460c4e0d94ae67194f5119df-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:37:43 [async_llm.py:261] Added request cmpl-435d3c38460c4e0d94ae67194f5119df-0.
INFO 03-02 01:37:44 [logger.py:42] Received request cmpl-315f5494c99b44b2875eb2d41b76fb75-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:37:44 [async_llm.py:261] Added request cmpl-315f5494c99b44b2875eb2d41b76fb75-0.
INFO 03-02 01:37:45 [logger.py:42] Received request cmpl-88d0e54bce2a4475ad6052d5a9a61b22-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:37:45 [async_llm.py:261] Added request cmpl-88d0e54bce2a4475ad6052d5a9a61b22-0.
INFO 03-02 01:37:46 [logger.py:42] Received request cmpl-765fcc41c9c541ec8dca9d14be5440d4-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:37:46 [async_llm.py:261] Added request cmpl-765fcc41c9c541ec8dca9d14be5440d4-0.
INFO 03-02 01:37:47 [logger.py:42] Received request cmpl-25f18d2a82274d8789e74dca279e7ea4-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:37:47 [async_llm.py:261] Added request cmpl-25f18d2a82274d8789e74dca279e7ea4-0.
INFO 03-02 01:37:48 [loggers.py:116] Engine 000: Avg prompt throughput: 27.9 tokens/s, Avg generation throughput: 4.5 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.6%
INFO 03-02 01:37:49 [logger.py:42] Received request cmpl-50b0845e72b0423098a2b7ad8ea245bf-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:37:49 [async_llm.py:261] Added request cmpl-50b0845e72b0423098a2b7ad8ea245bf-0.
INFO 03-02 01:37:50 [logger.py:42] Received request cmpl-9c2533f59ef94aa280ddebb2a54856a9-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:37:50 [async_llm.py:261] Added request cmpl-9c2533f59ef94aa280ddebb2a54856a9-0.
INFO 03-02 01:37:51 [logger.py:42] Received request cmpl-344ab3f73e3f4c308d07d6678013962e-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:37:51 [async_llm.py:261] Added request cmpl-344ab3f73e3f4c308d07d6678013962e-0.
INFO 03-02 01:37:52 [logger.py:42] Received request cmpl-7696b62d14c146e5b82b13a1bcadf30e-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:37:52 [async_llm.py:261] Added request cmpl-7696b62d14c146e5b82b13a1bcadf30e-0.
INFO 03-02 01:37:53 [logger.py:42] Received request cmpl-0f35011c01b542ef9a965c13c11ad5c9-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:37:53 [async_llm.py:261] Added request cmpl-0f35011c01b542ef9a965c13c11ad5c9-0.
INFO 03-02 01:37:54 [logger.py:42] Received request cmpl-be95929c49314dc69fba626ad6f8d7d4-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:37:54 [async_llm.py:261] Added request cmpl-be95929c49314dc69fba626ad6f8d7d4-0.
INFO 03-02 01:37:56 [logger.py:42] Received request cmpl-76caf7dbaa0244f58818b1307c70aebb-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:37:56 [async_llm.py:261] Added request cmpl-76caf7dbaa0244f58818b1307c70aebb-0.
INFO 03-02 01:37:57 [logger.py:42] Received request cmpl-3f9221caa1004318ab4ac292c32d9ce6-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:37:57 [async_llm.py:261] Added request cmpl-3f9221caa1004318ab4ac292c32d9ce6-0.
INFO 03-02 01:37:58 [loggers.py:116] Engine 000: Avg prompt throughput: 24.8 tokens/s, Avg generation throughput: 4.0 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.6%
INFO 03-02 01:37:58 [logger.py:42] Received request cmpl-577fd206239642c8ac3a8b291599e69e-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:37:58 [async_llm.py:261] Added request cmpl-577fd206239642c8ac3a8b291599e69e-0.
INFO 03-02 01:37:59 [logger.py:42] Received request cmpl-958a79ca97d34eb7ba13a2985891dd3c-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:37:59 [async_llm.py:261] Added request cmpl-958a79ca97d34eb7ba13a2985891dd3c-0.
INFO 03-02 01:38:00 [logger.py:42] Received request cmpl-975955029eff47bbb12a1550355587ac-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:38:00 [async_llm.py:261] Added request cmpl-975955029eff47bbb12a1550355587ac-0.
INFO 03-02 01:38:01 [logger.py:42] Received request cmpl-dabd97c53cab43a8ad629891fa7bb050-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:38:01 [async_llm.py:261] Added request cmpl-dabd97c53cab43a8ad629891fa7bb050-0.
INFO 03-02 01:38:02 [logger.py:42] Received request cmpl-ad9dfda283f540eeb907fd351342019e-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:38:02 [async_llm.py:261] Added request cmpl-ad9dfda283f540eeb907fd351342019e-0.
INFO 03-02 01:38:04 [logger.py:42] Received request cmpl-391a5e7d830d4fd9b69d594405c115c3-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:38:04 [async_llm.py:261] Added request cmpl-391a5e7d830d4fd9b69d594405c115c3-0.
INFO 03-02 01:38:05 [logger.py:42] Received request cmpl-656dea5ce5e145e7900ce604f3622698-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:38:05 [async_llm.py:261] Added request cmpl-656dea5ce5e145e7900ce604f3622698-0.
INFO 03-02 01:38:06 [logger.py:42] Received request cmpl-89e86b58cb8748bc86c97e5a645a9995-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:38:06 [async_llm.py:261] Added request cmpl-89e86b58cb8748bc86c97e5a645a9995-0.
INFO 03-02 01:38:07 [logger.py:42] Received request cmpl-c2abb422b1df4058b124b2091f5ff67f-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:38:07 [async_llm.py:261] Added request cmpl-c2abb422b1df4058b124b2091f5ff67f-0.
INFO 03-02 01:38:08 [loggers.py:116] Engine 000: Avg prompt throughput: 27.9 tokens/s, Avg generation throughput: 4.5 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.6%
INFO 03-02 01:38:08 [logger.py:42] Received request cmpl-d50a5e5b0f24494da7aaa688451d2313-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:38:08 [async_llm.py:261] Added request cmpl-d50a5e5b0f24494da7aaa688451d2313-0.
INFO 03-02 01:38:09 [logger.py:42] Received request cmpl-8d957059c933406fbf64073142e8935b-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:38:09 [async_llm.py:261] Added request cmpl-8d957059c933406fbf64073142e8935b-0.
INFO 03-02 01:38:10 [logger.py:42] Received request cmpl-29d18b8b3bcb414b84246ce1f72de828-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:38:10 [async_llm.py:261] Added request cmpl-29d18b8b3bcb414b84246ce1f72de828-0.
INFO 03-02 01:38:12 [logger.py:42] Received request cmpl-d0b108d4c02542e49ef3979c63bf3038-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:38:12 [async_llm.py:261] Added request cmpl-d0b108d4c02542e49ef3979c63bf3038-0.
INFO 03-02 01:38:13 [logger.py:42] Received request cmpl-757e2032cb4848338c541735209a5bf2-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:38:13 [async_llm.py:261] Added request cmpl-757e2032cb4848338c541735209a5bf2-0.
INFO 03-02 01:38:14 [logger.py:42] Received request cmpl-85bb4c1ff78a4505bbcc4952d65d87b0-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:38:14 [async_llm.py:261] Added request cmpl-85bb4c1ff78a4505bbcc4952d65d87b0-0.
INFO 03-02 01:38:15 [logger.py:42] Received request cmpl-a65924ef14bb47fdb8076610aca46ff9-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:38:15 [async_llm.py:261] Added request cmpl-a65924ef14bb47fdb8076610aca46ff9-0.
INFO 03-02 01:38:16 [logger.py:42] Received request cmpl-6332d8df6aed4384b41287bb6b0b91a1-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:38:16 [async_llm.py:261] Added request cmpl-6332d8df6aed4384b41287bb6b0b91a1-0.
INFO 03-02 01:38:17 [logger.py:42] Received request cmpl-96a39e5f545144f68e006b798c04b77e-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:38:17 [async_llm.py:261] Added request cmpl-96a39e5f545144f68e006b798c04b77e-0.
INFO 03-02 01:38:18 [loggers.py:116] Engine 000: Avg prompt throughput: 27.9 tokens/s, Avg generation throughput: 4.5 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.6%
INFO 03-02 01:38:19 [logger.py:42] Received request cmpl-0ab8fb2f1ac64b9c9b2f9d1a31e2727c-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:38:19 [async_llm.py:261] Added request cmpl-0ab8fb2f1ac64b9c9b2f9d1a31e2727c-0.
INFO 03-02 01:38:20 [logger.py:42] Received request cmpl-1d1bc214a905486db45261b0e5532c07-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:38:20 [async_llm.py:261] Added request cmpl-1d1bc214a905486db45261b0e5532c07-0.
INFO 03-02 01:38:21 [logger.py:42] Received request cmpl-4fe52eb1ed394f5b9da972efaa6c5b56-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:38:21 [async_llm.py:261] Added request cmpl-4fe52eb1ed394f5b9da972efaa6c5b56-0.
INFO 03-02 01:38:22 [logger.py:42] Received request cmpl-986ec59bcae34dae8f1522b9ee1dd897-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:38:22 [async_llm.py:261] Added request cmpl-986ec59bcae34dae8f1522b9ee1dd897-0.
INFO 03-02 01:38:23 [logger.py:42] Received request cmpl-dda341dabfa345aab2f3bb636cb9c541-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:38:23 [async_llm.py:261] Added request cmpl-dda341dabfa345aab2f3bb636cb9c541-0.
INFO 03-02 01:38:24 [logger.py:42] Received request cmpl-e67439bc91954628add74631fccf7edb-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:38:24 [async_llm.py:261] Added request cmpl-e67439bc91954628add74631fccf7edb-0.
INFO 03-02 01:38:25 [logger.py:42] Received request cmpl-81b5848ce4104e37ad11f4739c302620-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:38:25 [async_llm.py:261] Added request cmpl-81b5848ce4104e37ad11f4739c302620-0.
INFO 03-02 01:38:27 [logger.py:42] Received request cmpl-2ec14ec5e5024a8fb39e17c2ad5bd1e6-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:38:27 [async_llm.py:261] Added request cmpl-2ec14ec5e5024a8fb39e17c2ad5bd1e6-0.
INFO 03-02 01:38:28 [loggers.py:116] Engine 000: Avg prompt throughput: 24.8 tokens/s, Avg generation throughput: 4.0 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.6%
INFO 03-02 01:38:28 [logger.py:42] Received request cmpl-7dc68da53b9b4c5db121c3f520f396d1-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:38:28 [async_llm.py:261] Added request cmpl-7dc68da53b9b4c5db121c3f520f396d1-0.
INFO 03-02 01:38:29 [logger.py:42] Received request cmpl-e39248dbab2b43fdae87a4e752043e2c-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:38:29 [async_llm.py:261] Added request cmpl-e39248dbab2b43fdae87a4e752043e2c-0.
INFO 03-02 01:38:30 [logger.py:42] Received request cmpl-500ecf276d3c4ac9aec15e4a6ca272aa-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:38:30 [async_llm.py:261] Added request cmpl-500ecf276d3c4ac9aec15e4a6ca272aa-0.
INFO 03-02 01:38:31 [logger.py:42] Received request cmpl-a388c22611e843d2b89996648786ac4d-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:38:31 [async_llm.py:261] Added request cmpl-a388c22611e843d2b89996648786ac4d-0.
INFO 03-02 01:38:32 [logger.py:42] Received request cmpl-ec2f68978ffc48079c408d3b0dd95daf-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:38:32 [async_llm.py:261] Added request cmpl-ec2f68978ffc48079c408d3b0dd95daf-0.
INFO 03-02 01:38:34 [logger.py:42] Received request cmpl-da78275489d341898bc9b5ea250f39cd-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:38:34 [async_llm.py:261] Added request cmpl-da78275489d341898bc9b5ea250f39cd-0.
INFO 03-02 01:38:35 [logger.py:42] Received request cmpl-15b2da4068034b92ae91270431ef150c-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:38:35 [async_llm.py:261] Added request cmpl-15b2da4068034b92ae91270431ef150c-0.
INFO 03-02 01:38:36 [logger.py:42] Received request cmpl-1a80f2c2529448eebf86e5b07bf0283d-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:38:36 [async_llm.py:261] Added request cmpl-1a80f2c2529448eebf86e5b07bf0283d-0.
INFO 03-02 01:38:37 [logger.py:42] Received request cmpl-e9ebeea17436471b91a9f5f0c54082de-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:38:37 [async_llm.py:261] Added request cmpl-e9ebeea17436471b91a9f5f0c54082de-0.
INFO 03-02 01:38:38 [loggers.py:116] Engine 000: Avg prompt throughput: 27.9 tokens/s, Avg generation throughput: 4.5 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.6%
INFO 03-02 01:38:38 [logger.py:42] Received request cmpl-f9d370f053b945efacd379e1aeab15d5-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:38:38 [async_llm.py:261] Added request cmpl-f9d370f053b945efacd379e1aeab15d5-0.
INFO 03-02 01:38:39 [logger.py:42] Received request cmpl-bf3d1d5b9ba7465f9f2f82d84074b045-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:38:39 [async_llm.py:261] Added request cmpl-bf3d1d5b9ba7465f9f2f82d84074b045-0.
INFO 03-02 01:38:41 [logger.py:42] Received request cmpl-be3af4537b98446e82b3e94bf3cf5835-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:38:41 [async_llm.py:261] Added request cmpl-be3af4537b98446e82b3e94bf3cf5835-0.
INFO 03-02 01:38:42 [logger.py:42] Received request cmpl-deb3c50548474856be01781bbd9a6a17-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:38:42 [async_llm.py:261] Added request cmpl-deb3c50548474856be01781bbd9a6a17-0.
INFO 03-02 01:38:43 [logger.py:42] Received request cmpl-faf68d4ba3794709ba432d8885815a98-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:38:43 [async_llm.py:261] Added request cmpl-faf68d4ba3794709ba432d8885815a98-0.
INFO 03-02 01:38:44 [logger.py:42] Received request cmpl-70d39ffa6e76452fa3279250547a600e-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:38:44 [async_llm.py:261] Added request cmpl-70d39ffa6e76452fa3279250547a600e-0.
INFO 03-02 01:38:45 [logger.py:42] Received request cmpl-a603ac5ada2e4629839370fb810a4cb8-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:38:45 [async_llm.py:261] Added request cmpl-a603ac5ada2e4629839370fb810a4cb8-0.
INFO 03-02 01:38:46 [logger.py:42] Received request cmpl-197abe09c3434922a82e75170fa95f44-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:38:46 [async_llm.py:261] Added request cmpl-197abe09c3434922a82e75170fa95f44-0.
INFO 03-02 01:38:47 [logger.py:42] Received request cmpl-1ec65fee72c14b5590ef88406061d170-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:38:47 [async_llm.py:261] Added request cmpl-1ec65fee72c14b5590ef88406061d170-0.
INFO 03-02 01:38:48 [loggers.py:116] Engine 000: Avg prompt throughput: 27.9 tokens/s, Avg generation throughput: 4.5 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.6%
INFO 03-02 01:38:49 [logger.py:42] Received request cmpl-fe78a40aae544155aa3cb3fbd0f28f0e-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:38:49 [async_llm.py:261] Added request cmpl-fe78a40aae544155aa3cb3fbd0f28f0e-0.
INFO 03-02 01:38:50 [logger.py:42] Received request cmpl-5a882debbd874d8b965c0162c0f34331-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:38:50 [async_llm.py:261] Added request cmpl-5a882debbd874d8b965c0162c0f34331-0.
INFO 03-02 01:38:51 [logger.py:42] Received request cmpl-045fc97e62e8456288d47059205a8885-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:38:51 [async_llm.py:261] Added request cmpl-045fc97e62e8456288d47059205a8885-0.
INFO 03-02 01:38:52 [logger.py:42] Received request cmpl-137b3c511bd345618d839892766dbf77-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:38:52 [async_llm.py:261] Added request cmpl-137b3c511bd345618d839892766dbf77-0.
INFO 03-02 01:38:53 [logger.py:42] Received request cmpl-9b263d3537154c05b6bebf969fdb03ef-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:38:53 [async_llm.py:261] Added request cmpl-9b263d3537154c05b6bebf969fdb03ef-0.
INFO 03-02 01:38:54 [logger.py:42] Received request cmpl-68420eff1890490199c239e68b865030-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:38:54 [async_llm.py:261] Added request cmpl-68420eff1890490199c239e68b865030-0.
INFO 03-02 01:38:56 [logger.py:42] Received request cmpl-51a81dfaa46c423cb36b669b6759c198-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:38:56 [async_llm.py:261] Added request cmpl-51a81dfaa46c423cb36b669b6759c198-0.
INFO 03-02 01:38:57 [logger.py:42] Received request cmpl-0b71b8510e004a728c269e2cb877041b-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:38:57 [async_llm.py:261] Added request cmpl-0b71b8510e004a728c269e2cb877041b-0.
INFO 03-02 01:38:58 [loggers.py:116] Engine 000: Avg prompt throughput: 24.8 tokens/s, Avg generation throughput: 4.0 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.6%
INFO 03-02 01:38:58 [logger.py:42] Received request cmpl-e6947db71ff444c2a304962f1bb3ae62-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:38:58 [async_llm.py:261] Added request cmpl-e6947db71ff444c2a304962f1bb3ae62-0.
INFO 03-02 01:38:59 [logger.py:42] Received request cmpl-97e5a958dd664a47a130d892171abc16-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:38:59 [async_llm.py:261] Added request cmpl-97e5a958dd664a47a130d892171abc16-0.
INFO 03-02 01:39:00 [logger.py:42] Received request cmpl-30b7073892ae41969a9419b4d46983e2-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:39:00 [async_llm.py:261] Added request cmpl-30b7073892ae41969a9419b4d46983e2-0.
INFO 03-02 01:39:01 [logger.py:42] Received request cmpl-d8759533aec2462091e4b602c10dda8e-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:39:01 [async_llm.py:261] Added request cmpl-d8759533aec2462091e4b602c10dda8e-0.
INFO 03-02 01:39:02 [logger.py:42] Received request cmpl-f1165166c661479d936270200171543e-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:39:02 [async_llm.py:261] Added request cmpl-f1165166c661479d936270200171543e-0.
INFO 03-02 01:39:04 [logger.py:42] Received request cmpl-d860ff7bf28c42b6856ac6adab0275b8-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:39:04 [async_llm.py:261] Added request cmpl-d860ff7bf28c42b6856ac6adab0275b8-0.
INFO 03-02 01:39:05 [logger.py:42] Received request cmpl-0fe695f5342e4aa2a0e5fc3094ccdc3e-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:39:05 [async_llm.py:261] Added request cmpl-0fe695f5342e4aa2a0e5fc3094ccdc3e-0.
INFO 03-02 01:39:06 [logger.py:42] Received request cmpl-492361d13c3d4e1fbc57173898cb2b49-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:39:06 [async_llm.py:261] Added request cmpl-492361d13c3d4e1fbc57173898cb2b49-0.
INFO 03-02 01:39:07 [logger.py:42] Received request cmpl-0facaf3446a44b03a929e535a6f2ffcb-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:39:07 [async_llm.py:261] Added request cmpl-0facaf3446a44b03a929e535a6f2ffcb-0.
INFO 03-02 01:39:08 [loggers.py:116] Engine 000: Avg prompt throughput: 27.9 tokens/s, Avg generation throughput: 4.5 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.6%
INFO 03-02 01:39:08 [logger.py:42] Received request cmpl-625b731dbe674958b5e3d1794eff4daa-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:39:08 [async_llm.py:261] Added request cmpl-625b731dbe674958b5e3d1794eff4daa-0.
INFO 03-02 01:39:09 [logger.py:42] Received request cmpl-63840f619647407096ae58b242759431-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:39:09 [async_llm.py:261] Added request cmpl-63840f619647407096ae58b242759431-0.
INFO 03-02 01:39:11 [logger.py:42] Received request cmpl-5be7bf166aa047b1a1442d310a14cadc-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:39:11 [async_llm.py:261] Added request cmpl-5be7bf166aa047b1a1442d310a14cadc-0.
INFO 03-02 01:39:12 [logger.py:42] Received request cmpl-e03bda6c718e41c8a077bcb97b3359a9-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:39:12 [async_llm.py:261] Added request cmpl-e03bda6c718e41c8a077bcb97b3359a9-0.
INFO 03-02 01:39:13 [logger.py:42] Received request cmpl-ea21fe586de844fc97e985372a930a05-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:39:13 [async_llm.py:261] Added request cmpl-ea21fe586de844fc97e985372a930a05-0.
INFO 03-02 01:39:14 [logger.py:42] Received request cmpl-83ff968ab843439c80c75077acdbfdd1-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:39:14 [async_llm.py:261] Added request cmpl-83ff968ab843439c80c75077acdbfdd1-0.
INFO 03-02 01:39:15 [logger.py:42] Received request cmpl-50fc7e45e7f54636b6fa3a6d0f0d153f-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:39:15 [async_llm.py:261] Added request cmpl-50fc7e45e7f54636b6fa3a6d0f0d153f-0.
INFO 03-02 01:39:16 [logger.py:42] Received request cmpl-2016d42d31b44db098a59eb3df661af4-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:39:16 [async_llm.py:261] Added request cmpl-2016d42d31b44db098a59eb3df661af4-0.
INFO 03-02 01:39:17 [logger.py:42] Received request cmpl-2f72a1dae2eb4dadac1c5dc309022ffc-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:39:17 [async_llm.py:261] Added request cmpl-2f72a1dae2eb4dadac1c5dc309022ffc-0.
INFO 03-02 01:39:18 [loggers.py:116] Engine 000: Avg prompt throughput: 27.9 tokens/s, Avg generation throughput: 4.5 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.6%
INFO 03-02 01:39:19 [logger.py:42] Received request cmpl-2b943577c39c4425a859fed53a0c5ff0-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:39:19 [async_llm.py:261] Added request cmpl-2b943577c39c4425a859fed53a0c5ff0-0.
INFO 03-02 01:39:20 [logger.py:42] Received request cmpl-1d631c8e6a43457180293f65148c4f62-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:39:20 [async_llm.py:261] Added request cmpl-1d631c8e6a43457180293f65148c4f62-0.
INFO 03-02 01:39:21 [logger.py:42] Received request cmpl-99be3ef023884bcd817c3b847fd83ba1-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:39:21 [async_llm.py:261] Added request cmpl-99be3ef023884bcd817c3b847fd83ba1-0.
INFO 03-02 01:39:22 [logger.py:42] Received request cmpl-419157b1ccb649a7ab124e1d3d1a14d0-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:39:22 [async_llm.py:261] Added request cmpl-419157b1ccb649a7ab124e1d3d1a14d0-0.
INFO 03-02 01:39:23 [logger.py:42] Received request cmpl-17e0bfabdd7544dc9a35f882e2a3f0e2-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:39:23 [async_llm.py:261] Added request cmpl-17e0bfabdd7544dc9a35f882e2a3f0e2-0.
INFO 03-02 01:39:24 [logger.py:42] Received request cmpl-159c555ee2464af3a1b81df6f2ec89e6-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:39:24 [async_llm.py:261] Added request cmpl-159c555ee2464af3a1b81df6f2ec89e6-0.
INFO 03-02 01:39:26 [logger.py:42] Received request cmpl-d72baf06dc354623bac986b959fd585f-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:39:26 [async_llm.py:261] Added request cmpl-d72baf06dc354623bac986b959fd585f-0.
INFO 03-02 01:39:27 [logger.py:42] Received request cmpl-bae8ac4f00294e8f84d1751baefbcf6f-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:39:27 [async_llm.py:261] Added request cmpl-bae8ac4f00294e8f84d1751baefbcf6f-0.
INFO 03-02 01:39:28 [loggers.py:116] Engine 000: Avg prompt throughput: 24.8 tokens/s, Avg generation throughput: 4.0 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.6%
INFO 03-02 01:39:28 [logger.py:42] Received request cmpl-c3b28f8cd75248918c2e7a48e6909c03-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:39:28 [async_llm.py:261] Added request cmpl-c3b28f8cd75248918c2e7a48e6909c03-0.
INFO 03-02 01:39:29 [logger.py:42] Received request cmpl-fb4bd29d0cc44e9c8a79ac2e6181d7c2-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:39:29 [async_llm.py:261] Added request cmpl-fb4bd29d0cc44e9c8a79ac2e6181d7c2-0.
INFO 03-02 01:39:30 [logger.py:42] Received request cmpl-2741317696c54b328483eedbd977260f-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:39:30 [async_llm.py:261] Added request cmpl-2741317696c54b328483eedbd977260f-0.
INFO 03-02 01:39:31 [logger.py:42] Received request cmpl-fdf81a19fc6c447ab26bb83f9a2412d5-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:39:31 [async_llm.py:261] Added request cmpl-fdf81a19fc6c447ab26bb83f9a2412d5-0.
INFO 03-02 01:39:32 [logger.py:42] Received request cmpl-eb5664c778f54c01b6dbbe32b0faa265-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:39:32 [async_llm.py:261] Added request cmpl-eb5664c778f54c01b6dbbe32b0faa265-0.
INFO 03-02 01:39:34 [logger.py:42] Received request cmpl-e7839804a467437b91400ce9d2a30a97-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:39:34 [async_llm.py:261] Added request cmpl-e7839804a467437b91400ce9d2a30a97-0.
INFO 03-02 01:39:35 [logger.py:42] Received request cmpl-a9a6e159a6584a4491a43920152f54df-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:39:35 [async_llm.py:261] Added request cmpl-a9a6e159a6584a4491a43920152f54df-0.
INFO 03-02 01:39:36 [logger.py:42] Received request cmpl-82f5e52ce0254e56a7b8a9ef5b059a43-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:39:36 [async_llm.py:261] Added request cmpl-82f5e52ce0254e56a7b8a9ef5b059a43-0.
INFO 03-02 01:39:37 [logger.py:42] Received request cmpl-c068d39e912f40019212b70ac27b5831-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:39:37 [async_llm.py:261] Added request cmpl-c068d39e912f40019212b70ac27b5831-0.
INFO 03-02 01:39:38 [loggers.py:116] Engine 000: Avg prompt throughput: 27.9 tokens/s, Avg generation throughput: 4.5 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.6%
INFO 03-02 01:39:38 [logger.py:42] Received request cmpl-dde8366274e4480b814773b950a3e0d9-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:39:38 [async_llm.py:261] Added request cmpl-dde8366274e4480b814773b950a3e0d9-0.
INFO 03-02 01:39:39 [logger.py:42] Received request cmpl-5fdf3e9a1c9b4c31b54d3c1aa7f5953b-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:39:39 [async_llm.py:261] Added request cmpl-5fdf3e9a1c9b4c31b54d3c1aa7f5953b-0.
INFO 03-02 01:39:41 [logger.py:42] Received request cmpl-990db534a1a447c7ae1c55e8a6975466-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:39:41 [async_llm.py:261] Added request cmpl-990db534a1a447c7ae1c55e8a6975466-0.
INFO 03-02 01:39:42 [logger.py:42] Received request cmpl-4a874544f9b147518d3bf36391d8e5e9-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:39:42 [async_llm.py:261] Added request cmpl-4a874544f9b147518d3bf36391d8e5e9-0.
INFO 03-02 01:39:43 [logger.py:42] Received request cmpl-c8af2ec318b9457998883f1f64c22a03-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:39:43 [async_llm.py:261] Added request cmpl-c8af2ec318b9457998883f1f64c22a03-0.
INFO 03-02 01:39:44 [logger.py:42] Received request cmpl-170805e602c942b286a3d4983f619332-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:39:44 [async_llm.py:261] Added request cmpl-170805e602c942b286a3d4983f619332-0.
INFO 03-02 01:39:45 [logger.py:42] Received request cmpl-55d0cc15f2b24a908488eb5bda861c8c-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:39:45 [async_llm.py:261] Added request cmpl-55d0cc15f2b24a908488eb5bda861c8c-0.
INFO 03-02 01:39:46 [logger.py:42] Received request cmpl-4eb31eb57afe4a98ace6cab88046f0b0-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:39:46 [async_llm.py:261] Added request cmpl-4eb31eb57afe4a98ace6cab88046f0b0-0.
INFO 03-02 01:39:47 [logger.py:42] Received request cmpl-7a1a40dd722a4ba7b8d9c36c72ed22e4-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:39:47 [async_llm.py:261] Added request cmpl-7a1a40dd722a4ba7b8d9c36c72ed22e4-0.
INFO 03-02 01:39:48 [loggers.py:116] Engine 000: Avg prompt throughput: 27.9 tokens/s, Avg generation throughput: 4.5 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.6%
INFO 03-02 01:39:49 [logger.py:42] Received request cmpl-140bb982b4994fe3bc85dac99bbdea3d-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:39:49 [async_llm.py:261] Added request cmpl-140bb982b4994fe3bc85dac99bbdea3d-0.
INFO 03-02 01:39:50 [logger.py:42] Received request cmpl-c2685b5537424b488a8c86ebf3f3a649-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:39:50 [async_llm.py:261] Added request cmpl-c2685b5537424b488a8c86ebf3f3a649-0.
INFO 03-02 01:39:51 [logger.py:42] Received request cmpl-bfeabac84c3a4d95b1fc18cfc84a5009-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:39:51 [async_llm.py:261] Added request cmpl-bfeabac84c3a4d95b1fc18cfc84a5009-0.
INFO 03-02 01:39:52 [logger.py:42] Received request cmpl-f5d8561e9a3e4d15a605acc2c865b117-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:39:52 [async_llm.py:261] Added request cmpl-f5d8561e9a3e4d15a605acc2c865b117-0.
INFO 03-02 01:39:53 [logger.py:42] Received request cmpl-3ec3bca4325446e1984254de659b11cd-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:39:53 [async_llm.py:261] Added request cmpl-3ec3bca4325446e1984254de659b11cd-0.
INFO 03-02 01:39:54 [logger.py:42] Received request cmpl-4a6c13f8f630405886044b38effe617a-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:39:54 [async_llm.py:261] Added request cmpl-4a6c13f8f630405886044b38effe617a-0.
INFO 03-02 01:39:55 [logger.py:42] Received request cmpl-619c27cf4d444687bb712037c2b1007d-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:39:55 [async_llm.py:261] Added request cmpl-619c27cf4d444687bb712037c2b1007d-0.
INFO 03-02 01:39:57 [logger.py:42] Received request cmpl-cc256c2d09a04521836316b3fce29ad8-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:39:57 [async_llm.py:261] Added request cmpl-cc256c2d09a04521836316b3fce29ad8-0.
INFO 03-02 01:39:58 [loggers.py:116] Engine 000: Avg prompt throughput: 24.8 tokens/s, Avg generation throughput: 4.0 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.6%
INFO 03-02 01:39:58 [logger.py:42] Received request cmpl-daeb85203a1b475c90f8c58472703be6-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:39:58 [async_llm.py:261] Added request cmpl-daeb85203a1b475c90f8c58472703be6-0.
INFO 03-02 01:39:59 [logger.py:42] Received request cmpl-c0d555af23ee46d6abc397d0bdb21395-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:39:59 [async_llm.py:261] Added request cmpl-c0d555af23ee46d6abc397d0bdb21395-0.
INFO 03-02 01:40:00 [logger.py:42] Received request cmpl-fa97caba25c0457ea9a30f42c9f330a6-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:40:00 [async_llm.py:261] Added request cmpl-fa97caba25c0457ea9a30f42c9f330a6-0.
INFO 03-02 01:40:01 [logger.py:42] Received request cmpl-9f9a341887624673b403389c3348e08b-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:40:01 [async_llm.py:261] Added request cmpl-9f9a341887624673b403389c3348e08b-0.
INFO 03-02 01:40:02 [logger.py:42] Received request cmpl-983fb40e4cec4835865b6de1d3696d68-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:40:02 [async_llm.py:261] Added request cmpl-983fb40e4cec4835865b6de1d3696d68-0.
INFO 03-02 01:40:04 [logger.py:42] Received request cmpl-a1ae8ad2da484b9db5f8a1aa6090023d-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:40:04 [async_llm.py:261] Added request cmpl-a1ae8ad2da484b9db5f8a1aa6090023d-0.
INFO 03-02 01:40:05 [logger.py:42] Received request cmpl-4fb356030bfe4942b2467e3c0253f16e-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:40:05 [async_llm.py:261] Added request cmpl-4fb356030bfe4942b2467e3c0253f16e-0.
INFO 03-02 01:40:06 [logger.py:42] Received request cmpl-69023236ff194a1ca656bba455fe522e-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:40:06 [async_llm.py:261] Added request cmpl-69023236ff194a1ca656bba455fe522e-0.
INFO 03-02 01:40:07 [logger.py:42] Received request cmpl-8dc8fca0ca6b4be3831801f71691de8d-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:40:07 [async_llm.py:261] Added request cmpl-8dc8fca0ca6b4be3831801f71691de8d-0.
INFO 03-02 01:40:08 [loggers.py:116] Engine 000: Avg prompt throughput: 27.9 tokens/s, Avg generation throughput: 4.5 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.6%
INFO 03-02 01:40:08 [logger.py:42] Received request cmpl-52a00335783c4a4ea1c789faf862a207-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:40:08 [async_llm.py:261] Added request cmpl-52a00335783c4a4ea1c789faf862a207-0.
INFO 03-02 01:40:09 [logger.py:42] Received request cmpl-5ebbd46d9d6e4bbfa15efa6c10d776ae-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:40:09 [async_llm.py:261] Added request cmpl-5ebbd46d9d6e4bbfa15efa6c10d776ae-0.
INFO 03-02 01:40:10 [logger.py:42] Received request cmpl-955a7203a033416fbb53d08ef046eebc-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:40:10 [async_llm.py:261] Added request cmpl-955a7203a033416fbb53d08ef046eebc-0.
INFO 03-02 01:40:12 [logger.py:42] Received request cmpl-2d2e48ce10ed4151811208cb827eea49-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:40:12 [async_llm.py:261] Added request cmpl-2d2e48ce10ed4151811208cb827eea49-0.
INFO 03-02 01:40:13 [logger.py:42] Received request cmpl-45f953cfa8234d11a1f61bd69a0d7922-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:40:13 [async_llm.py:261] Added request cmpl-45f953cfa8234d11a1f61bd69a0d7922-0.
INFO 03-02 01:40:14 [logger.py:42] Received request cmpl-6ca7a2e4dca24b3eafb7d69553e9b9c3-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:40:14 [async_llm.py:261] Added request cmpl-6ca7a2e4dca24b3eafb7d69553e9b9c3-0.
INFO 03-02 01:40:15 [logger.py:42] Received request cmpl-b1bb2a1058854f0391a915b3dd2ba62d-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:40:15 [async_llm.py:261] Added request cmpl-b1bb2a1058854f0391a915b3dd2ba62d-0.
INFO 03-02 01:40:16 [logger.py:42] Received request cmpl-8648375737de478ab3936b070b700238-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:40:16 [async_llm.py:261] Added request cmpl-8648375737de478ab3936b070b700238-0.
INFO 03-02 01:40:17 [logger.py:42] Received request cmpl-671f5128e17b4e408bb39cbcca04be92-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:40:17 [async_llm.py:261] Added request cmpl-671f5128e17b4e408bb39cbcca04be92-0.
INFO 03-02 01:40:18 [loggers.py:116] Engine 000: Avg prompt throughput: 27.9 tokens/s, Avg generation throughput: 4.5 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.6%
INFO 03-02 01:40:19 [logger.py:42] Received request cmpl-78540a15eac447f3af24943ca4c38136-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:40:19 [async_llm.py:261] Added request cmpl-78540a15eac447f3af24943ca4c38136-0.
INFO 03-02 01:40:20 [logger.py:42] Received request cmpl-390b14cd8d824fe39e69f01cca87bdbf-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:40:20 [async_llm.py:261] Added request cmpl-390b14cd8d824fe39e69f01cca87bdbf-0.
INFO 03-02 01:40:21 [logger.py:42] Received request cmpl-9632f3603ea44fec98cce6b0aba464ed-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:40:21 [async_llm.py:261] Added request cmpl-9632f3603ea44fec98cce6b0aba464ed-0.
INFO 03-02 01:40:22 [logger.py:42] Received request cmpl-6c136b694ad74fd0b488fdb5c1149342-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:40:22 [async_llm.py:261] Added request cmpl-6c136b694ad74fd0b488fdb5c1149342-0.
INFO 03-02 01:40:23 [logger.py:42] Received request cmpl-097b3e06edd1416fb0c9210a573f8711-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:40:23 [async_llm.py:261] Added request cmpl-097b3e06edd1416fb0c9210a573f8711-0.
INFO 03-02 01:40:24 [logger.py:42] Received request cmpl-999677bd1511483e8c25cf0b7216c5bc-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:40:24 [async_llm.py:261] Added request cmpl-999677bd1511483e8c25cf0b7216c5bc-0.
INFO 03-02 01:40:25 [logger.py:42] Received request cmpl-0d4fa720f7bf429c81de1d1811f47d10-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:40:25 [async_llm.py:261] Added request cmpl-0d4fa720f7bf429c81de1d1811f47d10-0.
INFO 03-02 01:40:27 [logger.py:42] Received request cmpl-63aed1a91c764eaaa983001f2057bced-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:40:27 [async_llm.py:261] Added request cmpl-63aed1a91c764eaaa983001f2057bced-0.
INFO 03-02 01:40:28 [loggers.py:116] Engine 000: Avg prompt throughput: 24.8 tokens/s, Avg generation throughput: 4.0 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.6%
INFO 03-02 01:40:28 [logger.py:42] Received request cmpl-12a115f617f14394b19a1c1200c2981b-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:40:28 [async_llm.py:261] Added request cmpl-12a115f617f14394b19a1c1200c2981b-0.
INFO 03-02 01:40:29 [logger.py:42] Received request cmpl-aac9beca451c41e0836c6a371ec95106-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:40:29 [async_llm.py:261] Added request cmpl-aac9beca451c41e0836c6a371ec95106-0.
INFO 03-02 01:40:30 [logger.py:42] Received request cmpl-5e17643a414f4859b776a941a149c6de-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:40:30 [async_llm.py:261] Added request cmpl-5e17643a414f4859b776a941a149c6de-0.
INFO 03-02 01:40:31 [logger.py:42] Received request cmpl-f640abf861404947ad16bdc3705ce619-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:40:31 [async_llm.py:261] Added request cmpl-f640abf861404947ad16bdc3705ce619-0.
INFO 03-02 01:40:32 [logger.py:42] Received request cmpl-19d3d6498f2d403a9992a3772b050501-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:40:32 [async_llm.py:261] Added request cmpl-19d3d6498f2d403a9992a3772b050501-0.
INFO 03-02 01:40:34 [logger.py:42] Received request cmpl-04d09399d53d4fa1a0d466ae94903eef-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:40:34 [async_llm.py:261] Added request cmpl-04d09399d53d4fa1a0d466ae94903eef-0.
INFO 03-02 01:40:35 [logger.py:42] Received request cmpl-650a7486755f48bda4d19361ba7ea326-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:40:35 [async_llm.py:261] Added request cmpl-650a7486755f48bda4d19361ba7ea326-0.
INFO 03-02 01:40:36 [logger.py:42] Received request cmpl-d0fc648d26be4cd9856c7064b534a572-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:40:36 [async_llm.py:261] Added request cmpl-d0fc648d26be4cd9856c7064b534a572-0.
INFO 03-02 01:40:37 [logger.py:42] Received request cmpl-5d6fce154a844ff689826f8395f69015-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:40:37 [async_llm.py:261] Added request cmpl-5d6fce154a844ff689826f8395f69015-0.
INFO 03-02 01:40:38 [loggers.py:116] Engine 000: Avg prompt throughput: 27.9 tokens/s, Avg generation throughput: 4.5 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.6%
INFO 03-02 01:40:38 [logger.py:42] Received request cmpl-f1636bf4873d40e39fc4e296b95bb770-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:40:38 [async_llm.py:261] Added request cmpl-f1636bf4873d40e39fc4e296b95bb770-0.
INFO 03-02 01:40:39 [logger.py:42] Received request cmpl-aea529f9657245698223444acc64087a-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:40:39 [async_llm.py:261] Added request cmpl-aea529f9657245698223444acc64087a-0.
INFO 03-02 01:40:40 [logger.py:42] Received request cmpl-c4f653089a2c4570bd48f098db5b034d-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:40:40 [async_llm.py:261] Added request cmpl-c4f653089a2c4570bd48f098db5b034d-0.
INFO 03-02 01:40:42 [logger.py:42] Received request cmpl-92f4e55f808f47a19fc5b805a6bba89a-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:40:42 [async_llm.py:261] Added request cmpl-92f4e55f808f47a19fc5b805a6bba89a-0.
INFO 03-02 01:40:43 [logger.py:42] Received request cmpl-a24a6f8e10ad4fda9ac35da78dab65f3-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:40:43 [async_llm.py:261] Added request cmpl-a24a6f8e10ad4fda9ac35da78dab65f3-0.
INFO 03-02 01:40:44 [logger.py:42] Received request cmpl-43371212949240ccaae85215edc96a60-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:40:44 [async_llm.py:261] Added request cmpl-43371212949240ccaae85215edc96a60-0.
INFO 03-02 01:40:45 [logger.py:42] Received request cmpl-93aa8b0901d94935a0fb652bb30f13bb-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:40:45 [async_llm.py:261] Added request cmpl-93aa8b0901d94935a0fb652bb30f13bb-0.
INFO 03-02 01:40:46 [logger.py:42] Received request cmpl-c17fb5ec0b6e4418affa30be4ddcce35-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:40:46 [async_llm.py:261] Added request cmpl-c17fb5ec0b6e4418affa30be4ddcce35-0.
INFO 03-02 01:40:47 [logger.py:42] Received request cmpl-a2c0aa0a9c4c45cea7312192165f9a63-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:40:47 [async_llm.py:261] Added request cmpl-a2c0aa0a9c4c45cea7312192165f9a63-0.
INFO 03-02 01:40:48 [loggers.py:116] Engine 000: Avg prompt throughput: 27.9 tokens/s, Avg generation throughput: 4.5 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.6%
INFO 03-02 01:40:49 [logger.py:42] Received request cmpl-c6a4a8f383d54e7dace5a4a1bffb2047-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:40:49 [async_llm.py:261] Added request cmpl-c6a4a8f383d54e7dace5a4a1bffb2047-0.
INFO 03-02 01:40:50 [logger.py:42] Received request cmpl-29cf22cc4a254155a22b30c83b17e099-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:40:50 [async_llm.py:261] Added request cmpl-29cf22cc4a254155a22b30c83b17e099-0.
INFO 03-02 01:40:51 [logger.py:42] Received request cmpl-3105533976294703bc909f46f5eb0542-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:40:51 [async_llm.py:261] Added request cmpl-3105533976294703bc909f46f5eb0542-0.
INFO 03-02 01:40:52 [logger.py:42] Received request cmpl-9c75ba27621749dfbeac80452e68bb6a-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:40:52 [async_llm.py:261] Added request cmpl-9c75ba27621749dfbeac80452e68bb6a-0.
INFO 03-02 01:40:53 [logger.py:42] Received request cmpl-92261a59780d4536b1b725091fe22d9e-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:40:53 [async_llm.py:261] Added request cmpl-92261a59780d4536b1b725091fe22d9e-0.
INFO 03-02 01:40:54 [logger.py:42] Received request cmpl-215f7801ac15475582f2ebdbedf27f96-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:40:54 [async_llm.py:261] Added request cmpl-215f7801ac15475582f2ebdbedf27f96-0.
INFO 03-02 01:40:55 [logger.py:42] Received request cmpl-c6fbb676a5a149a5a32223c457722b96-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:40:55 [async_llm.py:261] Added request cmpl-c6fbb676a5a149a5a32223c457722b96-0.
INFO 03-02 01:40:57 [logger.py:42] Received request cmpl-d31d487ebabb4354b5351f0ef3605d45-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:40:57 [async_llm.py:261] Added request cmpl-d31d487ebabb4354b5351f0ef3605d45-0.
INFO 03-02 01:40:58 [loggers.py:116] Engine 000: Avg prompt throughput: 24.8 tokens/s, Avg generation throughput: 4.0 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.6%
INFO 03-02 01:40:58 [logger.py:42] Received request cmpl-2f1c61a106244419b5bd1fee5e3b0521-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:40:58 [async_llm.py:261] Added request cmpl-2f1c61a106244419b5bd1fee5e3b0521-0.
INFO 03-02 01:40:59 [logger.py:42] Received request cmpl-2af93defdcaa4ad9acba36c78a118950-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:40:59 [async_llm.py:261] Added request cmpl-2af93defdcaa4ad9acba36c78a118950-0.
INFO 03-02 01:41:00 [logger.py:42] Received request cmpl-b134227daa844441a3f193368fdb8f05-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:41:00 [async_llm.py:261] Added request cmpl-b134227daa844441a3f193368fdb8f05-0.
INFO 03-02 01:41:01 [logger.py:42] Received request cmpl-d49b8a0c5cf44338920324da5d1056ba-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:41:01 [async_llm.py:261] Added request cmpl-d49b8a0c5cf44338920324da5d1056ba-0.
INFO 03-02 01:41:02 [logger.py:42] Received request cmpl-779627b5adac4f629472e21a11909bf4-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:41:02 [async_llm.py:261] Added request cmpl-779627b5adac4f629472e21a11909bf4-0.
INFO 03-02 01:41:04 [logger.py:42] Received request cmpl-dc55a16e72034ab78d79b1e0df1baf17-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:41:04 [async_llm.py:261] Added request cmpl-dc55a16e72034ab78d79b1e0df1baf17-0.
INFO 03-02 01:41:05 [logger.py:42] Received request cmpl-c1fff238107447d99f19b64aefb2ecb4-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:41:05 [async_llm.py:261] Added request cmpl-c1fff238107447d99f19b64aefb2ecb4-0.
INFO 03-02 01:41:06 [logger.py:42] Received request cmpl-d67ae889ea9a48708bc876ba28eb3066-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:41:06 [async_llm.py:261] Added request cmpl-d67ae889ea9a48708bc876ba28eb3066-0.
INFO 03-02 01:41:07 [logger.py:42] Received request cmpl-2160e43b9f4e49199cd86c83561388ce-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:41:07 [async_llm.py:261] Added request cmpl-2160e43b9f4e49199cd86c83561388ce-0.
INFO 03-02 01:41:08 [loggers.py:116] Engine 000: Avg prompt throughput: 27.9 tokens/s, Avg generation throughput: 4.5 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.6%
INFO 03-02 01:41:08 [logger.py:42] Received request cmpl-9d960b736fa248068fb76baf5a6ff07f-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:41:08 [async_llm.py:261] Added request cmpl-9d960b736fa248068fb76baf5a6ff07f-0.
INFO 03-02 01:41:09 [logger.py:42] Received request cmpl-b19314b79a0d47d78b5651ae30cb2d24-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:41:09 [async_llm.py:261] Added request cmpl-b19314b79a0d47d78b5651ae30cb2d24-0.
INFO 03-02 01:41:10 [logger.py:42] Received request cmpl-c96dd3c940764734b5440daf195f5cc5-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:41:10 [async_llm.py:261] Added request cmpl-c96dd3c940764734b5440daf195f5cc5-0.
INFO 03-02 01:41:12 [logger.py:42] Received request cmpl-3472006fa2644ec893f9ab2cf8a0ffaa-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:41:12 [async_llm.py:261] Added request cmpl-3472006fa2644ec893f9ab2cf8a0ffaa-0.
INFO 03-02 01:41:13 [logger.py:42] Received request cmpl-42d61c4bfb0942578dc0b462686dd273-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:41:13 [async_llm.py:261] Added request cmpl-42d61c4bfb0942578dc0b462686dd273-0.
INFO 03-02 01:41:14 [logger.py:42] Received request cmpl-0c7e963927194c878348d2f673674556-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:41:14 [async_llm.py:261] Added request cmpl-0c7e963927194c878348d2f673674556-0.
INFO 03-02 01:41:15 [logger.py:42] Received request cmpl-bd8cdc20e8bd406c95a4d59667c0d55e-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:41:15 [async_llm.py:261] Added request cmpl-bd8cdc20e8bd406c95a4d59667c0d55e-0.
INFO 03-02 01:41:16 [logger.py:42] Received request cmpl-7e94518ca0704c0187ec276cef46bb75-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:41:16 [async_llm.py:261] Added request cmpl-7e94518ca0704c0187ec276cef46bb75-0.
INFO 03-02 01:41:17 [logger.py:42] Received request cmpl-ad696c859d9346ec87fe8961f28a9529-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:41:17 [async_llm.py:261] Added request cmpl-ad696c859d9346ec87fe8961f28a9529-0.
INFO 03-02 01:41:18 [loggers.py:116] Engine 000: Avg prompt throughput: 27.9 tokens/s, Avg generation throughput: 4.5 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.6%
INFO 03-02 01:41:19 [logger.py:42] Received request cmpl-efd737792f6941c49976e7bfbf835d9f-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:41:19 [async_llm.py:261] Added request cmpl-efd737792f6941c49976e7bfbf835d9f-0.
INFO 03-02 01:41:20 [logger.py:42] Received request cmpl-3132592466f74ab38a800e228f72df21-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:41:20 [async_llm.py:261] Added request cmpl-3132592466f74ab38a800e228f72df21-0.
INFO 03-02 01:41:21 [logger.py:42] Received request cmpl-fcf6f7fde3bc44cea16531435ed96fa4-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:41:21 [async_llm.py:261] Added request cmpl-fcf6f7fde3bc44cea16531435ed96fa4-0.
INFO 03-02 01:41:22 [logger.py:42] Received request cmpl-82176921d2594a6ab3910ff43642815a-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:41:22 [async_llm.py:261] Added request cmpl-82176921d2594a6ab3910ff43642815a-0.
INFO 03-02 01:41:23 [logger.py:42] Received request cmpl-882f28c2070a4e76aec1d77b6a33aa4f-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:41:23 [async_llm.py:261] Added request cmpl-882f28c2070a4e76aec1d77b6a33aa4f-0.
INFO 03-02 01:41:24 [logger.py:42] Received request cmpl-6cae8a74abe34abe90b589c0e85452c6-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:41:24 [async_llm.py:261] Added request cmpl-6cae8a74abe34abe90b589c0e85452c6-0.
INFO 03-02 01:41:26 [logger.py:42] Received request cmpl-e33cdb9d7ab445a9bd27760d26e14cac-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:41:26 [async_llm.py:261] Added request cmpl-e33cdb9d7ab445a9bd27760d26e14cac-0.
INFO 03-02 01:41:27 [logger.py:42] Received request cmpl-08ab4a690642489fa89b83bad9d920dd-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:41:27 [async_llm.py:261] Added request cmpl-08ab4a690642489fa89b83bad9d920dd-0.
INFO 03-02 01:41:28 [loggers.py:116] Engine 000: Avg prompt throughput: 24.8 tokens/s, Avg generation throughput: 4.0 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.6%
INFO 03-02 01:41:28 [logger.py:42] Received request cmpl-06e53a73b24646f8806c19ed97caf440-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:41:28 [async_llm.py:261] Added request cmpl-06e53a73b24646f8806c19ed97caf440-0.
INFO 03-02 01:41:29 [logger.py:42] Received request cmpl-4e1af61d01b74e9b9f018d5c692bd266-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:41:29 [async_llm.py:261] Added request cmpl-4e1af61d01b74e9b9f018d5c692bd266-0.
INFO 03-02 01:41:30 [logger.py:42] Received request cmpl-cb4e6f42bfcb4783abd999ed02fbf324-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:41:30 [async_llm.py:261] Added request cmpl-cb4e6f42bfcb4783abd999ed02fbf324-0.
INFO 03-02 01:41:31 [logger.py:42] Received request cmpl-611a16ad15b3474285f3d7d81381546a-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:41:31 [async_llm.py:261] Added request cmpl-611a16ad15b3474285f3d7d81381546a-0.
INFO 03-02 01:41:32 [logger.py:42] Received request cmpl-695e6cc410fb487c8e096d7b909465d5-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:41:32 [async_llm.py:261] Added request cmpl-695e6cc410fb487c8e096d7b909465d5-0.
INFO 03-02 01:41:34 [logger.py:42] Received request cmpl-7af551a9de6448c28b2d464025e3b7e5-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:41:34 [async_llm.py:261] Added request cmpl-7af551a9de6448c28b2d464025e3b7e5-0.
INFO 03-02 01:41:35 [logger.py:42] Received request cmpl-e836ce1eb4c8409c8ef0a11b99edc3a3-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:41:35 [async_llm.py:261] Added request cmpl-e836ce1eb4c8409c8ef0a11b99edc3a3-0.
INFO 03-02 01:41:36 [logger.py:42] Received request cmpl-8e640c454dea4995b509268aeb254473-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:41:36 [async_llm.py:261] Added request cmpl-8e640c454dea4995b509268aeb254473-0.
INFO 03-02 01:41:37 [logger.py:42] Received request cmpl-7670255de4bd4de6b3662874f4eed320-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:41:37 [async_llm.py:261] Added request cmpl-7670255de4bd4de6b3662874f4eed320-0.
INFO 03-02 01:41:38 [loggers.py:116] Engine 000: Avg prompt throughput: 27.9 tokens/s, Avg generation throughput: 4.5 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.6%
INFO 03-02 01:41:38 [logger.py:42] Received request cmpl-830a0df55c2c4cd2b5f972b35fc6e31e-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:41:38 [async_llm.py:261] Added request cmpl-830a0df55c2c4cd2b5f972b35fc6e31e-0.
INFO 03-02 01:41:39 [logger.py:42] Received request cmpl-c137d6b8124b4c0fa2048505bab49a66-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:41:39 [async_llm.py:261] Added request cmpl-c137d6b8124b4c0fa2048505bab49a66-0.
INFO 03-02 01:41:41 [logger.py:42] Received request cmpl-416826556339438ea4ffd5c5bab10d80-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:41:41 [async_llm.py:261] Added request cmpl-416826556339438ea4ffd5c5bab10d80-0.
INFO 03-02 01:41:42 [logger.py:42] Received request cmpl-fddea1dc42bd498b982127b6eff99d9a-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:41:42 [async_llm.py:261] Added request cmpl-fddea1dc42bd498b982127b6eff99d9a-0.
INFO 03-02 01:41:43 [logger.py:42] Received request cmpl-545a955113944bd19297ef42b4ad7cdd-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:41:43 [async_llm.py:261] Added request cmpl-545a955113944bd19297ef42b4ad7cdd-0.
INFO 03-02 01:41:44 [logger.py:42] Received request cmpl-ffbb5717caa44b079cae19143f49935a-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:41:44 [async_llm.py:261] Added request cmpl-ffbb5717caa44b079cae19143f49935a-0.
INFO 03-02 01:41:45 [logger.py:42] Received request cmpl-0e0f693c5477424388a6b003965b202b-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:41:45 [async_llm.py:261] Added request cmpl-0e0f693c5477424388a6b003965b202b-0.
INFO 03-02 01:41:46 [logger.py:42] Received request cmpl-be20252d28b24488b1451f341d761652-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:41:46 [async_llm.py:261] Added request cmpl-be20252d28b24488b1451f341d761652-0.
INFO 03-02 01:41:47 [logger.py:42] Received request cmpl-447044b068f641649f46794684ad3ae1-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:41:47 [async_llm.py:261] Added request cmpl-447044b068f641649f46794684ad3ae1-0.
INFO 03-02 01:41:48 [loggers.py:116] Engine 000: Avg prompt throughput: 27.9 tokens/s, Avg generation throughput: 4.5 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.6%
INFO 03-02 01:41:49 [logger.py:42] Received request cmpl-8055e280ef9a4f8bbb8efdfcfcb10747-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:41:49 [async_llm.py:261] Added request cmpl-8055e280ef9a4f8bbb8efdfcfcb10747-0.
INFO 03-02 01:41:50 [logger.py:42] Received request cmpl-b8820b6bc04140e0830699f5fd57be40-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:41:50 [async_llm.py:261] Added request cmpl-b8820b6bc04140e0830699f5fd57be40-0.
INFO 03-02 01:41:51 [logger.py:42] Received request cmpl-0e14dd87de204032b2ee5eac0df00044-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:41:51 [async_llm.py:261] Added request cmpl-0e14dd87de204032b2ee5eac0df00044-0.
INFO 03-02 01:41:52 [logger.py:42] Received request cmpl-bb80485d230a4133adcfddfd5ef126be-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:41:52 [async_llm.py:261] Added request cmpl-bb80485d230a4133adcfddfd5ef126be-0.
INFO 03-02 01:41:53 [logger.py:42] Received request cmpl-f38d6923e0a9427bbe0f5b81dba85ec9-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:41:53 [async_llm.py:261] Added request cmpl-f38d6923e0a9427bbe0f5b81dba85ec9-0.
INFO 03-02 01:41:54 [logger.py:42] Received request cmpl-ddd66354b28547549a858e6da9976a5c-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:41:54 [async_llm.py:261] Added request cmpl-ddd66354b28547549a858e6da9976a5c-0.
INFO 03-02 01:41:55 [logger.py:42] Received request cmpl-29d132c2b3df4798b757266d63caef82-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:41:55 [async_llm.py:261] Added request cmpl-29d132c2b3df4798b757266d63caef82-0.
INFO 03-02 01:41:57 [logger.py:42] Received request cmpl-eb48674d11b343f08f8ad1ee17e2fd71-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:41:57 [async_llm.py:261] Added request cmpl-eb48674d11b343f08f8ad1ee17e2fd71-0.
INFO 03-02 01:41:58 [loggers.py:116] Engine 000: Avg prompt throughput: 24.8 tokens/s, Avg generation throughput: 4.0 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.6%
INFO 03-02 01:41:58 [logger.py:42] Received request cmpl-6096c5bb60af44dba3801d8f89a28ad3-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:41:58 [async_llm.py:261] Added request cmpl-6096c5bb60af44dba3801d8f89a28ad3-0.
INFO 03-02 01:41:59 [logger.py:42] Received request cmpl-ebd59de7fe7a4f20aaf77f580a98d8d2-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:41:59 [async_llm.py:261] Added request cmpl-ebd59de7fe7a4f20aaf77f580a98d8d2-0.
INFO 03-02 01:42:00 [logger.py:42] Received request cmpl-d6683d4cb0a64b2e8ea10d95ad77a1a3-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:42:00 [async_llm.py:261] Added request cmpl-d6683d4cb0a64b2e8ea10d95ad77a1a3-0.
INFO 03-02 01:42:01 [logger.py:42] Received request cmpl-7730968a88fe4f5ab3dbfdfaa1d3aaa4-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:42:01 [async_llm.py:261] Added request cmpl-7730968a88fe4f5ab3dbfdfaa1d3aaa4-0.
INFO 03-02 01:42:02 [logger.py:42] Received request cmpl-f9fb6e304233422ab0e15b0be24a1122-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:42:02 [async_llm.py:261] Added request cmpl-f9fb6e304233422ab0e15b0be24a1122-0.
INFO 03-02 01:42:04 [logger.py:42] Received request cmpl-301fec7f8a594b5780415ccfefa35ead-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:42:04 [async_llm.py:261] Added request cmpl-301fec7f8a594b5780415ccfefa35ead-0.
INFO 03-02 01:42:05 [logger.py:42] Received request cmpl-cb61e58701854c53aa7045a04cfd974a-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:42:05 [async_llm.py:261] Added request cmpl-cb61e58701854c53aa7045a04cfd974a-0.
INFO 03-02 01:42:06 [logger.py:42] Received request cmpl-9ca74a60726a42e0883ae3263ad60129-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:42:06 [async_llm.py:261] Added request cmpl-9ca74a60726a42e0883ae3263ad60129-0.
INFO 03-02 01:42:07 [logger.py:42] Received request cmpl-b0cbfab0a39240cf833a06866f9d8d9d-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:42:07 [async_llm.py:261] Added request cmpl-b0cbfab0a39240cf833a06866f9d8d9d-0.
INFO 03-02 01:42:08 [loggers.py:116] Engine 000: Avg prompt throughput: 27.9 tokens/s, Avg generation throughput: 4.5 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.6%
INFO 03-02 01:42:08 [logger.py:42] Received request cmpl-51261b9b342e4de9932a856b805d4377-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:42:08 [async_llm.py:261] Added request cmpl-51261b9b342e4de9932a856b805d4377-0.
INFO 03-02 01:42:09 [logger.py:42] Received request cmpl-f93f0ede1a5141e0a427957f06c568f7-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:42:09 [async_llm.py:261] Added request cmpl-f93f0ede1a5141e0a427957f06c568f7-0.
INFO 03-02 01:42:11 [logger.py:42] Received request cmpl-cdfbf1a41b1f4206b479533e98507b6c-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:42:11 [async_llm.py:261] Added request cmpl-cdfbf1a41b1f4206b479533e98507b6c-0.
INFO 03-02 01:42:12 [logger.py:42] Received request cmpl-f33ae6bf57484acead24c9f582c07163-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:42:12 [async_llm.py:261] Added request cmpl-f33ae6bf57484acead24c9f582c07163-0.
INFO 03-02 01:42:13 [logger.py:42] Received request cmpl-f6f2fa9df779436cabcc6bd9683ecdd1-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:42:13 [async_llm.py:261] Added request cmpl-f6f2fa9df779436cabcc6bd9683ecdd1-0.
INFO 03-02 01:42:14 [logger.py:42] Received request cmpl-ab1d864716bd4b1e91c1de1a4c4298a6-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:42:14 [async_llm.py:261] Added request cmpl-ab1d864716bd4b1e91c1de1a4c4298a6-0.
INFO 03-02 01:42:15 [logger.py:42] Received request cmpl-8f62842e395e4591abf8a9a0aa2e3ae4-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:42:15 [async_llm.py:261] Added request cmpl-8f62842e395e4591abf8a9a0aa2e3ae4-0.
INFO 03-02 01:42:16 [logger.py:42] Received request cmpl-28a6ee7cfd4540a5b372cccde381513e-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:42:16 [async_llm.py:261] Added request cmpl-28a6ee7cfd4540a5b372cccde381513e-0.
INFO 03-02 01:42:17 [logger.py:42] Received request cmpl-fde57004fde4414fa84eb9362069a8b7-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:42:17 [async_llm.py:261] Added request cmpl-fde57004fde4414fa84eb9362069a8b7-0.
INFO 03-02 01:42:18 [loggers.py:116] Engine 000: Avg prompt throughput: 27.9 tokens/s, Avg generation throughput: 4.5 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.6%
INFO 03-02 01:42:19 [logger.py:42] Received request cmpl-dd876c51594f4797a434e9e869aa4080-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:42:19 [async_llm.py:261] Added request cmpl-dd876c51594f4797a434e9e869aa4080-0.
INFO 03-02 01:42:20 [logger.py:42] Received request cmpl-4e0767adf9e64d64af70b2387d368067-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:42:20 [async_llm.py:261] Added request cmpl-4e0767adf9e64d64af70b2387d368067-0.
INFO 03-02 01:42:21 [logger.py:42] Received request cmpl-9815574810e84e3a86af1738d7df80fb-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:42:21 [async_llm.py:261] Added request cmpl-9815574810e84e3a86af1738d7df80fb-0.
INFO 03-02 01:42:22 [logger.py:42] Received request cmpl-1449fb5c22dd4036936119862140f130-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:42:22 [async_llm.py:261] Added request cmpl-1449fb5c22dd4036936119862140f130-0.
INFO 03-02 01:42:23 [logger.py:42] Received request cmpl-08ba784f868f4f3d8397e60a7dfc3c23-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:42:23 [async_llm.py:261] Added request cmpl-08ba784f868f4f3d8397e60a7dfc3c23-0.
INFO 03-02 01:42:24 [logger.py:42] Received request cmpl-3ad21cefe8eb45cc9f777ff4ac7d95db-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:42:24 [async_llm.py:261] Added request cmpl-3ad21cefe8eb45cc9f777ff4ac7d95db-0.
INFO 03-02 01:42:26 [logger.py:42] Received request cmpl-ba53961b5e024c3f80178a83e19f119d-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:42:26 [async_llm.py:261] Added request cmpl-ba53961b5e024c3f80178a83e19f119d-0.
INFO 03-02 01:42:27 [logger.py:42] Received request cmpl-0ae75383e3c04342bf25e05b1c442bf3-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:42:27 [async_llm.py:261] Added request cmpl-0ae75383e3c04342bf25e05b1c442bf3-0.
INFO 03-02 01:42:28 [loggers.py:116] Engine 000: Avg prompt throughput: 24.8 tokens/s, Avg generation throughput: 4.0 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.6%
INFO 03-02 01:42:28 [logger.py:42] Received request cmpl-d5f20459c6dc4068a08f2ed686594ce4-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:42:28 [async_llm.py:261] Added request cmpl-d5f20459c6dc4068a08f2ed686594ce4-0.
INFO 03-02 01:42:29 [logger.py:42] Received request cmpl-6ae0551edb5e406d983404d734927e58-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:42:29 [async_llm.py:261] Added request cmpl-6ae0551edb5e406d983404d734927e58-0.
INFO 03-02 01:42:30 [logger.py:42] Received request cmpl-b16559787c1246b691a8bf3b510bf02d-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:42:30 [async_llm.py:261] Added request cmpl-b16559787c1246b691a8bf3b510bf02d-0.
INFO 03-02 01:42:31 [logger.py:42] Received request cmpl-a42c4f16be0d49679efff94db08a58bd-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:42:31 [async_llm.py:261] Added request cmpl-a42c4f16be0d49679efff94db08a58bd-0.
INFO 03-02 01:42:32 [logger.py:42] Received request cmpl-03ba1738794d40c3b1bff3d967099bff-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:42:32 [async_llm.py:261] Added request cmpl-03ba1738794d40c3b1bff3d967099bff-0.
INFO 03-02 01:42:34 [logger.py:42] Received request cmpl-9b5f25153d8149d0b52e9fe501bc1367-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:42:34 [async_llm.py:261] Added request cmpl-9b5f25153d8149d0b52e9fe501bc1367-0.
INFO 03-02 01:42:35 [logger.py:42] Received request cmpl-858f609ceaca425385855a7ad0af49c2-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:42:35 [async_llm.py:261] Added request cmpl-858f609ceaca425385855a7ad0af49c2-0.
INFO 03-02 01:42:36 [logger.py:42] Received request cmpl-0eea7a9204ae47ef8cef6bb8c635e14a-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:42:36 [async_llm.py:261] Added request cmpl-0eea7a9204ae47ef8cef6bb8c635e14a-0.
INFO 03-02 01:42:37 [logger.py:42] Received request cmpl-dfa71de98bf64b5fb2851b8d5a383f6a-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:42:37 [async_llm.py:261] Added request cmpl-dfa71de98bf64b5fb2851b8d5a383f6a-0.
INFO 03-02 01:42:38 [loggers.py:116] Engine 000: Avg prompt throughput: 27.9 tokens/s, Avg generation throughput: 4.5 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.6%
INFO 03-02 01:42:38 [logger.py:42] Received request cmpl-3011e3393ebb46aea0d0fa57f7e18e98-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:42:38 [async_llm.py:261] Added request cmpl-3011e3393ebb46aea0d0fa57f7e18e98-0.
INFO 03-02 01:42:39 [logger.py:42] Received request cmpl-ca676a29b9d64ee29f834461a8c22527-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:42:39 [async_llm.py:261] Added request cmpl-ca676a29b9d64ee29f834461a8c22527-0.
INFO 03-02 01:42:41 [logger.py:42] Received request cmpl-9907a72be9bf427bbd7b8623a5c642ad-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:42:41 [async_llm.py:261] Added request cmpl-9907a72be9bf427bbd7b8623a5c642ad-0.
INFO 03-02 01:42:42 [logger.py:42] Received request cmpl-e54bf0a5b1b94640b442e72827ff6884-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:42:42 [async_llm.py:261] Added request cmpl-e54bf0a5b1b94640b442e72827ff6884-0.
INFO 03-02 01:42:43 [logger.py:42] Received request cmpl-8c7cdba4e0224184bc6cb79a6e56a74d-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:42:43 [async_llm.py:261] Added request cmpl-8c7cdba4e0224184bc6cb79a6e56a74d-0.
INFO 03-02 01:42:44 [logger.py:42] Received request cmpl-718e405886be4ec592df0e28b11b8935-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:42:44 [async_llm.py:261] Added request cmpl-718e405886be4ec592df0e28b11b8935-0.
INFO 03-02 01:42:45 [logger.py:42] Received request cmpl-925ec993b50544e3b3c7b59c9e2e449c-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:42:45 [async_llm.py:261] Added request cmpl-925ec993b50544e3b3c7b59c9e2e449c-0.
INFO 03-02 01:42:46 [logger.py:42] Received request cmpl-7f08c28d21bd4055940b0b163e555f9b-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:42:46 [async_llm.py:261] Added request cmpl-7f08c28d21bd4055940b0b163e555f9b-0.
INFO 03-02 01:42:47 [logger.py:42] Received request cmpl-34ed2b4502b2492189386d57f0e29f2e-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:42:47 [async_llm.py:261] Added request cmpl-34ed2b4502b2492189386d57f0e29f2e-0.
INFO 03-02 01:42:48 [loggers.py:116] Engine 000: Avg prompt throughput: 27.9 tokens/s, Avg generation throughput: 4.5 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.6%
INFO 03-02 01:42:49 [logger.py:42] Received request cmpl-972a539eab804984beeb169c04834ab2-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:42:49 [async_llm.py:261] Added request cmpl-972a539eab804984beeb169c04834ab2-0.
INFO 03-02 01:42:50 [logger.py:42] Received request cmpl-4dd31c2991074d0486f599a3a3aef983-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:42:50 [async_llm.py:261] Added request cmpl-4dd31c2991074d0486f599a3a3aef983-0.
INFO 03-02 01:42:51 [logger.py:42] Received request cmpl-fb9cddf44b63420ebb6aab2ce5fa6f47-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:42:51 [async_llm.py:261] Added request cmpl-fb9cddf44b63420ebb6aab2ce5fa6f47-0.
INFO 03-02 01:42:52 [logger.py:42] Received request cmpl-fecc3a12e30c44988bc7522bb4968235-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:42:52 [async_llm.py:261] Added request cmpl-fecc3a12e30c44988bc7522bb4968235-0.
INFO 03-02 01:42:53 [logger.py:42] Received request cmpl-b20e1d636c0b42d2ae79bb860269cdcb-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:42:53 [async_llm.py:261] Added request cmpl-b20e1d636c0b42d2ae79bb860269cdcb-0.
INFO 03-02 01:42:54 [logger.py:42] Received request cmpl-a87319492b2a42e5a33ccaa0078ebd1c-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:42:54 [async_llm.py:261] Added request cmpl-a87319492b2a42e5a33ccaa0078ebd1c-0.
INFO 03-02 01:42:56 [logger.py:42] Received request cmpl-7a651347d0964a7ba8c18f1e1c551fdb-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:42:56 [async_llm.py:261] Added request cmpl-7a651347d0964a7ba8c18f1e1c551fdb-0.
INFO 03-02 01:42:57 [logger.py:42] Received request cmpl-ca0587aaca0941c7903ccdf55183915a-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:42:57 [async_llm.py:261] Added request cmpl-ca0587aaca0941c7903ccdf55183915a-0.
INFO 03-02 01:42:58 [loggers.py:116] Engine 000: Avg prompt throughput: 24.8 tokens/s, Avg generation throughput: 4.0 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.6%
INFO 03-02 01:42:58 [logger.py:42] Received request cmpl-30daea0471f242eb9dabbe81b22c76e3-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:42:58 [async_llm.py:261] Added request cmpl-30daea0471f242eb9dabbe81b22c76e3-0.
INFO 03-02 01:42:59 [logger.py:42] Received request cmpl-d2102aa6c18549c691d2887edcf24a0b-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:42:59 [async_llm.py:261] Added request cmpl-d2102aa6c18549c691d2887edcf24a0b-0.
INFO 03-02 01:43:00 [logger.py:42] Received request cmpl-52068afdde5a464c9a7fcf29167989b1-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:43:00 [async_llm.py:261] Added request cmpl-52068afdde5a464c9a7fcf29167989b1-0.
INFO 03-02 01:43:01 [logger.py:42] Received request cmpl-cc623c95f69e40dbb91de2b0821ae9cb-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:43:01 [async_llm.py:261] Added request cmpl-cc623c95f69e40dbb91de2b0821ae9cb-0.
INFO 03-02 01:43:02 [logger.py:42] Received request cmpl-eea73f88f41d4f50a7f545d341a6d49b-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:43:02 [async_llm.py:261] Added request cmpl-eea73f88f41d4f50a7f545d341a6d49b-0.
INFO 03-02 01:43:04 [logger.py:42] Received request cmpl-fdfdc5bc9984483699c43c673e6c83cc-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:43:04 [async_llm.py:261] Added request cmpl-fdfdc5bc9984483699c43c673e6c83cc-0.
INFO 03-02 01:43:05 [logger.py:42] Received request cmpl-a1663e2e2f4646169d0a6fe3dc6507cd-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:43:05 [async_llm.py:261] Added request cmpl-a1663e2e2f4646169d0a6fe3dc6507cd-0.
INFO 03-02 01:43:06 [logger.py:42] Received request cmpl-f60e948c51fa4860b213a9bb7457cace-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:43:06 [async_llm.py:261] Added request cmpl-f60e948c51fa4860b213a9bb7457cace-0.
INFO 03-02 01:43:07 [logger.py:42] Received request cmpl-80bf758b6ac240a0a7cfb9646c32fcac-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:43:07 [async_llm.py:261] Added request cmpl-80bf758b6ac240a0a7cfb9646c32fcac-0.
INFO 03-02 01:43:08 [loggers.py:116] Engine 000: Avg prompt throughput: 27.9 tokens/s, Avg generation throughput: 4.5 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.6%
INFO 03-02 01:43:08 [logger.py:42] Received request cmpl-2ba0caa0cf7d47d9b4cf649ecd65b16d-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:43:08 [async_llm.py:261] Added request cmpl-2ba0caa0cf7d47d9b4cf649ecd65b16d-0.
INFO 03-02 01:43:09 [logger.py:42] Received request cmpl-843e440dbaca407cbaff28ca4e012230-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:43:09 [async_llm.py:261] Added request cmpl-843e440dbaca407cbaff28ca4e012230-0.
INFO 03-02 01:43:11 [logger.py:42] Received request cmpl-16a031a4113543309d42349c89eb762a-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:43:11 [async_llm.py:261] Added request cmpl-16a031a4113543309d42349c89eb762a-0.
INFO 03-02 01:43:12 [logger.py:42] Received request cmpl-6a5e9a6d3a87460994d4ed64f9e04082-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:43:12 [async_llm.py:261] Added request cmpl-6a5e9a6d3a87460994d4ed64f9e04082-0.
INFO 03-02 01:43:13 [logger.py:42] Received request cmpl-dae7727eb1a24ae6b2ebc27f1a6dea41-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:43:13 [async_llm.py:261] Added request cmpl-dae7727eb1a24ae6b2ebc27f1a6dea41-0.
INFO 03-02 01:43:14 [logger.py:42] Received request cmpl-cd0f20bf9d0545f3a0528dc16993d726-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:43:14 [async_llm.py:261] Added request cmpl-cd0f20bf9d0545f3a0528dc16993d726-0.
INFO 03-02 01:43:15 [logger.py:42] Received request cmpl-87582f8e6a474ff48523a287d1263cef-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:43:15 [async_llm.py:261] Added request cmpl-87582f8e6a474ff48523a287d1263cef-0.
INFO 03-02 01:43:16 [logger.py:42] Received request cmpl-0d27a66586cc4d5f845126409e85e647-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:43:16 [async_llm.py:261] Added request cmpl-0d27a66586cc4d5f845126409e85e647-0.
INFO 03-02 01:43:17 [logger.py:42] Received request cmpl-39b4499fec2241d48997f08790184149-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:43:17 [async_llm.py:261] Added request cmpl-39b4499fec2241d48997f08790184149-0.
INFO 03-02 01:43:18 [loggers.py:116] Engine 000: Avg prompt throughput: 27.9 tokens/s, Avg generation throughput: 4.5 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.6%
INFO 03-02 01:43:19 [logger.py:42] Received request cmpl-6512a61b5ca34449a75bf02026f22234-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:43:19 [async_llm.py:261] Added request cmpl-6512a61b5ca34449a75bf02026f22234-0.
INFO 03-02 01:43:20 [logger.py:42] Received request cmpl-adf98e8c8f414382b0e8ba40a57cbbd1-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:43:20 [async_llm.py:261] Added request cmpl-adf98e8c8f414382b0e8ba40a57cbbd1-0.
INFO 03-02 01:43:21 [logger.py:42] Received request cmpl-54afb8eda72b4d24b18995968a8ef978-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:43:21 [async_llm.py:261] Added request cmpl-54afb8eda72b4d24b18995968a8ef978-0.
INFO 03-02 01:43:22 [logger.py:42] Received request cmpl-9d4af825c43c48a09e1799cd894f5f57-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:43:22 [async_llm.py:261] Added request cmpl-9d4af825c43c48a09e1799cd894f5f57-0.
INFO 03-02 01:43:23 [logger.py:42] Received request cmpl-fdb84f4320084606ac5f3c1b9d687c9e-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:43:23 [async_llm.py:261] Added request cmpl-fdb84f4320084606ac5f3c1b9d687c9e-0.
INFO 03-02 01:43:24 [logger.py:42] Received request cmpl-10cf07612296447b9858ba8079c59b9e-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:43:24 [async_llm.py:261] Added request cmpl-10cf07612296447b9858ba8079c59b9e-0.
INFO 03-02 01:43:26 [logger.py:42] Received request cmpl-03c5ea87465042b0ba2accae09d59c1a-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:43:26 [async_llm.py:261] Added request cmpl-03c5ea87465042b0ba2accae09d59c1a-0.
INFO 03-02 01:43:27 [logger.py:42] Received request cmpl-b3e2fd7cba794af995f02020a1f66393-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:43:27 [async_llm.py:261] Added request cmpl-b3e2fd7cba794af995f02020a1f66393-0.
INFO 03-02 01:43:28 [loggers.py:116] Engine 000: Avg prompt throughput: 24.8 tokens/s, Avg generation throughput: 4.0 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.6%
INFO 03-02 01:43:28 [logger.py:42] Received request cmpl-0e63c1660dce4c11b2d58ffaee621e67-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:43:28 [async_llm.py:261] Added request cmpl-0e63c1660dce4c11b2d58ffaee621e67-0.
INFO 03-02 01:43:29 [logger.py:42] Received request cmpl-a25d3bcf1aee4eff90806eb725d0b997-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:43:29 [async_llm.py:261] Added request cmpl-a25d3bcf1aee4eff90806eb725d0b997-0.
INFO 03-02 01:43:30 [logger.py:42] Received request cmpl-a35611bd3c65487689c6cf5f09f94080-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:43:30 [async_llm.py:261] Added request cmpl-a35611bd3c65487689c6cf5f09f94080-0.
INFO 03-02 01:43:31 [logger.py:42] Received request cmpl-d2fc6e4fca5d434d9f93a89c77afbb18-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:43:31 [async_llm.py:261] Added request cmpl-d2fc6e4fca5d434d9f93a89c77afbb18-0.
INFO 03-02 01:43:32 [logger.py:42] Received request cmpl-f176be34107e47708c10bf77b9ad8ea0-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:43:32 [async_llm.py:261] Added request cmpl-f176be34107e47708c10bf77b9ad8ea0-0.
INFO 03-02 01:43:34 [logger.py:42] Received request cmpl-3f309908fd4142dcb7e1b1f8a7d8253b-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:43:34 [async_llm.py:261] Added request cmpl-3f309908fd4142dcb7e1b1f8a7d8253b-0.
INFO 03-02 01:43:35 [logger.py:42] Received request cmpl-d75232e08bc74a21a18ce4e8eb4ff8f9-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:43:35 [async_llm.py:261] Added request cmpl-d75232e08bc74a21a18ce4e8eb4ff8f9-0.
INFO 03-02 01:43:36 [logger.py:42] Received request cmpl-7586086caf25493d89844f642a155115-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:43:36 [async_llm.py:261] Added request cmpl-7586086caf25493d89844f642a155115-0.
INFO 03-02 01:43:37 [logger.py:42] Received request cmpl-6c9f00d5e7994cbc85886c1eda2700e9-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:43:37 [async_llm.py:261] Added request cmpl-6c9f00d5e7994cbc85886c1eda2700e9-0.
INFO 03-02 01:43:38 [loggers.py:116] Engine 000: Avg prompt throughput: 27.9 tokens/s, Avg generation throughput: 4.5 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.6%
INFO 03-02 01:43:38 [logger.py:42] Received request cmpl-9152d2d292f94aad8caaa06d97310627-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:43:38 [async_llm.py:261] Added request cmpl-9152d2d292f94aad8caaa06d97310627-0.
INFO 03-02 01:43:39 [logger.py:42] Received request cmpl-0b8b7e1221244f11b6573ddeb754a496-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:43:39 [async_llm.py:261] Added request cmpl-0b8b7e1221244f11b6573ddeb754a496-0.
INFO 03-02 01:43:41 [logger.py:42] Received request cmpl-ed24b990a90d4c3dbe3611c482330097-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:43:41 [async_llm.py:261] Added request cmpl-ed24b990a90d4c3dbe3611c482330097-0.
INFO 03-02 01:43:42 [logger.py:42] Received request cmpl-407480bbf912423a9d31a23dbc41a645-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:43:42 [async_llm.py:261] Added request cmpl-407480bbf912423a9d31a23dbc41a645-0.
INFO 03-02 01:43:43 [logger.py:42] Received request cmpl-15cc34a5174a4e6fb065097a8b448dc0-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:43:43 [async_llm.py:261] Added request cmpl-15cc34a5174a4e6fb065097a8b448dc0-0.
INFO 03-02 01:43:44 [logger.py:42] Received request cmpl-51cf36b6a5204297abd44905538c1078-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:43:44 [async_llm.py:261] Added request cmpl-51cf36b6a5204297abd44905538c1078-0.
INFO 03-02 01:43:45 [logger.py:42] Received request cmpl-9570dd5e996f4accb3d6ad00ae129f31-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:43:45 [async_llm.py:261] Added request cmpl-9570dd5e996f4accb3d6ad00ae129f31-0.
INFO 03-02 01:43:46 [logger.py:42] Received request cmpl-365aecd2b0c0461983abec5abec3e5c3-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:43:46 [async_llm.py:261] Added request cmpl-365aecd2b0c0461983abec5abec3e5c3-0.
INFO 03-02 01:43:47 [logger.py:42] Received request cmpl-929254c2d3eb4a968ce719a96233d41e-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:43:47 [async_llm.py:261] Added request cmpl-929254c2d3eb4a968ce719a96233d41e-0.
INFO 03-02 01:43:48 [loggers.py:116] Engine 000: Avg prompt throughput: 27.9 tokens/s, Avg generation throughput: 4.5 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.6%
INFO 03-02 01:43:49 [logger.py:42] Received request cmpl-677413d19fa146b99efba48fc67913ff-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:43:49 [async_llm.py:261] Added request cmpl-677413d19fa146b99efba48fc67913ff-0.
INFO 03-02 01:43:50 [logger.py:42] Received request cmpl-84780e68033c47348726254bf4e934fc-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:43:50 [async_llm.py:261] Added request cmpl-84780e68033c47348726254bf4e934fc-0.
INFO 03-02 01:43:51 [logger.py:42] Received request cmpl-94ee89edc45247ba8070a2f883ed7bb8-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:43:51 [async_llm.py:261] Added request cmpl-94ee89edc45247ba8070a2f883ed7bb8-0.
INFO 03-02 01:43:52 [logger.py:42] Received request cmpl-eaa0337e5bb345158172ea1e0afe9214-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:43:52 [async_llm.py:261] Added request cmpl-eaa0337e5bb345158172ea1e0afe9214-0.
INFO 03-02 01:43:53 [logger.py:42] Received request cmpl-bd177b7d85d44135b67ae192388f3caf-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:43:53 [async_llm.py:261] Added request cmpl-bd177b7d85d44135b67ae192388f3caf-0.
INFO 03-02 01:43:54 [logger.py:42] Received request cmpl-ae607a622e4842cb909011515cdb878c-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:43:54 [async_llm.py:261] Added request cmpl-ae607a622e4842cb909011515cdb878c-0.
INFO 03-02 01:43:56 [logger.py:42] Received request cmpl-7f268e22cb4642dda10a05673e8964ff-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:43:56 [async_llm.py:261] Added request cmpl-7f268e22cb4642dda10a05673e8964ff-0.
INFO 03-02 01:43:57 [logger.py:42] Received request cmpl-c3d7d28515324765934ae0481aec187e-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:43:57 [async_llm.py:261] Added request cmpl-c3d7d28515324765934ae0481aec187e-0.
INFO 03-02 01:43:58 [loggers.py:116] Engine 000: Avg prompt throughput: 24.8 tokens/s, Avg generation throughput: 4.0 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.6%
INFO 03-02 01:43:58 [logger.py:42] Received request cmpl-e8c5ce0d23df41609b088330ab7c4ece-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:43:58 [async_llm.py:261] Added request cmpl-e8c5ce0d23df41609b088330ab7c4ece-0.
INFO 03-02 01:43:59 [logger.py:42] Received request cmpl-f6ef7fd7654e438485c238512f86bedc-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:43:59 [async_llm.py:261] Added request cmpl-f6ef7fd7654e438485c238512f86bedc-0.
INFO 03-02 01:44:00 [logger.py:42] Received request cmpl-f7389bc1c69241fc91fcb6823f56fbff-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:44:00 [async_llm.py:261] Added request cmpl-f7389bc1c69241fc91fcb6823f56fbff-0.
INFO 03-02 01:44:01 [logger.py:42] Received request cmpl-d33cf64b199e4f1fb12217bde21e31a0-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:44:01 [async_llm.py:261] Added request cmpl-d33cf64b199e4f1fb12217bde21e31a0-0.
INFO 03-02 01:44:02 [logger.py:42] Received request cmpl-3471e5bd92854ec29c8195acb25a8e3b-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:44:02 [async_llm.py:261] Added request cmpl-3471e5bd92854ec29c8195acb25a8e3b-0.
INFO 03-02 01:44:04 [logger.py:42] Received request cmpl-30ba03cd57aa4a9d9ad9ce03dc7e4fc1-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:44:04 [async_llm.py:261] Added request cmpl-30ba03cd57aa4a9d9ad9ce03dc7e4fc1-0.
INFO 03-02 01:44:05 [logger.py:42] Received request cmpl-28e01bab3e5a4934a2be63f35db8b07d-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:44:05 [async_llm.py:261] Added request cmpl-28e01bab3e5a4934a2be63f35db8b07d-0.
INFO 03-02 01:44:06 [logger.py:42] Received request cmpl-07f92dc09be8479a9670a4f99d7a9465-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:44:06 [async_llm.py:261] Added request cmpl-07f92dc09be8479a9670a4f99d7a9465-0.
INFO 03-02 01:44:07 [logger.py:42] Received request cmpl-ed1ce1587f7a40bf9d29a446e6219e5c-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:44:07 [async_llm.py:261] Added request cmpl-ed1ce1587f7a40bf9d29a446e6219e5c-0.
INFO 03-02 01:44:08 [loggers.py:116] Engine 000: Avg prompt throughput: 27.9 tokens/s, Avg generation throughput: 4.5 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.6%
INFO 03-02 01:44:08 [logger.py:42] Received request cmpl-0d2984f6093b4c8dae1800634d23ab33-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:44:08 [async_llm.py:261] Added request cmpl-0d2984f6093b4c8dae1800634d23ab33-0.
INFO 03-02 01:44:09 [logger.py:42] Received request cmpl-1f5d7d398121491d817cf4416e59f9eb-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:44:09 [async_llm.py:261] Added request cmpl-1f5d7d398121491d817cf4416e59f9eb-0.
INFO 03-02 01:44:11 [logger.py:42] Received request cmpl-d293a6106db8462ca9c91fefe826a9c9-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:44:11 [async_llm.py:261] Added request cmpl-d293a6106db8462ca9c91fefe826a9c9-0.
INFO 03-02 01:44:12 [logger.py:42] Received request cmpl-d4767a39523b45cea52f18eea5861139-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:44:12 [async_llm.py:261] Added request cmpl-d4767a39523b45cea52f18eea5861139-0.
INFO 03-02 01:44:13 [logger.py:42] Received request cmpl-20fae51647cb4113ae3b45fad18acf51-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:44:13 [async_llm.py:261] Added request cmpl-20fae51647cb4113ae3b45fad18acf51-0.
INFO 03-02 01:44:14 [logger.py:42] Received request cmpl-70f138140df34c23bbd5d78b4042e1e9-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:44:14 [async_llm.py:261] Added request cmpl-70f138140df34c23bbd5d78b4042e1e9-0.
INFO 03-02 01:44:15 [logger.py:42] Received request cmpl-57787d144434426bb8bc643345820914-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:44:15 [async_llm.py:261] Added request cmpl-57787d144434426bb8bc643345820914-0.
INFO 03-02 01:44:16 [logger.py:42] Received request cmpl-d90eac26598f4f67b458c875373ac6f2-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:44:16 [async_llm.py:261] Added request cmpl-d90eac26598f4f67b458c875373ac6f2-0.
INFO 03-02 01:44:17 [logger.py:42] Received request cmpl-1786d5a77d804bcda984e6c00adca8da-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:44:17 [async_llm.py:261] Added request cmpl-1786d5a77d804bcda984e6c00adca8da-0.
INFO 03-02 01:44:18 [loggers.py:116] Engine 000: Avg prompt throughput: 27.9 tokens/s, Avg generation throughput: 4.5 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.6%
INFO 03-02 01:44:19 [logger.py:42] Received request cmpl-c858006fc8a34500a272f2d0a8c72d18-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:44:19 [async_llm.py:261] Added request cmpl-c858006fc8a34500a272f2d0a8c72d18-0.
INFO 03-02 01:44:20 [logger.py:42] Received request cmpl-d8945f08a5314c45b28998f215398a25-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:44:20 [async_llm.py:261] Added request cmpl-d8945f08a5314c45b28998f215398a25-0.
INFO 03-02 01:44:21 [logger.py:42] Received request cmpl-fa088e58259f456f8bf5e90eb2c5ba23-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:44:21 [async_llm.py:261] Added request cmpl-fa088e58259f456f8bf5e90eb2c5ba23-0.
INFO 03-02 01:44:22 [logger.py:42] Received request cmpl-08ce7763e5904822a0813c12b16046fd-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:44:22 [async_llm.py:261] Added request cmpl-08ce7763e5904822a0813c12b16046fd-0.
INFO 03-02 01:44:23 [logger.py:42] Received request cmpl-ec355a2cc5504abca442dfb5082015f1-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:44:23 [async_llm.py:261] Added request cmpl-ec355a2cc5504abca442dfb5082015f1-0.
INFO 03-02 01:44:24 [logger.py:42] Received request cmpl-d2a4a382d48d4df796e078960f4a8d44-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:44:24 [async_llm.py:261] Added request cmpl-d2a4a382d48d4df796e078960f4a8d44-0.
INFO 03-02 01:44:26 [logger.py:42] Received request cmpl-b675053a3c3b4275a47d8227d64d14a3-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:44:26 [async_llm.py:261] Added request cmpl-b675053a3c3b4275a47d8227d64d14a3-0.
INFO 03-02 01:44:27 [logger.py:42] Received request cmpl-386babb817814bb19ef1b5f468ee4d26-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:44:27 [async_llm.py:261] Added request cmpl-386babb817814bb19ef1b5f468ee4d26-0.
INFO 03-02 01:44:28 [loggers.py:116] Engine 000: Avg prompt throughput: 24.8 tokens/s, Avg generation throughput: 4.0 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.6%
INFO 03-02 01:44:28 [logger.py:42] Received request cmpl-302227ca412340f4856a1b227a0152e7-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:44:28 [async_llm.py:261] Added request cmpl-302227ca412340f4856a1b227a0152e7-0.
INFO 03-02 01:44:29 [logger.py:42] Received request cmpl-ac38ed429c3b42ed806845c68cbeac54-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:44:29 [async_llm.py:261] Added request cmpl-ac38ed429c3b42ed806845c68cbeac54-0.
INFO 03-02 01:44:30 [logger.py:42] Received request cmpl-1a2fd68139a24dc98f41e5a5cc056d94-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:44:30 [async_llm.py:261] Added request cmpl-1a2fd68139a24dc98f41e5a5cc056d94-0.
INFO 03-02 01:44:31 [logger.py:42] Received request cmpl-6149efb866654f128d3b951e91161199-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:44:31 [async_llm.py:261] Added request cmpl-6149efb866654f128d3b951e91161199-0.
INFO 03-02 01:44:32 [logger.py:42] Received request cmpl-d6db14afeda54a3585cbb2399d387c44-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:44:32 [async_llm.py:261] Added request cmpl-d6db14afeda54a3585cbb2399d387c44-0.
INFO 03-02 01:44:34 [logger.py:42] Received request cmpl-063a60e590154a559225c30dc113c024-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:44:34 [async_llm.py:261] Added request cmpl-063a60e590154a559225c30dc113c024-0.
INFO 03-02 01:44:35 [logger.py:42] Received request cmpl-6f40e730d5a647e18d1f92625a7d4537-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:44:35 [async_llm.py:261] Added request cmpl-6f40e730d5a647e18d1f92625a7d4537-0.
INFO 03-02 01:44:36 [logger.py:42] Received request cmpl-16662d9040024b80bcc5cad0a1217cf7-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:44:36 [async_llm.py:261] Added request cmpl-16662d9040024b80bcc5cad0a1217cf7-0.
INFO 03-02 01:44:37 [logger.py:42] Received request cmpl-5ccf7506b0dd46b296456154b07db82e-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:44:37 [async_llm.py:261] Added request cmpl-5ccf7506b0dd46b296456154b07db82e-0.
INFO 03-02 01:44:38 [loggers.py:116] Engine 000: Avg prompt throughput: 27.9 tokens/s, Avg generation throughput: 4.5 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.6%
INFO 03-02 01:44:38 [logger.py:42] Received request cmpl-79f79e5716b443c086260eaeb26d2945-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:44:38 [async_llm.py:261] Added request cmpl-79f79e5716b443c086260eaeb26d2945-0.
INFO 03-02 01:44:39 [logger.py:42] Received request cmpl-4a78ac440e11473cb0cdfddbb9788f17-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:44:39 [async_llm.py:261] Added request cmpl-4a78ac440e11473cb0cdfddbb9788f17-0.
INFO 03-02 01:44:41 [logger.py:42] Received request cmpl-da41026e23fb400a8831c1113908d590-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:44:41 [async_llm.py:261] Added request cmpl-da41026e23fb400a8831c1113908d590-0.
INFO 03-02 01:44:42 [logger.py:42] Received request cmpl-9f487a512feb455a842cdad044e19ed0-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:44:42 [async_llm.py:261] Added request cmpl-9f487a512feb455a842cdad044e19ed0-0.
INFO 03-02 01:44:43 [logger.py:42] Received request cmpl-5770a61ddf4042f1bbc912a9eb694a29-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:44:43 [async_llm.py:261] Added request cmpl-5770a61ddf4042f1bbc912a9eb694a29-0.
INFO 03-02 01:44:44 [logger.py:42] Received request cmpl-883e189c97fc478e8bd5117b6e9e404e-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:44:44 [async_llm.py:261] Added request cmpl-883e189c97fc478e8bd5117b6e9e404e-0.
INFO 03-02 01:44:45 [logger.py:42] Received request cmpl-6f2db18c62dd42d5a43154b75e97c249-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:44:45 [async_llm.py:261] Added request cmpl-6f2db18c62dd42d5a43154b75e97c249-0.
INFO 03-02 01:44:46 [logger.py:42] Received request cmpl-94d66d3b671e445aa045a36c2f97352f-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:44:46 [async_llm.py:261] Added request cmpl-94d66d3b671e445aa045a36c2f97352f-0.
INFO 03-02 01:44:47 [logger.py:42] Received request cmpl-1c8250ee23b34e0080bf02c75668675b-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:44:47 [async_llm.py:261] Added request cmpl-1c8250ee23b34e0080bf02c75668675b-0.
INFO 03-02 01:44:48 [loggers.py:116] Engine 000: Avg prompt throughput: 27.9 tokens/s, Avg generation throughput: 4.5 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.6%
INFO 03-02 01:44:49 [logger.py:42] Received request cmpl-5b6caf8632b44b659d113d7964c39997-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:44:49 [async_llm.py:261] Added request cmpl-5b6caf8632b44b659d113d7964c39997-0.
INFO 03-02 01:44:50 [logger.py:42] Received request cmpl-e5aeccbd1e724b2cb2d30400779b826a-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:44:50 [async_llm.py:261] Added request cmpl-e5aeccbd1e724b2cb2d30400779b826a-0.
INFO 03-02 01:44:51 [logger.py:42] Received request cmpl-b02f575c32494276aab92c0542aa8c68-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:44:51 [async_llm.py:261] Added request cmpl-b02f575c32494276aab92c0542aa8c68-0.
INFO 03-02 01:44:52 [logger.py:42] Received request cmpl-e65efaff6b8c48088714f22f7422d26f-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:44:52 [async_llm.py:261] Added request cmpl-e65efaff6b8c48088714f22f7422d26f-0.
INFO 03-02 01:44:53 [logger.py:42] Received request cmpl-ecfd1d70a57c4847afbb27b975a8dd53-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:44:53 [async_llm.py:261] Added request cmpl-ecfd1d70a57c4847afbb27b975a8dd53-0.
INFO 03-02 01:44:54 [logger.py:42] Received request cmpl-66d5ba851cc84b3f87c7a08f07fe7ab0-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:44:54 [async_llm.py:261] Added request cmpl-66d5ba851cc84b3f87c7a08f07fe7ab0-0.
INFO 03-02 01:44:56 [logger.py:42] Received request cmpl-23e86e1faa284d67892408af56646ad5-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:44:56 [async_llm.py:261] Added request cmpl-23e86e1faa284d67892408af56646ad5-0.
INFO 03-02 01:44:57 [logger.py:42] Received request cmpl-74f5422a215748f38847d616345c39a2-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:44:57 [async_llm.py:261] Added request cmpl-74f5422a215748f38847d616345c39a2-0.
INFO 03-02 01:44:58 [loggers.py:116] Engine 000: Avg prompt throughput: 24.8 tokens/s, Avg generation throughput: 4.0 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.6%
INFO 03-02 01:44:58 [logger.py:42] Received request cmpl-6f8ba525e53343408bdc074cec445729-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:44:58 [async_llm.py:261] Added request cmpl-6f8ba525e53343408bdc074cec445729-0.
INFO 03-02 01:44:59 [logger.py:42] Received request cmpl-ccf1d9832c9345208e9b2a78d6b59b2c-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:44:59 [async_llm.py:261] Added request cmpl-ccf1d9832c9345208e9b2a78d6b59b2c-0.
INFO 03-02 01:45:00 [logger.py:42] Received request cmpl-bc18c7289fd040ad8ff2fd0498ced0ee-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:45:00 [async_llm.py:261] Added request cmpl-bc18c7289fd040ad8ff2fd0498ced0ee-0.
INFO 03-02 01:45:01 [logger.py:42] Received request cmpl-921b59e5ecbc4939be65d26dabe8be36-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:45:01 [async_llm.py:261] Added request cmpl-921b59e5ecbc4939be65d26dabe8be36-0.
INFO 03-02 01:45:02 [logger.py:42] Received request cmpl-d64740e0ac944513831137c890959e22-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:45:02 [async_llm.py:261] Added request cmpl-d64740e0ac944513831137c890959e22-0.
INFO 03-02 01:45:04 [logger.py:42] Received request cmpl-529e251c096648c8a90fdf8b694fbb30-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:45:04 [async_llm.py:261] Added request cmpl-529e251c096648c8a90fdf8b694fbb30-0.
INFO 03-02 01:45:05 [logger.py:42] Received request cmpl-e6efab79c5b140c9842d730c8484e1b0-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:45:05 [async_llm.py:261] Added request cmpl-e6efab79c5b140c9842d730c8484e1b0-0.
INFO 03-02 01:45:06 [logger.py:42] Received request cmpl-174187f639284519a214ab4e603936e7-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:45:06 [async_llm.py:261] Added request cmpl-174187f639284519a214ab4e603936e7-0.
INFO 03-02 01:45:07 [logger.py:42] Received request cmpl-fdc4556cd2124e26a6f0d76aa43f6f4b-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:45:07 [async_llm.py:261] Added request cmpl-fdc4556cd2124e26a6f0d76aa43f6f4b-0.
INFO 03-02 01:45:08 [loggers.py:116] Engine 000: Avg prompt throughput: 27.9 tokens/s, Avg generation throughput: 4.5 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.6%
INFO 03-02 01:45:08 [logger.py:42] Received request cmpl-b5111112ec6e40e3b19e8e21f1e22221-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:45:08 [async_llm.py:261] Added request cmpl-b5111112ec6e40e3b19e8e21f1e22221-0.
INFO 03-02 01:45:09 [logger.py:42] Received request cmpl-da107bb8ae454311bfdf53170163f936-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:45:09 [async_llm.py:261] Added request cmpl-da107bb8ae454311bfdf53170163f936-0.
INFO 03-02 01:45:11 [logger.py:42] Received request cmpl-9efe3c705ef8437eba5d2b69eea6ceea-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:45:11 [async_llm.py:261] Added request cmpl-9efe3c705ef8437eba5d2b69eea6ceea-0.
INFO 03-02 01:45:12 [logger.py:42] Received request cmpl-d1c99ecab9ce4a099f9539e77f47ab3a-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:45:12 [async_llm.py:261] Added request cmpl-d1c99ecab9ce4a099f9539e77f47ab3a-0.
INFO 03-02 01:45:13 [logger.py:42] Received request cmpl-4d317294545746b68995ce8869e096f6-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:45:13 [async_llm.py:261] Added request cmpl-4d317294545746b68995ce8869e096f6-0.
INFO 03-02 01:45:14 [logger.py:42] Received request cmpl-c82e587e7a0c42f7b5e738a98724baaa-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:45:14 [async_llm.py:261] Added request cmpl-c82e587e7a0c42f7b5e738a98724baaa-0.
INFO 03-02 01:45:15 [logger.py:42] Received request cmpl-e20bfbc7a61843158ad35ff335e42063-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:45:15 [async_llm.py:261] Added request cmpl-e20bfbc7a61843158ad35ff335e42063-0.
INFO 03-02 01:45:16 [logger.py:42] Received request cmpl-f7c5ba074c154e3e9b31693cf70a241d-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:45:16 [async_llm.py:261] Added request cmpl-f7c5ba074c154e3e9b31693cf70a241d-0.
INFO 03-02 01:45:17 [logger.py:42] Received request cmpl-7a74d64ec4d643be92ddc09351de8301-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:45:17 [async_llm.py:261] Added request cmpl-7a74d64ec4d643be92ddc09351de8301-0.
INFO 03-02 01:45:18 [loggers.py:116] Engine 000: Avg prompt throughput: 27.9 tokens/s, Avg generation throughput: 4.5 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.6%
INFO 03-02 01:45:19 [logger.py:42] Received request cmpl-97e43fadd7eb49b882466f3dfed39b6d-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:45:19 [async_llm.py:261] Added request cmpl-97e43fadd7eb49b882466f3dfed39b6d-0.
INFO 03-02 01:45:20 [logger.py:42] Received request cmpl-98578f9f9940418283d181b9d28d02ae-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:45:20 [async_llm.py:261] Added request cmpl-98578f9f9940418283d181b9d28d02ae-0.
INFO 03-02 01:45:21 [logger.py:42] Received request cmpl-1eef9e07c68a463cacc14453e82b0d24-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:45:21 [async_llm.py:261] Added request cmpl-1eef9e07c68a463cacc14453e82b0d24-0.
INFO 03-02 01:45:22 [logger.py:42] Received request cmpl-54b38381d7454e2183a7239339e2efa8-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:45:22 [async_llm.py:261] Added request cmpl-54b38381d7454e2183a7239339e2efa8-0.
INFO 03-02 01:45:23 [logger.py:42] Received request cmpl-fe48cea99ae94491ab45fc6f6ef30752-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:45:23 [async_llm.py:261] Added request cmpl-fe48cea99ae94491ab45fc6f6ef30752-0.
INFO 03-02 01:45:24 [logger.py:42] Received request cmpl-ffe278f5c7f74249a045a0dd1ff8867d-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:45:24 [async_llm.py:261] Added request cmpl-ffe278f5c7f74249a045a0dd1ff8867d-0.
INFO 03-02 01:45:26 [logger.py:42] Received request cmpl-9027c38b743a4871949e29ed55ef98e2-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:45:26 [async_llm.py:261] Added request cmpl-9027c38b743a4871949e29ed55ef98e2-0.
INFO 03-02 01:45:27 [logger.py:42] Received request cmpl-38bddf708b62462693b961fb718c4990-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:45:27 [async_llm.py:261] Added request cmpl-38bddf708b62462693b961fb718c4990-0.
INFO 03-02 01:45:28 [loggers.py:116] Engine 000: Avg prompt throughput: 24.8 tokens/s, Avg generation throughput: 4.0 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.6%
INFO 03-02 01:45:28 [logger.py:42] Received request cmpl-eadf5cac6a7542ca9682d874db6a5cf8-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:45:28 [async_llm.py:261] Added request cmpl-eadf5cac6a7542ca9682d874db6a5cf8-0.
INFO 03-02 01:45:29 [logger.py:42] Received request cmpl-994046140ad34a77bdfeb2ed376f9b36-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:45:29 [async_llm.py:261] Added request cmpl-994046140ad34a77bdfeb2ed376f9b36-0.
INFO 03-02 01:45:30 [logger.py:42] Received request cmpl-5785771c336c49f7be928ac8f203b607-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:45:30 [async_llm.py:261] Added request cmpl-5785771c336c49f7be928ac8f203b607-0.
INFO 03-02 01:45:31 [logger.py:42] Received request cmpl-b90e2e4be95443529d31612f1eea2efa-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:45:31 [async_llm.py:261] Added request cmpl-b90e2e4be95443529d31612f1eea2efa-0.
INFO 03-02 01:45:32 [logger.py:42] Received request cmpl-52940a22f6854fc99c8a3c335a5579a3-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:45:32 [async_llm.py:261] Added request cmpl-52940a22f6854fc99c8a3c335a5579a3-0.
INFO 03-02 01:45:34 [logger.py:42] Received request cmpl-f1951e7edd4a43b39b35f8c521f4cb1e-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:45:34 [async_llm.py:261] Added request cmpl-f1951e7edd4a43b39b35f8c521f4cb1e-0.
INFO 03-02 01:45:35 [logger.py:42] Received request cmpl-bdbce8a27259488382418e5fe83f1798-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:45:35 [async_llm.py:261] Added request cmpl-bdbce8a27259488382418e5fe83f1798-0.
INFO 03-02 01:45:36 [logger.py:42] Received request cmpl-95001e18f26b43788a7d8ef438941647-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:45:36 [async_llm.py:261] Added request cmpl-95001e18f26b43788a7d8ef438941647-0.
INFO 03-02 01:45:37 [logger.py:42] Received request cmpl-7047cecaf7254fd8811ee99962b45ffe-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:45:37 [async_llm.py:261] Added request cmpl-7047cecaf7254fd8811ee99962b45ffe-0.
INFO 03-02 01:45:38 [loggers.py:116] Engine 000: Avg prompt throughput: 27.9 tokens/s, Avg generation throughput: 4.5 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.6%
INFO 03-02 01:45:38 [logger.py:42] Received request cmpl-4d7d8253e3dd407a9581ab42079664b8-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:45:38 [async_llm.py:261] Added request cmpl-4d7d8253e3dd407a9581ab42079664b8-0.
INFO 03-02 01:45:39 [logger.py:42] Received request cmpl-8c6b4f533b6842b480d8884707b99f66-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:45:39 [async_llm.py:261] Added request cmpl-8c6b4f533b6842b480d8884707b99f66-0.
INFO 03-02 01:45:41 [logger.py:42] Received request cmpl-818b5afd4978466e995e122fd969e041-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:45:41 [async_llm.py:261] Added request cmpl-818b5afd4978466e995e122fd969e041-0.
INFO 03-02 01:45:42 [logger.py:42] Received request cmpl-b8697cf3d0a442f4ad3339f41afeb2ee-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:45:42 [async_llm.py:261] Added request cmpl-b8697cf3d0a442f4ad3339f41afeb2ee-0.
INFO 03-02 01:45:43 [logger.py:42] Received request cmpl-f783aa46c35f43c2a7936b223612b8ab-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:45:43 [async_llm.py:261] Added request cmpl-f783aa46c35f43c2a7936b223612b8ab-0.
INFO 03-02 01:45:44 [logger.py:42] Received request cmpl-2b866bf948f4426cbf13f3337bcb8964-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:45:44 [async_llm.py:261] Added request cmpl-2b866bf948f4426cbf13f3337bcb8964-0.
INFO 03-02 01:45:45 [logger.py:42] Received request cmpl-d5c19f3cfa9d47b0890c5f017491f764-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:45:45 [async_llm.py:261] Added request cmpl-d5c19f3cfa9d47b0890c5f017491f764-0.
INFO 03-02 01:45:46 [logger.py:42] Received request cmpl-45bb6ce9c11b47a6a500d947364c60c2-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:45:46 [async_llm.py:261] Added request cmpl-45bb6ce9c11b47a6a500d947364c60c2-0.
INFO 03-02 01:45:47 [logger.py:42] Received request cmpl-1fdd76768e604c3ab062115abc7c24af-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:45:47 [async_llm.py:261] Added request cmpl-1fdd76768e604c3ab062115abc7c24af-0.
INFO 03-02 01:45:48 [loggers.py:116] Engine 000: Avg prompt throughput: 27.9 tokens/s, Avg generation throughput: 4.5 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.6%
INFO 03-02 01:45:49 [logger.py:42] Received request cmpl-a2b0257d309b419b98aa7efbbedc37e7-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:45:49 [async_llm.py:261] Added request cmpl-a2b0257d309b419b98aa7efbbedc37e7-0.
INFO 03-02 01:45:50 [logger.py:42] Received request cmpl-d687337438c4425aaf9cada0e1282a61-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:45:50 [async_llm.py:261] Added request cmpl-d687337438c4425aaf9cada0e1282a61-0.
INFO 03-02 01:45:51 [logger.py:42] Received request cmpl-d0a22e864c424f0ba799dfcb28af1851-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:45:51 [async_llm.py:261] Added request cmpl-d0a22e864c424f0ba799dfcb28af1851-0.
INFO 03-02 01:45:52 [logger.py:42] Received request cmpl-f1792609b7584a2a9223317f5743d608-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:45:52 [async_llm.py:261] Added request cmpl-f1792609b7584a2a9223317f5743d608-0.
INFO 03-02 01:45:53 [logger.py:42] Received request cmpl-6c6cdf47c39947d78f68fe3ce7bd2c78-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:45:53 [async_llm.py:261] Added request cmpl-6c6cdf47c39947d78f68fe3ce7bd2c78-0.
INFO 03-02 01:45:54 [logger.py:42] Received request cmpl-588df6fe4a76428da1325809ca3b4d05-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:45:54 [async_llm.py:261] Added request cmpl-588df6fe4a76428da1325809ca3b4d05-0.
INFO 03-02 01:45:56 [logger.py:42] Received request cmpl-c1587e1699f3496bac8d3a0c9416a1a2-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:45:56 [async_llm.py:261] Added request cmpl-c1587e1699f3496bac8d3a0c9416a1a2-0.
INFO 03-02 01:45:57 [logger.py:42] Received request cmpl-9475d3646e0747cd80844e80c64eda74-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:45:57 [async_llm.py:261] Added request cmpl-9475d3646e0747cd80844e80c64eda74-0.
INFO 03-02 01:45:58 [loggers.py:116] Engine 000: Avg prompt throughput: 24.8 tokens/s, Avg generation throughput: 4.0 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.6%
INFO 03-02 01:45:58 [logger.py:42] Received request cmpl-1a4ad8b249474bc0b0a7cf53043359f1-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:45:58 [async_llm.py:261] Added request cmpl-1a4ad8b249474bc0b0a7cf53043359f1-0.
INFO 03-02 01:45:59 [logger.py:42] Received request cmpl-bd81bc0f39264364b8a7f98c826935f5-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:45:59 [async_llm.py:261] Added request cmpl-bd81bc0f39264364b8a7f98c826935f5-0.
INFO 03-02 01:46:00 [logger.py:42] Received request cmpl-8aca2636e74e4327b17add850e708104-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:46:00 [async_llm.py:261] Added request cmpl-8aca2636e74e4327b17add850e708104-0.
INFO 03-02 01:46:01 [logger.py:42] Received request cmpl-6089c7733a2341ec958d246f960ebc7a-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:46:01 [async_llm.py:261] Added request cmpl-6089c7733a2341ec958d246f960ebc7a-0.
INFO 03-02 01:46:02 [logger.py:42] Received request cmpl-c032f37b3d32463e839dafdbf3c51805-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:46:02 [async_llm.py:261] Added request cmpl-c032f37b3d32463e839dafdbf3c51805-0.
INFO 03-02 01:46:04 [logger.py:42] Received request cmpl-6ca625a6eb36423eb8a761409d646b08-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:46:04 [async_llm.py:261] Added request cmpl-6ca625a6eb36423eb8a761409d646b08-0.
INFO 03-02 01:46:05 [logger.py:42] Received request cmpl-7debbb418deb4d9899031dfbeab326a7-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:46:05 [async_llm.py:261] Added request cmpl-7debbb418deb4d9899031dfbeab326a7-0.
INFO 03-02 01:46:06 [logger.py:42] Received request cmpl-d884d4e620de46b38b9fc99fbcbfbc77-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:46:06 [async_llm.py:261] Added request cmpl-d884d4e620de46b38b9fc99fbcbfbc77-0.
INFO 03-02 01:46:07 [logger.py:42] Received request cmpl-17146b74e8a54b8ea61c29f3eeafa3fa-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:46:07 [async_llm.py:261] Added request cmpl-17146b74e8a54b8ea61c29f3eeafa3fa-0.
INFO 03-02 01:46:08 [loggers.py:116] Engine 000: Avg prompt throughput: 27.9 tokens/s, Avg generation throughput: 4.5 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.6%
INFO 03-02 01:46:08 [logger.py:42] Received request cmpl-229710724e1944f9b12dd418729bf101-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:46:08 [async_llm.py:261] Added request cmpl-229710724e1944f9b12dd418729bf101-0.
INFO 03-02 01:46:09 [logger.py:42] Received request cmpl-ccd3e246b033429fb97887547de60cd3-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:46:09 [async_llm.py:261] Added request cmpl-ccd3e246b033429fb97887547de60cd3-0.
INFO 03-02 01:46:11 [logger.py:42] Received request cmpl-5ca470d4b9174ecfa0dafdf05fc8c4e2-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:46:11 [async_llm.py:261] Added request cmpl-5ca470d4b9174ecfa0dafdf05fc8c4e2-0.
INFO 03-02 01:46:12 [logger.py:42] Received request cmpl-fc4ca357caa64fcc8c329804bef6af90-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:46:12 [async_llm.py:261] Added request cmpl-fc4ca357caa64fcc8c329804bef6af90-0.
INFO 03-02 01:46:13 [logger.py:42] Received request cmpl-695c5b02df4a4f05b75f1087d87f8d36-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:46:13 [async_llm.py:261] Added request cmpl-695c5b02df4a4f05b75f1087d87f8d36-0.
INFO 03-02 01:46:14 [logger.py:42] Received request cmpl-2d30fe97b2f24a6f8cac5761c34956ed-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:46:14 [async_llm.py:261] Added request cmpl-2d30fe97b2f24a6f8cac5761c34956ed-0.
INFO 03-02 01:46:15 [logger.py:42] Received request cmpl-5ce389d5cc804d0790f9034a4081a278-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:46:15 [async_llm.py:261] Added request cmpl-5ce389d5cc804d0790f9034a4081a278-0.
INFO 03-02 01:46:16 [logger.py:42] Received request cmpl-9f7b4631d49e4db48324e898c9211b6f-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:46:16 [async_llm.py:261] Added request cmpl-9f7b4631d49e4db48324e898c9211b6f-0.
INFO 03-02 01:46:17 [logger.py:42] Received request cmpl-bd2894dd7a5e4454bfea09d6f5debd41-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:46:17 [async_llm.py:261] Added request cmpl-bd2894dd7a5e4454bfea09d6f5debd41-0.
INFO 03-02 01:46:18 [loggers.py:116] Engine 000: Avg prompt throughput: 27.9 tokens/s, Avg generation throughput: 4.5 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.6%
INFO 03-02 01:46:19 [logger.py:42] Received request cmpl-2af527f15ef74533a69de56a3d8f89a0-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:46:19 [async_llm.py:261] Added request cmpl-2af527f15ef74533a69de56a3d8f89a0-0.
INFO 03-02 01:46:20 [logger.py:42] Received request cmpl-1c710e1fcb794520b4a923305d015f79-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:46:20 [async_llm.py:261] Added request cmpl-1c710e1fcb794520b4a923305d015f79-0.
INFO 03-02 01:46:21 [logger.py:42] Received request cmpl-cc6cad574e77466d8e32ab0c2c76d51f-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:46:21 [async_llm.py:261] Added request cmpl-cc6cad574e77466d8e32ab0c2c76d51f-0.
INFO 03-02 01:46:22 [logger.py:42] Received request cmpl-57c90b8f31ca42fb99187997d936d28a-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:46:22 [async_llm.py:261] Added request cmpl-57c90b8f31ca42fb99187997d936d28a-0.
INFO 03-02 01:46:23 [logger.py:42] Received request cmpl-56c4857c905f4e76ad45669df2faeff7-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:46:23 [async_llm.py:261] Added request cmpl-56c4857c905f4e76ad45669df2faeff7-0.
INFO 03-02 01:46:24 [logger.py:42] Received request cmpl-c3559ee5af7f43e59dae2e47751b6eab-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:46:24 [async_llm.py:261] Added request cmpl-c3559ee5af7f43e59dae2e47751b6eab-0.
INFO 03-02 01:46:26 [logger.py:42] Received request cmpl-1f6c192320774466903d6516356c2db2-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:46:26 [async_llm.py:261] Added request cmpl-1f6c192320774466903d6516356c2db2-0.
INFO 03-02 01:46:27 [logger.py:42] Received request cmpl-1ffe076584684bf89ebaf69c292b0a6a-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:46:27 [async_llm.py:261] Added request cmpl-1ffe076584684bf89ebaf69c292b0a6a-0.
INFO 03-02 01:46:28 [loggers.py:116] Engine 000: Avg prompt throughput: 24.8 tokens/s, Avg generation throughput: 4.0 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.6%
INFO 03-02 01:46:28 [logger.py:42] Received request cmpl-99e3cc38f9f849c5a637b90e9c6c94d6-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:46:28 [async_llm.py:261] Added request cmpl-99e3cc38f9f849c5a637b90e9c6c94d6-0.
INFO 03-02 01:46:29 [logger.py:42] Received request cmpl-87ea85e8c4bc40c5af2f86be750b3e3e-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:46:29 [async_llm.py:261] Added request cmpl-87ea85e8c4bc40c5af2f86be750b3e3e-0.
INFO 03-02 01:46:30 [logger.py:42] Received request cmpl-bc3c82f6ba6c47d59d6fe23b30ca776c-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:46:30 [async_llm.py:261] Added request cmpl-bc3c82f6ba6c47d59d6fe23b30ca776c-0.
INFO 03-02 01:46:31 [logger.py:42] Received request cmpl-8fac1620373b4938b55ffdd11e0c00dc-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:46:31 [async_llm.py:261] Added request cmpl-8fac1620373b4938b55ffdd11e0c00dc-0.
INFO 03-02 01:46:32 [logger.py:42] Received request cmpl-a8c2dacfcb964edab98aabd00d03dd26-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:46:32 [async_llm.py:261] Added request cmpl-a8c2dacfcb964edab98aabd00d03dd26-0.
INFO 03-02 01:46:34 [logger.py:42] Received request cmpl-406a3dce57bf4a19ab504ce98351dcf5-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:46:34 [async_llm.py:261] Added request cmpl-406a3dce57bf4a19ab504ce98351dcf5-0.
INFO 03-02 01:46:35 [logger.py:42] Received request cmpl-16a837f6c37f4013bf5ca6a2d3f6b0cb-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:46:35 [async_llm.py:261] Added request cmpl-16a837f6c37f4013bf5ca6a2d3f6b0cb-0.
INFO 03-02 01:46:36 [logger.py:42] Received request cmpl-77ee621c1f7145e9bbc208c585328351-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:46:36 [async_llm.py:261] Added request cmpl-77ee621c1f7145e9bbc208c585328351-0.
INFO 03-02 01:46:37 [logger.py:42] Received request cmpl-e5098e079e5943e69e6bd27e798b2525-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:46:37 [async_llm.py:261] Added request cmpl-e5098e079e5943e69e6bd27e798b2525-0.
INFO 03-02 01:46:38 [loggers.py:116] Engine 000: Avg prompt throughput: 27.9 tokens/s, Avg generation throughput: 4.5 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.6%
INFO 03-02 01:46:38 [logger.py:42] Received request cmpl-8e7e10f3cb3543cd9d9f337568ee17fa-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:46:38 [async_llm.py:261] Added request cmpl-8e7e10f3cb3543cd9d9f337568ee17fa-0.
INFO 03-02 01:46:39 [logger.py:42] Received request cmpl-c92941611f6a43bdbac047442c4ec13f-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:46:39 [async_llm.py:261] Added request cmpl-c92941611f6a43bdbac047442c4ec13f-0.
INFO 03-02 01:46:41 [logger.py:42] Received request cmpl-2b6d82aabea84924ae9bd656f4fb4ff9-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:46:41 [async_llm.py:261] Added request cmpl-2b6d82aabea84924ae9bd656f4fb4ff9-0.
INFO 03-02 01:46:42 [logger.py:42] Received request cmpl-af7108b2a2bd4a528fe0cfa19ae14784-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:46:42 [async_llm.py:261] Added request cmpl-af7108b2a2bd4a528fe0cfa19ae14784-0.
INFO 03-02 01:46:43 [logger.py:42] Received request cmpl-82e918d0c0a34696a6ecf4efe0791ee1-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:46:43 [async_llm.py:261] Added request cmpl-82e918d0c0a34696a6ecf4efe0791ee1-0.
INFO 03-02 01:46:44 [logger.py:42] Received request cmpl-96a26c50eb3c4e419b24e75bff39da52-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:46:44 [async_llm.py:261] Added request cmpl-96a26c50eb3c4e419b24e75bff39da52-0.
INFO 03-02 01:46:45 [logger.py:42] Received request cmpl-208a1c8aa75d466b98ea742e2505aa7b-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:46:45 [async_llm.py:261] Added request cmpl-208a1c8aa75d466b98ea742e2505aa7b-0.
INFO 03-02 01:46:46 [logger.py:42] Received request cmpl-dc1d6f1b425d4aa9b5529e787862fd3b-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:46:46 [async_llm.py:261] Added request cmpl-dc1d6f1b425d4aa9b5529e787862fd3b-0.
INFO 03-02 01:46:47 [logger.py:42] Received request cmpl-096475b314ee46cb8d2de6120b77e7b2-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:46:47 [async_llm.py:261] Added request cmpl-096475b314ee46cb8d2de6120b77e7b2-0.
INFO 03-02 01:46:48 [loggers.py:116] Engine 000: Avg prompt throughput: 27.9 tokens/s, Avg generation throughput: 4.5 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.6%
INFO 03-02 01:46:49 [logger.py:42] Received request cmpl-275e696059f049b2a0586344538347a8-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:46:49 [async_llm.py:261] Added request cmpl-275e696059f049b2a0586344538347a8-0.
INFO 03-02 01:46:50 [logger.py:42] Received request cmpl-4df43c2b2ce842fca68cb476e3b69c8a-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:46:50 [async_llm.py:261] Added request cmpl-4df43c2b2ce842fca68cb476e3b69c8a-0.
INFO 03-02 01:46:51 [logger.py:42] Received request cmpl-7fd0ffb7e2e247dbbaf9dc15aa5c15b3-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:46:51 [async_llm.py:261] Added request cmpl-7fd0ffb7e2e247dbbaf9dc15aa5c15b3-0.
INFO 03-02 01:46:52 [logger.py:42] Received request cmpl-a19e469292cc413f977aeaa4d2ea70f2-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:46:52 [async_llm.py:261] Added request cmpl-a19e469292cc413f977aeaa4d2ea70f2-0.
INFO 03-02 01:46:53 [logger.py:42] Received request cmpl-2adb06fe7a99438d99911feeb8ab64e9-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:46:53 [async_llm.py:261] Added request cmpl-2adb06fe7a99438d99911feeb8ab64e9-0.
INFO 03-02 01:46:54 [logger.py:42] Received request cmpl-8378e4333a9e4998ada835af4d31d0b5-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None.
INFO:  1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK
INFO 03-02 01:46:54 [async_llm.py:261] Added request cmpl-8378e4333a9e4998ada835af4d31d0b5-0.