Can you list some popular HuggingFace datasets that we can used to set guidellm --data

**What is the URL, file, or UI containing proposed doc change**
https://github.com/vllm-project/guidellm/blob/main/README.md

**What is the current content or situation in question**
GuideLLM supports HuggingFace datasets, local files, and synthetic data. This example loads the CNN DailyMail dataset from HuggingFace and maps the article column to prompts while using the summary token count column to determine output lengths.

```bash
guidellm benchmark \
  --target http://localhost:8000 \
  --data "hf:cnn_dailymail" \
  --data-args '{"prompt_column":"article","output_tokens_count_column":"summary_tokens"}'
```

**What is the proposed change**
Can you list some popular dataset that we can use, for example, used to calibrate MOE expert load balance. Or some guide to judge which types dataset we can use and which types of dataset can not.



Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Can you list some popular HuggingFace datasets that we can used to set guidellm --data #505

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Can you list some popular HuggingFace datasets that we can used to set guidellm --data #505

Description

Metadata

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Issue actions