FrenchToxicityPrompts

📊 Benchmark Details

Name: FrenchToxicityPrompts

Overview: A dataset of 50,000 naturally occurring French prompts and their continuations, annotated with toxicity scores from a widely used toxicity classifier, designed to evaluate toxicity in generated content by language models.

Data Type: text

Domains:

Natural Language Processing

Languages:

French

Resources:

Resource

🎯 Purpose and Intended Users

Goal: To evaluate and mitigate toxicity in French text generated by language models.

Target Audience:

ML Researchers
Industry Practitioners
Model Developers

Tasks:

Toxicity Detection

Limitations: The dataset covers exclusively French data, and the toxicity scores are dependent on Perspective API.

💾 Data

Source: Lélu, a French written dialogue dataset extracted from Reddit’s public dataset.

Size: 50,000 prompts and continuations

Format: JSON

Annotation: Annotated using the Perspective API for various toxicity attributes.

🔬 Methodology

Methods:

Human evaluation
Automated metrics

Metrics:

Expected Maximum Toxicity (EMT)
Toxicity Probability (TP)
Toxic Fraction (TF)
Average Toxicity (AT)

Calculation: Metrics are calculated based on the scores provided by the Perspective API for toxicity attributes.

Interpretation: Higher toxicity metrics indicate greater potential for the model to generate toxic content.

Validation: Dataset was manually examined for a selection of samples to ensure the validity of toxicity annotations.

⚠️ Targeted Risks

Risk Categories:

Fairness
Safety
Accuracy

Atlas Risks: No specific atlas risks defined

Demographic Analysis: N/A

Potential Harm: N/A

🔒 Ethical and Legal Considerations

Privacy And Anonymity: The dataset contains explicit content and harmful language which requires careful handling.

Data Licensing: Not Applicable

Consent Procedures: Not Applicable

Compliance With Regulations: Not Applicable

FrenchToxicityPrompts

📊 Benchmark Details

Name: FrenchToxicityPrompts

Data Type: text

Domains:

Natural Language Processing

Languages:

French

Resources:

Resource

🎯 Purpose and Intended Users

Goal: To evaluate and mitigate toxicity in French text generated by language models.

Target Audience:

ML Researchers
Industry Practitioners
Model Developers

Tasks:

Toxicity Detection

Limitations: The dataset covers exclusively French data, and the toxicity scores are dependent on Perspective API.

💾 Data

Source: Lélu, a French written dialogue dataset extracted from Reddit’s public dataset.

Size: 50,000 prompts and continuations

Format: JSON

Annotation: Annotated using the Perspective API for various toxicity attributes.

🔬 Methodology

Methods:

Human evaluation
Automated metrics

Metrics:

Expected Maximum Toxicity (EMT)
Toxicity Probability (TP)
Toxic Fraction (TF)
Average Toxicity (AT)

Calculation: Metrics are calculated based on the scores provided by the Perspective API for toxicity attributes.

Interpretation: Higher toxicity metrics indicate greater potential for the model to generate toxic content.

Validation: Dataset was manually examined for a selection of samples to ensure the validity of toxicity annotations.

⚠️ Targeted Risks

Risk Categories:

Fairness
Safety
Accuracy

Atlas Risks: No specific atlas risks defined

Demographic Analysis: N/A

Potential Harm: N/A

🔒 Ethical and Legal Considerations

Privacy And Anonymity: The dataset contains explicit content and harmful language which requires careful handling.

Data Licensing: Not Applicable

Consent Procedures: Not Applicable

Compliance With Regulations: Not Applicable

FrenchToxicityPrompts

FrenchToxicityPrompts

📊 Benchmark Details

🎯 Purpose and Intended Users

💾 Data

🔬 Methodology

⚠️ Targeted Risks

🔒 Ethical and Legal Considerations

Related Skills

Nano Banana Pro

Markdown Converter

1password

FrenchToxicityPrompts

📊 Benchmark Details

🎯 Purpose and Intended Users

💾 Data

🔬 Methodology

⚠️ Targeted Risks

🔒 Ethical and Legal Considerations