Back to home

Data Schema & Methodology

Complete documentation for researchers and transparency

What Is a "Report"?

A report is a single community submission documenting a person's self-reported experience with a specific peptide. Reports are collected from public forums and self-experimenter networks and structured into a standardized format.

Report Schema

FieldDescription
PeptideName of the peptide referenced (e.g. BPC-157, Ozempic)
Source URLDirect link to the original post or thread
PlatformSource community (e.g. Reddit, forum name)
Report DateDate of original post
CategoryReported effect type: Benefit / Side Effect / Dosing / Neutral
SubcategorySpecific tag (e.g. "recovery", "sleep", "appetite suppression")
SentimentPositive / Negative / Neutral (rule-based classification)
Confidence Score1–3 (1 = anecdotal mention, 2 = detailed account, 3 = structured self-experiment)
Duplicate FlagWhether the submission was flagged as a cross-post

How Reports Enter the System

  1. 1

    Continuous monitoring of major peptide communities via keyword and entity tracking

  2. 2

    Each candidate post is evaluated against minimum criteria (must reference a specific peptide, must include a reported effect or experience)

  3. 3

    Posts are categorized and tagged using a rule-based classification system

  4. 4

    Source URL is retained and linked for independent verification

Deduplication

  • Cross-posts (the same report appearing on multiple platforms) are flagged and counted once
  • Posts from the same user within a 30-day window on the same peptide are deduplicated
  • Deduplication is based on source URL + content fingerprint matching

Spam & Bias Filtering

  • Promotional content and vendor posts are excluded
  • Posts without a reported personal experience are excluded
  • Conflicting reports are retained and classified separately — they are not merged or averaged
  • No editorial weighting is applied; all qualifying reports carry equal weight

Dataset Size by Peptide (Top 10)

PeptideReport Count
BPC-157~9,200
Semaglutide/Ozempic~8,400
TB-500~6,100
Tirzepatide~4,800
NAD+~4,200
GHK-Cu~3,700
PT-141~3,400
AOD-9604~2,900
Semax~2,600
Selank~1,800
87 peptides total in dataset

Sample Data

Below is a representative sample of 10 anonymized, structured reports:

PeptideCategorySubcategorySentimentConfidencePlatformSource
BPC-157BenefitRecoveryPositive3Redditreddit.com/r/Peptides/comments/example1
SemaglutideSide EffectNauseaNegative2Redditreddit.com/r/Semaglutide/comments/example2
TB-500BenefitHealingPositive2Redditreddit.com/r/Peptides/comments/example3
TirzepatideBenefitAppetitePositive3Redditreddit.com/r/Tirzepatide/comments/example4
BPC-157DosingProtocolNeutral2Forumlongecity.org/forum/topic/example5
PT-141BenefitLibidoPositive2Redditreddit.com/r/Peptides/comments/example6
SemaxBenefitCognitivePositive3Redditreddit.com/r/Nootropics/comments/example7
NAD+NeutralEnergyNeutral1Redditreddit.com/r/Biohackers/comments/example8
GHK-CuBenefitSkinPositive2Redditreddit.com/r/Peptides/comments/example9
SelankSide EffectFatigueNegative1Forumlongecity.org/forum/topic/example10

Known Limitations

We publish these limitations explicitly because transparency is the foundation of trustworthy data.

  • Self-reported: All data is user-submitted and not clinically validated
  • Selection bias: Data skews toward enthusiast communities; casual or negative experiences may be underrepresented
  • No outcome verification: We do not confirm whether reported outcomes occurred or persisted
  • No dosing standardization: Dosing information reflects user-reported amounts, not clinical protocols
  • No adverse event tracking: Reports do not systematically capture long-term effects
  • Not peer-reviewed: This dataset has not been reviewed by any scientific or medical body

This data is appropriate for:

  • Trend identification
  • Hypothesis generation
  • Community pattern analysis

This data is NOT appropriate for:

  • Clinical conclusions
  • Medical guidance
  • Regulatory submissions

How to Cite DYK Peptides

If referencing this platform in research or writing:

DYK Peptides. (2025). Community-Reported Peptide Data Aggregation Platform [Dataset]. Retrieved from https://dykpeptides.com/data-schema

Citation Questions?

For academic citation guidance or data inquiries, send us a message.