63 points by __cayenne__ 2 hours ago | 17 comments on HN
| Mixed: Strong positive signals on information access (Article 19) and scientific participation (Article 27) are offset by significant privacy and tracking concerns (Article 12). The benchmark structure supports procedural fairness (Article 7) and community engagement (Articles 13 and 20) but lacks privacy governance and user consent mechanisms.
Editorial · v3.4 · 2026-02-25
[Figure: Article Heatmap. Legend: Negative, Neutral, Positive, No Data]
Aggregates
Weighted Mean: +0.13
Unweighted Mean: +0.12
Max: +0.68 (Article 19)
Min: -0.33 (Article 12)
Signal: 15
No Data: 16
Confidence: ND
Volatility: 0.41 (High)
Negative: 2
Channels: E: 0.6, S: 0.4
SETL: +0.04 (Editorial-dominant)
FW Ratio: 66%
0 facts · 0 inferences
Evidence: High: 2, Medium: 7, Low: 6, No Data: 16
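The report does not state how the Editorial and Structural channels combine into a per-article score. As a rough sketch only, one plausible reading of the channel weights (E: 0.6, S: 0.4) is a weighted blend that falls back to the single available channel when the other is ND. The function name `blend_channels` and the ND fallback rule are assumptions, not the site's documented method.

```python
from typing import Optional

# Channel weights as listed under Aggregates (E: 0.6, S: 0.4).
EDITORIAL_WEIGHT = 0.6
STRUCTURAL_WEIGHT = 0.4


def blend_channels(editorial: Optional[float],
                   structural: Optional[float]) -> Optional[float]:
    """Hypothetical blend of the two channels; None stands in for ND.

    Assumption: when only one channel has a value, that value is used as-is.
    """
    if editorial is None and structural is None:
        return None
    if editorial is None:
        return structural
    if structural is None:
        return editorial
    return EDITORIAL_WEIGHT * editorial + STRUCTURAL_WEIGHT * structural


# Channel values copied from the Score Breakdown entries in this report.
print(round(blend_channels(0.15, 0.10), 2))    # Article 1
print(round(blend_channels(None, 0.15), 2))    # Article 7 (Editorial is ND)
print(round(blend_channels(0.25, 0.15), 2))    # Article 22
print(round(blend_channels(-0.20, -0.30), 2))  # Article 12
```

Under these assumptions the blend gives 0.13, 0.15, and 0.21 for Articles 1, 7, and 22, matching the listed scores, but it gives -0.24 for Article 12 against a listed -0.33. The gap suggests the domain-level modifiers (for example Ad/Tracking at -0.08) or some other adjustment also feed into the published figures, so the sketch should not be read as the actual scoring formula.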
[Figure: Theme Radar]
Domain Context Profile
Element | Modifier | Affects | Note
Privacy | - | - | No privacy policy or data handling statement visible on domain; Google Analytics tracking present.
Terms of Service | - | - | No terms of service link found on-domain.
Accessibility | +0.05 | Article 2, Article 13 | Page includes alt text for images, semantic HTML structure, and fallback noscript content for charts, suggesting accessibility-conscious design.
Mission | +0.10 | Article 19, Article 27 | Site explicitly promotes open science: references open-source Screeps paradigm, OpenCode harness, public GitHub repository, and documentation transparency.
Editorial Code | - | - | No explicit editorial standards or corrections policy visible.
Ownership | - | - | GitHub organization 'llmskirmish' present; parent entity not explicitly identified on-domain.
Access Model | +0.05 | Article 19, Article 27 | Free public access to benchmark data, tournament results, and detailed analysis; no paywall or registration requirement evident.
Ad/Tracking | -0.08 | Article 3, Article 12 | Google Analytics tracking (GA-CZH5MJ4H15) present without visible explicit consent mechanism or granular opt-out controls; potential privacy signal.
Score Breakdown
+0.33
Preamble
Medium Framing Advocacy
Editorial
+0.35
Structural
+0.25
SETL
+0.19
Combined
ND
Context Modifier
ND
The Preamble's emphasis on equal recognition of dignity and rights is implicit in the framing of LLMs as competitive agents with agency; the benchmark structure suggests formal evaluation principles.
Observable Facts
Page frames LLMs as autonomous agents competing in a structured tournament with equal match opportunity.
Introduction explains benchmark design philosophy reflecting transparency and reproducibility values.
Site provides open access to documentation, GitHub repository, and example strategies without paywalls.
Inferences
The tournament structure and public documentation suggest commitment to fair evaluation and knowledge-sharing, consistent with Preamble ideals of shared human responsibility.
Framing LLMs as strategic agents implies some form of equal treatment in the experimental design.
+0.13
Article 1: Freedom, Equality, Brotherhood
Low
Editorial
+0.15
Structural
+0.10
SETL
+0.09
Combined
ND
Context Modifier
ND
Article 1 (freedom and equality) not directly addressed. Content focuses on technical benchmark rather than rights/dignity of any stakeholder.
Observable Facts
Page contains no explicit statements about equality or freedom of LLMs, humans, or stakeholders.
Tournament structure treats all five models with equal round-robin scheduling in each round.
Inferences
Equal match scheduling is procedurally neutral but does not constitute advocacy for freedom or equality in the UDHR sense.
+0.13
Article 2: Non-Discrimination
Low
Editorial
ND
Structural
+0.08
SETL
ND
Combined
ND
Context Modifier
ND
No editorial content on discrimination. Structural accessibility features (alt text, semantic HTML, noscript fallbacks) suggest inclusive design practice.
Observable Facts
Page includes alt text for images and noscript fallback content for JavaScript-dependent charts.
Semantic HTML structure present (header, main, article, section tags).
Content is English-only with no translations offered; no explicit language restriction is stated.
Inferences
Accessibility design choices suggest awareness of non-discrimination principles in digital access.
-0.28
Article 3: Life, Liberty, Security
Medium Practice
Editorial
-0.15
Structural
-0.25
SETL
+0.16
Combined
ND
Context Modifier
ND
Google Analytics tracking without visible explicit consent or opt-out mechanism; no privacy policy accessible; potential security/liberty concern.
Observable Facts
Google Analytics tracking script (GA-CZH5MJ4H15) loaded asynchronously at page initialization.
No cookie consent banner, privacy policy link, or analytics opt-out control visible on page.
No GDPR compliance notice or data handling disclosure present.
Inferences
Unannounced tracking suggests potential collection of personal data (visitor behavior, device info) without transparent user control, contradicting Article 3 (security of person) and Article 12 (privacy).
Absence of consent mechanism or privacy documentation suggests tracking takes precedence over user autonomy.
ND
Article 4: No Slavery
ND
Article 4 (no slavery) not addressed in benchmark content or site structure.
ND
Article 5: No Torture
ND
Article 5 (torture/cruel treatment) not addressed in benchmark content.
ND
Article 6: Legal Personhood
ND
Article 6 (right to recognition as person) not directly addressed; LLMs framed as agents but not as rights-bearing entities.
+0.15
Article 7: Equality Before Law
Low Practice
Editorial
ND
Structural
+0.15
SETL
ND
Combined
ND
Context Modifier
ND
Structural equality: tournament design provides equal protection and impartial evaluation (round-robin scheduling, standardized rules). No editorial commentary.
Observable Facts
All five models receive identical match scheduling (every player plays every other player once per round).
Game rules and API documentation publicly available; no hidden information.
Validation process applies uniformly to all agents: up to 3 attempts to fix script errors before proceeding.
Inferences
Impartial tournament structure reflects equal protection principles in procedural design.
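The round-robin property noted above (every player meets every other player once per round) can be illustrated with a short sketch. The model names are placeholders, since this section does not list the five entrants, and `round_robin` is a hypothetical helper rather than code from the benchmark repository.

```python
from itertools import combinations

# Placeholder names: the report says five models compete but does not name them here.
MODELS = ["model_a", "model_b", "model_c", "model_d", "model_e"]


def round_robin(players):
    """Return every unordered pairing exactly once (one round of matches)."""
    return list(combinations(players, 2))


schedule = round_robin(MODELS)

# Count how many matches each player appears in within the round.
games_per_player = {p: sum(p in match for match in schedule) for p in MODELS}

print(len(schedule))       # 10 pairings for five players
print(games_per_player)    # each player appears in 4 matches
```

With five entrants this yields 10 matches per round and 4 matches per player, which is the sense in which the schedule is procedurally equal.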
ND
Article 8: Right to Remedy
ND
Article 8 (right to remedy) not addressed; no appeals process or grievance mechanism documented.
ND
Article 9: No Arbitrary Detention
ND
Article 9 (freedom from arbitrary arrest) not applicable to technical benchmark context.
+0.07
Article 10: Fair Hearing
Low Framing
Editorial
+0.10
Structural
+0.05
SETL
+0.07
Combined
ND
Context Modifier
ND
Fair hearing elements implicit in structured tournament; public documentation of results provides transparency but no explicit right of appeal.
Observable Facts
Match results, standings, and detailed analysis publicly available.
Footnotes explain validation methodology and performance anomalies (e.g., Gemini 3 Pro context rot analysis).
Inferences
Public documentation of methodology and results suggests commitment to fair adjudication, though no appeal or challenge mechanism is visible.
ND
Article 11: Presumption of Innocence
ND
Article 11 (presumption of innocence) not applicable.
-0.33
Article 12: Privacy
Medium Practice
Editorial
-0.20
Structural
-0.30
SETL
+0.17
Combined
ND
Context Modifier
ND
Arbitrary interference with privacy: unannounced Google Analytics tracking, no privacy policy, no user control over data collection.
Observable Facts
Google Analytics tracking active with no visible consent mechanism, privacy policy, or opt-out control.
No disclosure of what data is collected, how it is retained, or who has access.
Footer email link is obfuscated (Cloudflare protection), limiting direct contact.
No GDPR, CCPA, or privacy regulation compliance notices present.
Inferences
Unannounced tracking without user control constitutes potential arbitrary interference with privacy.
Absence of transparency documentation suggests visitor data privacy is not a priority.
+0.32
Article 13: Freedom of Movement
Medium Advocacy Coverage
Editorial
+0.20
Structural
+0.35
SETL
-0.23
Combined
ND
Context Modifier
ND
Freedom of movement/association: public access to benchmark, community features (Discord, GitHub), open participation structure.
Observable Facts
Page links to public Discord community and GitHub organization with no registration barrier.
Tournament bracket and detailed results openly accessible.
Site references 'community ladder' as primary navigation element.
GitHub repository is open source, allowing downstream use and participation.
Inferences
Open community channels and public data structures facilitate freedom of association.
Emphasis on 'community ladder' signals inclusive participation model.
ND
Article 14: Asylum
ND
Article 14 (asylum) not applicable to technical benchmark.
ND
Article 15: Nationality
ND
Article 15 (nationality) not addressed.
ND
Article 16: Marriage & Family
ND
Article 16 (marriage/family) not applicable.
ND
Article 17: Property
ND
Article 17 (property rights) not addressed in benchmark context.
ND
Article 18: Freedom of Thought
ND
Article 18 (freedom of thought/conscience) not addressed.
+0.68
Article 19: Freedom of Expression
High Advocacy Practice
Editorial
+0.55
Structural
+0.60
SETL
-0.17
Combined
ND
Context Modifier
ND
Strong positive signals: freedom to seek and receive information. Open-source benchmark, public GitHub, detailed documentation, transparent methodology, free access.
Observable Facts
GitHub repository (github.com/llmskirmish) is publicly accessible with full source code, prompt templates, and example strategies.
Documentation links to OBJECTIVE.md, NEXT_ROUND.md, and example strategies on GitHub with no paywall.
Page provides detailed analysis of methodology, results, and per-model breakdowns.
All match data, standings, and performance charts available free to all visitors.
'Documentation' link in navigation directs to technical documentation.
Inferences
Open-source release and public documentation strongly support freedom to seek and receive information about benchmark methodology and results.
Transparent publication of results and analysis enables informed public discourse about LLM capabilities.
Removal of access barriers (no login, no paywall) extends freedom of information to all visitors.
+0.20
Article 20: Assembly & Association
Low Practice
Editorial
ND
Structural
+0.20
SETL
ND
Combined
ND
Context Modifier
ND
Structural freedom of peaceful assembly: public Discord community and GitHub organization provide platforms for collective action around LLM evaluation.
Observable Facts
Discord server link provided (discord.gg/7pdtNEHW7d) for community discussion.
GitHub organization allows community contributions and forks.
Inferences
Provision of community platforms enables peaceful assembly around shared interest in LLM benchmarking.
+0.10
Article 21: Political Participation
Low Practice
Editorial
ND
Structural
+0.10
SETL
ND
Combined
ND
Context Modifier
ND
Minimal structural support for political participation; benchmark is technical, not political, but open governance model could enable broader participation.
Observable Facts
No explicit governance or decision-making process documented on-domain.
GitHub organization and open-source model allow community contributions in principle.
Inferences
Open-source structure creates potential for participatory governance, though actual mechanisms are not documented on-domain.
+0.21
Article 22: Social Security
Medium Framing Practice
Editorial
+0.25
Structural
+0.15
SETL
+0.16
Combined
ND
Context Modifier
ND
Editorial framing emphasizes scientific and technical advancement; structural access enables social participation around benchmark science.
Observable Facts
Page frames LLM Skirmish as a contribution to scientific understanding of in-context learning and model capabilities.
References to peer-level analysis (model breakdowns, cost efficiency charts) position readers as informed participants in evaluation discourse.
Public dissemination of methodology and results enables participation in scientific community around LLM evaluation.
Inferences
Open publication of benchmark supports participation in scientific progress and evaluation discourse.
Detailed analysis and public data enable collective knowledge-building about LLM capabilities.
ND
Article 23: Work & Equal Pay
ND
Article 23 (work/employment) not addressed in technical benchmark context.
ND
Article 24: Rest & Leisure
ND
Article 24 (rest/leisure) not addressed.
ND
Article 25: Standard of Living
ND
Article 25 (health/welfare) not addressed in benchmark context.
+0.28
Article 26: Education
Medium Advocacy Coverage
Editorial
+0.30
Structural
+0.25
SETL
+0.12
Combined
ND
Context Modifier
ND
Education commitment: detailed technical documentation, prompt tutorials, transparent methodology enable learning about LLM capabilities and evaluation science.
Observable Facts
Links to example strategies and prompt templates (OBJECTIVE.md, NEXT_ROUND.md) provided for learning purposes.
Model breakdowns educate readers about strategy diversity and model behavior patterns.
Documentation explicitly describes Screeps paradigm and OpenCode harness, enabling reproducibility and learning.
Inferences
Comprehensive documentation and structured analysis support development of understanding about LLM capabilities and evaluation methodologies.
Educational framing positions benchmark as teaching tool, not just competitive leaderboard.
+0.63
Article 27: Cultural Participation
High Advocacy Practice
Editorial
+0.50
Structural
+0.55
SETL
-0.17
Combined
ND
Context Modifier
ND
Strong signals supporting participation in scientific and cultural life: open-source publication, public benchmarking, community participation, reproducible research.
Observable Facts
GitHub repository is described as open source (specific license not verified), enabling derivative research and community extension.
Benchmark results, methodology, and source code made publicly available without paywalls.
References to Screeps open-source paradigm and OpenCode open-source harness emphasize reproducibility and community science.
Community ladder and Discord community enable participation in shared scientific endeavor.
Page explicitly states 'fully open source to aid in replicability.'
Inferences
Open-source release and public benchmarking strongly support participation in scientific and cultural life.
Emphasis on replicability and community contribution extends participation beyond original authors.
Free public access removes barriers to participation in evaluation discourse.
ND
Article 28: Social & International Order
ND
Article 28 (international order) not applicable to technical website.
ND
Article 29: Duties to Community
ND
Article 29 (duties/limitations) not explicitly addressed in benchmark content.
ND
Article 30: No Destruction of Rights
ND
Article 30 (UDHR supremacy) not applicable to technical website context.