Skip to main content
Skip to content
Case File
d-16866House OversightOther

Study on Automated Annotation of Censorship Victims and Culturomic Epidemic Tracking

The passage describes a methodological validation of an algorithm for identifying censored names and a culturomic analysis of historical epidemics. It contains no specific allegations, financial flows Human annotator classified 36 names as 'S', 27 as 'B', and 37 as ambiguous 'N'. Algorithm suppression/elevation matched annotator decisions with 81% (S) and 93% (B) correspondence. Culturomic data fr

Date
November 11, 2025
Source
House Oversight
Reference
House Oversight #017037
Pages
1
Persons
0
Integrity
No Hash Available

Summary

The passage describes a methodological validation of an algorithm for identifying censored names and a culturomic analysis of historical epidemics. It contains no specific allegations, financial flows Human annotator classified 36 names as 'S', 27 as 'B', and 37 as ambiguous 'N'. Algorithm suppression/elevation matched annotator decisions with 81% (S) and 93% (B) correspondence. Culturomic data fr

Tags

epidemic-trackingpublic-health-monitoringresearch-methodologyalgorithm-validationhistorical-datahouse-oversightculturomicscensorship-detection

Ask AI About This Document

0Share
PostReddit

Extracted Text (OCR)

EFTA Disclosure
Text extracted via OCR from the original document. May contain errors from the scanning process.
The annotator assigned 36 names to the S category and 27 names to the B category; the remaining 37 were given the ambiguous N classification. Of the names assigned to the S category by the human annotator, 29 had been annotated as suppressed by our algorithm, and 7 as elevated, so the correspondence between the annotator and our algorithm was 81%. Of the names assigned to the B category, 25 were annotated as elevated by our algorithm, and only 2 as suppressed, so the correspondence was 93%. Taken together, the conclusions of a scholarly annotator researching one name at a time closely matched those of our automated approach. These findings confirm that our computational method provides an effective strategy for rapidly identifying likely victims of censorship given a large pool of possibilities. III.10. Epidemics Disease epidemics have a significant impact on the surrounding culture (Fig. $18 A-C). It was recently shown that during seasonal influenza epidemics, users of Google are more likely to engage in influenza- related searches, and that this signature of influenza epidemics corresponds well with the results of CDC surveillance (Ref S16). We therefore reasoned that culturomic approaches might be used to track historical epidemics. These could help complement historical medical records, which are often woefully incomplete. We examined timelines for 4 diseases: influenza (main text), cholera, HIV, and poliomyelitis. In the case of influenza, peaks in cultural interest showed excellent correspondence with known historical epidemics (the Russian Flu of 1890, leading to 1M deaths, the Spanish Flu of 1918, leading to 20-100M deaths; and the Asian Flu of 1957, leading to 1.5M deaths). Similar results were observed for cholera and HIV. However, results for polio were mixed. The US epidemic of 1916 is clearly observed, but the 1951-55 epidemic is harder to pinpoint: the observed peak is much broader, starting in the 30s and ending in the 60s. This is likely due to increased interest in polio following the election of Franklin Delano Roosevelt in 1932, as well as the development and deployment of Salk’s polio vaccine in 1952 and Sabin’s oral version in 1962. These confounding factors highlight the challenge of interpreting timelines of cultural interest: interest may increase in response to an epidemic, but it may also respond to a stricken celebrity or a famous cure. The dates of important historical epidemics were derived from the Cambridge World History of Human Diseases (1993) 3" Edition. For cholera, we retained the time periods which most affected the Western world, according to this resource: - 1830-35 (Second Cholera Epidemic) - 1848-52, and 1854 (Third Cholera Epidemic) - 1866-74 (Fourth Cholera Epidemic) - 1883-1887 (Fifth Cholera Epidemic) The first, sixth and seventh cholera epidemics appear not to have caused significant casualties in the Western world. 29

Technical Artifacts (1)

View in Artifacts Browser

Email addresses, URLs, phone numbers, and other technical indicators extracted from this document.

Phone1883-1887

Forum Discussions

This document was digitized, indexed, and cross-referenced with 1,400+ persons in the Epstein files. 100% free, ad-free, and independent.

Annotations powered by Hypothesis. Select any text on this page to annotate or highlight it.