Security Scan Report: loc.gov

Submitted: Oct 16, 2025, 12:05:15 PMCompleted: Oct 16, 2025, 12:07:11 PMpubliccompleted
Loading additional data...

Summary

This website contacted 8 IPs in 1 country across 2 domains to perform 24 HTTP transactions. The main domain is loc.gov and was registered NaN years ago.

Submitted URL: https://loc.gov/

AI Security Verdict

Moderate Risk

Confidence: 70%

4
Risk Score

Official site shows suspicious misinformation; likely compromised but no direct phishing or malware observed.

Risk Factors
Misinformation notice displayed on an official government domain suggests possible site compromise
Mismatch between OCR visible content and actual HTML content
Unranked status in Cisco Umbrella for a high‑profile .gov domain
Safety Factors
Long domain age (over 10 000 days)
No credential or payment forms present
No external links or redirects detected
No malicious Indicators of Compromise matches
Domain age information unavailable

Details

Bot Protection Detected

This website is protected by Cloudflare bot protection. Our scanner was challenged or blocked during access.

Page Title

Home | Library of Congress

Scan Type

public

Language

🇺🇸

English

(80% confidence)

Category

government

(48%)

Domain Information

Within the United States government-restricted top-level domain (.gov), 'loc.gov' is registered with no subdomain. The registrable portion 'loc' spans 3 characters with one vowel and 2 consonants. Word splitting yields 1 word: loc. 'loc' is most common in Romanian usage. Secondary signals appear in Breton and Danish. Taken together, it feels Romanian with single-word simplicity.

Screenshot

Security scan screenshot of https://loc.gov/

Page Load Overview

87.86s
Total Load Time
24
HTTP Requests
2
Domains
1 KB
Total Size

Language Analysis

Primary Language

🇺🇸English
Code: en
Confidence:80%
Script:Latin
Direction:ltr

Detection Details

Language Code:en
Detection Confidence:80%
Script Type:Latin
HTML Lang Attribute:en
Text Length:8,813 chars
Detector Agreement:100%

Website Classification

Primary Category

government48% confidence
Type: spa
Method: ml+structural

All Detected Categories

government
48%
entertainment media
40%
news/blog
20%

Detected Features

Search
OG: article

Domain & IP Information

RequestsIP AddressLocationAS Autonomous System
3104.18.64.82United States
AS13335CLOUDFLARENET
3104.17.6.58United States
AS13335CLOUDFLARENET
3104.18.94.41United States
AS13335CLOUDFLARENET
32606:4700::6812:4052United States
AS13335CLOUDFLARENET
32606:4700::6812:5f29United States
AS13335CLOUDFLARENET
32606:4700::6811:63aUnited States
AS13335CLOUDFLARENET
3104.18.95.41United States
AS13335CLOUDFLARENET
32606:4700::6812:5e29United States
AS13335CLOUDFLARENET
248--

Content Similarity HashesFor malware variant detection

TLSH (Trend Micro Locality Sensitive Hash)

Security-focused

Specialized for malware detection and similarity analysis

T1B6823B378A42101B72674FA77065F7548011F284D702A3BEF4A3AE689BCD95F56633EC

ssdeep (Context Triggered Piecewise Hashing)

Context-aware

Detects similar content even with modifications

192:Va+69yIPl2boVmJkEqQ2Bktkmtag0DT3kQ69OozyjgXl:U+6cq4brJkxVmAT3kQa3mg1

sdhash (Similarity Digest Hashing)

High-precision

High-precision similarity detection for forensic analysis

sdhash:3:18074:DPhWGyAABiAABAQEAQHpEjVCmQKUAOgyHgGVggYXAGQkRCjvMIMY2AECkQBQqa1ZgQmbYIHoMDLw6SRCIAxKAIgJGgICUkVK

These hashes enable detection of similar websites and malware variants by comparing content similarity even when exact matches aren't found.

Image Hashes

Perceptual Hashes

Average Hash:ffc7c3c3dfffffe7
Perceptual Hash:b038c3cb6349c7c7
Difference Hash:203406161400000c
Wavelet Hash:3c000000cfffffc3
Color Hash:#6640bf

Other Hashes

Crop Resistant:203406161400000c

Scan History

Scan history not available

Unable to load historical scan data