Security Scan Report: webharvest.gov

Submitted: Oct 24, 2025, 6:39:04 PMCompleted: Oct 24, 2025, 6:40:59 PMpubliccompleted
Loading additional data...

Summary

This website contacted 4 IPs in 1 country across 1 domain to perform 2 HTTP transactions. The main domain is webharvest.gov and was registered NaN years ago.

Submitted URL: https://webharvest.gov/

AI Security Verdict

Safe Website

Confidence: 95%

0
Risk Score

Site appears legitimate with no security concerns.

Safety Factors
Established government domain (.gov)
Long domain age indicating legitimacy
Absence of suspicious forms or data collection
No external links or redirects
No malicious Indicators of Compromise
Domain age information unavailable

Details

Bot Protection Detected

This website is protected by rate_limit bot protection. Our scanner was challenged or blocked during access.

Page Title

National Archives

Scan Type

public

Language

🇺🇸

English

(50% confidence)

Category

government

(95%)

Domain Information

You're looking at domain 'webharvest.gov' on the United States government-restricted top-level domain (.gov) without a subdomain. Count 10 characters in 'webharvest' holding 3 vowels versus 7 consonants. Splitting it apart reveals two words: web, harvest. Median word length is 5 characters. 'web' is most common in Sinhala usage. Usage also turns up in English and Vietnamese contexts. Overall, 'webharvest.gov' reads as Sinhala.

Screenshot

Security scan screenshot of https://webharvest.gov/

Page Load Overview

1.74s
Total Load Time
2
HTTP Requests
1
Domains
1 KB
Total Size

Language Analysis

Primary Language

🇺🇸English
Code: en
Confidence:50%
Script:Latin
Direction:ltr

Detection Details

Language Code:en
Detection Confidence:50%
Script Type:Latin
Text Length:279 chars
Detector Agreement:100%

Website Classification

Primary Category

government95% confidence
Type: static
Method: structural

All Detected Categories

government
95%

Detected Features

No structural features detected

Domain & IP Information

RequestsIP AddressLocationAS Autonomous System
2207.241.225.8United States
AS7941INTERNET-ARCHIVE
02620:0:9c0::a172San Francisco, California, United States
AS7941INTERNET-ARCHIVE
0207.241.232.8United States
AS7941INTERNET-ARCHIVE
02620:0:9c0::a173San Francisco, California, United States
AS7941INTERNET-ARCHIVE
24--

Content Similarity HashesFor malware variant detection

TLSH (Trend Micro Locality Sensitive Hash)

Security-focused

Specialized for malware detection and similarity analysis

T189F0ABA243F40933048004081600F2062F91C03F8747616A318E0F750F89E8AC9AF1D7

ssdeep (Context Triggered Piecewise Hashing)

Context-aware

Detects similar content even with modifications

6:qF/UGsQ4NxP3AEdBAkqoXaZxWRS8mAX3mflyAgx0rIAT/ckyPFRf3aS+X1LYMWXT:WEdDlXCWRSxAXWfdr5/ckyPGxXJhWoQL

sdhash (Similarity Digest Hashing)

High-precision

High-precision similarity detection for forensic analysis

sdhash:1:0:40a0457e763902ed12097cffc3536f1e

These hashes enable detection of similar websites and malware variants by comparing content similarity even when exact matches aren't found.

Image Hashes

Perceptual Hashes

Average Hash:00ffffffffffffff
Perceptual Hash:ba3a3a3a3a3a3a2a
Difference Hash:0500000000000000
Wavelet Hash:00ffffff00000000
Color Hash:#87c588

Other Hashes

Crop Resistant:0500000000000000

Scan History

Scan history not available

Unable to load historical scan data