Security Scan Report: archive.org

Submitted: Jan 26, 2026, 6:34:18 AM
Completed: Jan 26, 2026, 6:35:41 AM
Scan Type: public
Status: completed

Summary

This website contacted 2 IPs in 1 country across 4 domains to perform 41 HTTP transactions. The main domain is archive.org; its registration date was unavailable, so the domain age could not be computed.

Submitted URL: https://archive.org

The Cisco Umbrella rank of the primary domain is #13,499 of the top 1 million websites.

AI Security Verdict

Safe Website

Confidence: 98%

Risk Score: 0

The site is a legitimate, well‑established digital library with no security concerns.

Safety Factors
Well‑established non‑profit organization
Consistent branding with official domain archive.org
No suspicious redirects or URL manipulation
No JavaScript malware patterns detected
No brand impersonation or phishing indicators
Domain age information unavailable

Details

Page Title

Internet Archive: Digital Library of Free & Borrowable Texts, Movies, Music & Wayback Machine

Scan Type

public

Language

🇺🇸 English (80% confidence)

Category

entertainment media (91% confidence)

Domain Information

Within the non-profit oriented generic top-level domain (.org), 'archive.org' is registered and has no subdomain. The core label 'archive' covers 7 characters split between 3 vowels and 4 consonants. Word splitting yields 1 word: archive. Median word length is seven characters. No strong language cues emerged from the frequency lists.
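The label statistics above can be reproduced with a few lines of Python. This is a minimal sketch, not the scanner's actual implementation: the function name `label_stats` is hypothetical, the vowel set is assumed to be a, e, i, o, u, and the core label is assumed to be the component immediately before the top-level domain.

```python
def label_stats(domain: str) -> dict:
    """Count vowels and consonants in the core label of a domain."""
    # Assumption: the core label is the part immediately before the TLD.
    label = domain.lower().split(".")[-2]
    vowels = sum(ch in "aeiou" for ch in label)
    consonants = sum(ch.isalpha() and ch not in "aeiou" for ch in label)
    return {
        "label": label,
        "length": len(label),
        "vowels": vowels,
        "consonants": consonants,
    }


print(label_stats("archive.org"))
# {'label': 'archive', 'length': 7, 'vowels': 3, 'consonants': 4}
```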

Screenshot

Security scan screenshot of https://archive.org

Page Load Overview

Total Load Time: 1.91s
HTTP Requests: 55
Domains: 3
Total Size: 4 KB

Language Analysis

Primary Language

🇺🇸 English
Code: en
Confidence: 80%
Script: Latin
Direction: ltr

Detection Details

Language Code: en
Detection Confidence: 80%
Script Type: Latin
HTML Lang Attribute: en
Text Length: 194 chars
Detector Agreement: 100%

Website Classification

Primary Category

entertainment media (91% confidence)
Type: dynamic
Method: ml+structural

All Detected Categories

entertainment media: 91%
technology software: 72%
documentation technical: 58%

Detected Features

No structural features detected

Domain & IP Information

Requests  IP Address       Location       Autonomous System
28        207.241.224.2    United States  AS7941 Internet Archive
27        207.241.225.195  United States
55        2                -              -

Detected Technologies (1)

40%

Content Similarity Hashes

For malware variant detection

TLSH (Trend Micro Locality Sensitive Hash)

Security-focused

Specialized for malware detection and similarity analysis

T109F1DB0C7C44C46A56370B4E7DD2E899AAD2FB4F4145D6E0E0FF62A84BE4FD14CA9C26

ssdeep (Context Triggered Piecewise Hashing)

Context-aware

Detects similar content even with modifications

192:Uk0jxYoQgbhQbS0hdopk1sR0HoN39Z/f8VUG6:U7jxYonbhQS0hdoqa1/fOY

sdhash (Similarity Digest Hashing)

High-precision

High-precision similarity detection for forensic analysis

sdhash:3:7699:BClgyZCgLGRgQhA4QQJARCiGhINJADZVwrAJAASiSgQADCAwOQQCharoAQAJjgOATgCRlwKCQABDQAIBAVgIwKJhEoAEhIoc

These hashes enable detection of similar websites and malware variants by comparing content similarity even when exact matches aren't found.
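As an illustration of how such a digest can be handled, the sketch below splits the ssdeep value reported above into its three colon-separated fields (block size, chunk signature, double-block signature) and checks whether two digests are eligible for comparison at all. ssdeep only scores digests whose block sizes are equal or differ by a factor of two. The names `SsdeepDigest`, `parse_ssdeep`, and `comparable` are hypothetical helpers, and the sketch does not compute a similarity score; that would require the ssdeep library itself.

```python
from typing import NamedTuple


class SsdeepDigest(NamedTuple):
    block_size: int
    chunk: str
    double_chunk: str


def parse_ssdeep(digest: str) -> SsdeepDigest:
    """Split 'blocksize:chunk:double_chunk' into its three fields."""
    # The signature alphabet is base64 and never contains ':',
    # so splitting on the first two colons is safe.
    block_size, chunk, double_chunk = digest.split(":", 2)
    return SsdeepDigest(int(block_size), chunk, double_chunk)


def comparable(a: SsdeepDigest, b: SsdeepDigest) -> bool:
    """True if the block sizes are equal or one is double the other."""
    return (
        a.block_size == b.block_size
        or a.block_size == 2 * b.block_size
        or b.block_size == 2 * a.block_size
    )


reported = parse_ssdeep(
    "192:Uk0jxYoQgbhQbS0hdopk1sR0HoN39Z/f8VUG6:U7jxYonbhQS0hdoqa1/fOY"
)
print(reported.block_size)  # 192
```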

Image Hashes

Perceptual Hashes

Average Hash: 0000ffffffffe3c3
Perceptual Hash: bc1b61433ea672e4
Difference Hash: 63f92a2b260c0686
Wavelet Hash: 0000a7bfffffc300
Color Hash: #4e931f
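Hex-encoded 64-bit image hashes like the ones above are usually compared by Hamming distance, the number of bits in which two hashes differ; a small distance suggests visually similar screenshots. The sketch below is an assumption about how such a comparison could be done, not the scanner's own method. The second hash is a made-up value for illustration, and any cutoff for "similar" would have to be chosen by the user.

```python
def hamming_distance(hex_a: str, hex_b: str) -> int:
    """Count differing bits between two equal-length hex hash strings."""
    if len(hex_a) != len(hex_b):
        raise ValueError("hashes must have the same length")
    # XOR leaves a 1 bit wherever the two hashes disagree.
    return bin(int(hex_a, 16) ^ int(hex_b, 16)).count("1")


reported_phash = "bc1b61433ea672e4"      # Perceptual Hash from this report
hypothetical_phash = "bc1b61433ea672e5"  # made-up hash, differs in one bit

print(hamming_distance(reported_phash, hypothetical_phash))  # 1
```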

Scan History

Scan history not available