Security Scan Report: archive.org

Submitted: Mar 7, 2026, 7:46:02 PM
Completed: Mar 7, 2026, 7:47:37 PM
Scan Type: public
Status: completed

Summary

This website contacted 2 IPs in 1 country across 2 domains to perform 1 HTTP transaction. The main domain is archive.org; its registration age could not be determined.

Submitted URL: https://archive.org/download/789UnofficialArchive-mediafire/789%20Unofficial%20Archive.zip
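The submitted URL is percent-encoded (`%20` stands for a space). As a minimal sketch using only the Python standard library, the snippet below splits the URL and decodes the requested file name. The variable names are illustrative and are not part of the scan report.

```python
from posixpath import basename
from urllib.parse import unquote, urlsplit

# URL exactly as it appears in the report.
submitted_url = (
    "https://archive.org/download/789UnofficialArchive-mediafire/"
    "789%20Unofficial%20Archive.zip"
)

parts = urlsplit(submitted_url)

# Decode percent-escapes in the path, then take the last path segment.
decoded_path = unquote(parts.path)
file_name = basename(decoded_path)

print(parts.hostname)  # archive.org
print(file_name)       # 789 Unofficial Archive.zip
```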

The Cisco Umbrella rank of the primary domain is #13,499 of the top 1 million websites.

AI Security Verdict

Safe Website

Confidence: 96%

Risk Score: 0

No security concerns detected; archive.org is a reputable site.

Safety Factors
Well‑established domain with long registration history
Absence of malicious Indicators of Compromise
No credential or payment collection mechanisms
No JavaScript malware patterns detected
No IDS alerts or suspicious network activity
Domain age information unavailable

Details

Page Title

401 Authorization Required

Scan Type

public

Language

🇺🇸 English (80% confidence)

Category

technology software (85%)

Domain Information

You're looking at domain 'archive.org' on the non-profit oriented generic top-level domain (.org) with no subdomain. The second-level label 'archive' is 7 characters long holding three vowels versus four consonants. Breaking it apart gives 1 word: archive. The median word length lands at 7 characters. No strong language cues emerged from the frequency lists.
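The label statistics quoted above (length, vowel and consonant counts) can be reproduced with a few lines of Python. This is a minimal sketch, not the scanner's actual method: the function name `label_stats` is hypothetical, and the naive split on dots assumes a single-label suffix such as `.org` (it would mishandle suffixes like `.co.uk`).

```python
VOWELS = set("aeiou")

def label_stats(domain: str) -> dict:
    """Split a bare domain and count letters in its second-level label.

    Assumes a single-label TLD; multi-part public suffixes are not handled.
    """
    labels = domain.lower().split(".")
    tld = labels[-1]
    second_level = labels[-2]
    subdomain = ".".join(labels[:-2])  # empty string when there is none

    letters = [ch for ch in second_level if ch.isalpha()]
    vowels = sum(ch in VOWELS for ch in letters)
    return {
        "tld": tld,
        "label": second_level,
        "subdomain": subdomain,
        "length": len(second_level),
        "vowels": vowels,
        "consonants": len(letters) - vowels,
    }

stats = label_stats("archive.org")
print(stats)
```

For `archive.org` this yields a 7-character label with 3 vowels and 4 consonants and an empty subdomain, matching the figures in the report.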

Screenshot

Security scan screenshot of https://archive.org/download/789UnofficialArchive-mediafire/789%20Unofficial%20Archive.zip

Page Load Overview

Total Load Time: 8.17s
HTTP Requests: 1
Domains: 1
Total Size: N/A

Language Analysis

Primary Language

🇺🇸 English
Code: en
Confidence: 80%
Script: Latin
Direction: ltr

Detection Details

Language Code: en
Detection Confidence: 80%
Script Type: Latin
HTML Lang Attribute: en
Text Length: 729 chars
Detector Agreement: 100%

Website Classification

Primary Category

technology software (85% confidence)
Type: static
Method: ml+structural

All Detected Categories

technology software: 85%
documentation technical: 67%
adult content: 59%
government public service: 39%
news media journalism: 26%

Detected Features

No structural features detected

Domain & IP Information

Requests   IP Address        Location        AS (Autonomous System)
1          204.62.248.196    United States
0          207.241.224.2     United States   AS7941 Internet Archive
1          2                 -               -

Content Similarity Hashes

For malware variant detection

TLSH (Trend Micro Locality Sensitive Hash)

Security-focused

Specialized for malware detection and similarity analysis

T1D0F0909B9F1A303F3E238571F4C32169DF740956EB8D25D28759011F72CA0419AB6FB8

ssdeep (Context Triggered Piecewise Hashing)

Context-aware

Detects similar content even with modifications

12:kx/3nL8bx0AqynLPlIgr8IHTF83TF83TF83TF83TF83TFf:kp8mAF9LTuTuTuTuTuTF

sdhash (Similarity Digest Hashing)

High-precision

High-precision similarity detection for forensic analysis

sdhash:3:560:AAAAAEAAAAAAAAAABAAAAAAAAAAAAAgAAAAAAEACAAAAAAAAAAAAAAIAAAQAAAAAAAAAAAACAAAAAACAAAgAAAAAAAgAAAAA

These hashes enable detection of similar websites and malware variants by comparing content similarity even when exact matches aren't found.

Image Hashes

Perceptual Hashes

Average Hash: 00ffffffffffffff
Perceptual Hash: e666666666666626
Difference Hash: 0c00080000000000
Wavelet Hash: 00ff3f3f00000000
Color Hash: #404fbf

Other Hashes

Crop Resistant: 0c00080000000000
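Fixed-length image hashes like the 64-bit hex values above are commonly compared by Hamming distance: the number of differing bits, where a small distance suggests visually similar screenshots. The report does not state how the scanner compares them, so the sketch below is an assumption about typical usage. The helper name `hamming_distance` and the `hypothetical_variant` value are illustrative only, and only hashes produced by the same algorithm should be compared with each other.

```python
def hamming_distance(hex_a: str, hex_b: str) -> int:
    """Count differing bits between two equal-length hex-encoded hashes."""
    if len(hex_a) != len(hex_b):
        raise ValueError("hashes must have the same length")
    return bin(int(hex_a, 16) ^ int(hex_b, 16)).count("1")

# Values taken from the report.
difference_hash = "0c00080000000000"
crop_resistant = "0c00080000000000"

# Hypothetical hash of a near-identical screenshot (lowest bit flipped).
hypothetical_variant = "0c00080000000001"

print(hamming_distance(difference_hash, crop_resistant))        # 0
print(hamming_distance(difference_hash, hypothetical_variant))  # 1
```

A distance of 0 means the two hashes are identical, as is the case for the Difference Hash and Crop Resistant values in this report.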

Scan History

Scan history not available

Unable to load historical scan data