Security Scan Report: web.archive.org

Submitted: Jul 3, 2026, 2:52:56 PMCompleted: Jul 3, 2026, 2:54:35 PMpubliccompleted
Loading additional data...

Summary

This website contacted 1 IP in 1 country across 1 domain to perform 4 HTTP transactions. The main domain is web.archive.org and was registered NaN years ago.

Submitted URL: https://web.archive.org/web/20260414174030/https://mail-owaexchnange-autodisciover-mailbox.s3.us-east-1.amazonaws.com/index.html#[email protected]

The Cisco Umbrella rank of the primary domain is #13,499 of the top 1 million websites

AI Security Verdict

Low Risk

Confidence: 70%

3
Risk Score

The page contains a phishing lure (email in URL fragment) but no credential collection; moderate risk overall.

Risk Factors
Phishing lure via email in URL fragment
Medium severity IDS alert
Absence of legitimate page content
Safety Factors
Domain age >30 years and high reputation ranking
No JavaScript malware or obfuscation detected
No external domains or cross‑origin requests
No forms collecting credentials or payment data
Verdict cited a credential/login form, but DOM analysis found no password field (real or disguised) or payment field, and no other hard signal — credential-phishing framing unsupported; risk adjusted from 5 to 3
Domain age information unavailable

Details

Page Title

web.archive.org

Scan Type

public

Language

🇺🇸

English

(80% confidence)

Category

technology software

(85%)

Domain Information

The domain name 'web.archive.org' uses the non-profit oriented generic top-level domain (.org); it also runs on subdomain 'web'. The second-level label 'archive' is 7 characters long containing 3 vowels alongside 4 consonants. Tokenizing the label suggests one word: archive. No strong language cues emerged from the frequency lists.

Screenshot

Security scan screenshot of https://web.archive.org/web/20260414174030/https://mail-owaexchnange-autodisciover-mailbox.s3.us-east-1.amazonaws.com/index.html#example@example.com

Page Load Overview

8.08s
Total Load Time
1
HTTP Requests
1
Domains
N/A
Total Size

Language Analysis

Primary Language

🇺🇸English
Code: en
Confidence:80%
Script:Latin
Direction:ltr

Detection Details

Language Code:en
Detection Confidence:80%
Script Type:Latin
HTML Lang Attribute:en
Text Length:741 chars
Detector Agreement:100%

Website Classification

Primary Category

technology software85% confidence
Type: static
Method: ml+structural

All Detected Categories

technology software
85%
documentation technical
68%
adult content
55%
phishing scam
27%

Detected Features

No structural features detected

Domain & IP Information

RequestsIP AddressLocationAS Autonomous System
1207.241.237.3San Francisco, California, United States
AS7941Internet Archive
11--

Content Similarity HashesFor malware variant detection

TLSH (Trend Micro Locality Sensitive Hash)

Security-focused

Specialized for malware detection and similarity analysis

T1F7048F77329A063D86558498E45B43099F20B143F506C9BCB9BCBAD8BFDED06107BB78

ssdeep (Context Triggered Piecewise Hashing)

Context-aware

Detects similar content even with modifications

3072:l/Qho9PKBb9Js3q9Jzbs6tlg1ySBKwdQ9gcoIsPa2bMy8Old/:mhoC9JSqzzbs6okSjggcpsS2eAB

sdhash (Similarity Digest Hashing)

High-precision

High-precision similarity detection for forensic analysis

sdhash:3:188136:Q5ksEAFAJhOYLFjQDPh4BDjQcQ0D8YOylggNgFA0EKEEBQpF4wxBgABgIBQ0QhHNVCi1zBQwopQgEy5CgkQicBmxIDii8QUB

These hashes enable detection of similar websites and malware variants by comparing content similarity even when exact matches aren't found.

Image Hashes

Perceptual Hashes

Average Hash:ffc7c7c3d3ffffff
Perceptual Hash:b1339acccc93b364
Difference Hash:0018181616000000
Wavelet Hash:3c1c0404c0fcfcfc
Color Hash:#aac587

Other Hashes

Crop Resistant:0018181616000000

Scan History

Scan history not available

Unable to load historical scan data