Security Scan Report: webarchive.nationalarchives.gov.uk

Submitted: Oct 22, 2025, 7:38:00 PM
Completed: Oct 22, 2025, 7:39:34 PM
Scan Type: public
Status: completed

Summary

This website contacted 39 IPs in 2 countries across 7 domains to perform 24 HTTP transactions. The main domain is webarchive.nationalarchives.gov.uk; its registration age is unavailable (the scanner reported "NaN years").

Submitted URL: https://webarchive.nationalarchives.gov.uk/ukgwa/*/http:/www.hefce.ac.uk/

AI Security Verdict

Safe Website

Confidence: 95%

Risk Score: 0

Legitimate government archive page with no security concerns.

Safety Factors
Hosted on a long‑standing UK Government Web Archive domain
Content consists of archival timeline information only
No collection of sensitive data (passwords, payment details, personal identifiers)
Domain age information unavailable

Details

Page Title

Archive Timeline - UK Government Web Archive

Scan Type

public

Language

🇺🇸 English (50% confidence)

Category

government (48%)

Domain Information

'webarchive.nationalarchives.gov.uk' is registered under .gov.uk, the government second-level domain of the United Kingdom country-code top-level domain (.uk), and is served from the subdomain 'webarchive'. Its registrable label 'nationalarchives' is 16 characters long, with 7 vowels and 9 consonants. It splits into 2 words: national, archives. The median word length is 8 characters. 'national' is identified as English; the detector also lists Chinese (Pinyin) and French as possible matches.
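
The label statistics above can be reproduced with a few lines of Python. This is a minimal sketch, not the scanner's actual method: the function name `label_stats` is hypothetical, the word split is supplied by hand rather than detected, and counting only a, e, i, o, u as vowels is an assumption.

```python
from statistics import median

VOWELS = set("aeiou")  # assumption: 'y' counts as a consonant


def label_stats(label, words):
    """Return basic character statistics for a registrable domain label.

    `words` is a hand-supplied segmentation of the label; this sketch
    does not try to detect word boundaries itself.
    """
    letters = [c for c in label.lower() if c.isalpha()]
    vowels = sum(1 for c in letters if c in VOWELS)
    return {
        "length": len(label),
        "vowels": vowels,
        "consonants": len(letters) - vowels,
        "median_word_length": median(len(w) for w in words),
    }


stats = label_stats("nationalarchives", ["national", "archives"])
print(stats)
```

Passing the segmentation in explicitly keeps the sketch deterministic; a real scanner would presumably use a dictionary-based word splitter, which is outside the scope of this example.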


Page Load Overview

Total Load Time: 44.85s
HTTP Requests: 24
Domains: 7
Total Size: 1.0 MB

Language Analysis

Primary Language

🇺🇸 English
Code: en
Confidence: 50%
Script: Latin
Direction: ltr

Detection Details

Language Code: en
Detection Confidence: 50%
Script Type: Latin
HTML Lang Attribute: en-gb
Text Length: 2,245 chars
Detector Agreement: 100%

Website Classification

Primary Category

government (48% confidence)
Type: dynamic
Method: ml+structural

All Detected Categories

government: 48%
healthcare medical: 34%
government public service: 34%
news media journalism: 32%
adult content: 27%

Detected Features

Articles

Domain & IP Information

Requests  IP Address       Location                                  Autonomous System
24        151.101.130.137  San Francisco, California, United States  AS54113 FASTLY
0         142.250.185.163  United States                             AS15169 GOOGLE
0         216.239.34.36    United States                             AS15169 GOOGLE
0         13.226.244.36    United States                             AS16509 AMAZON-02
0         13.226.244.31    United States                             AS16509 AMAZON-02
0         142.250.185.136  United States                             AS15169 GOOGLE
0         151.101.65.229   San Francisco, California, United States  AS54113 FASTLY
0         142.250.186.170  United States                             AS15169 GOOGLE
0         13.226.244.86    United States                             AS16509 AMAZON-02
0         151.101.1.229    San Francisco, California, United States  AS54113 FASTLY

Total: 24 requests, 39 IPs
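
The rows listed above can be grouped by autonomous system to see which network actually served the page. The following is a rough sketch under stated assumptions: the tuples are copied by hand from the ten rows shown (the report counts 39 IPs in total but lists only these), and `summarize_by_asn` is a hypothetical helper name, not part of any scanner API.

```python
from collections import defaultdict

# (requests, ip, asn, as_name), copied from the ten rows listed in the report
ROWS = [
    (24, "151.101.130.137", "AS54113", "FASTLY"),
    (0, "142.250.185.163", "AS15169", "GOOGLE"),
    (0, "216.239.34.36", "AS15169", "GOOGLE"),
    (0, "13.226.244.36", "AS16509", "AMAZON-02"),
    (0, "13.226.244.31", "AS16509", "AMAZON-02"),
    (0, "142.250.185.136", "AS15169", "GOOGLE"),
    (0, "151.101.65.229", "AS54113", "FASTLY"),
    (0, "142.250.186.170", "AS15169", "GOOGLE"),
    (0, "13.226.244.86", "AS16509", "AMAZON-02"),
    (0, "151.101.1.229", "AS54113", "FASTLY"),
]


def summarize_by_asn(rows):
    """Count listed IPs and sum requests for each autonomous system."""
    totals = defaultdict(lambda: {"ips": 0, "requests": 0})
    for requests, _ip, asn, name in rows:
        entry = totals[(asn, name)]
        entry["ips"] += 1
        entry["requests"] += requests
    return dict(totals)


summary = summarize_by_asn(ROWS)
for (asn, name), entry in sorted(summary.items()):
    print(f"{asn} {name}: {entry['ips']} IPs, {entry['requests']} requests")
```

In the listed rows, all 24 requests go to a single Fastly address; the Google and Amazon addresses were contacted without any recorded HTTP transactions.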

Content Similarity Hashes

For malware variant detection

TLSH (Trend Micro Locality Sensitive Hash)

Security-focused

Specialized for malware detection and similarity analysis

T168249C955CF21726B3B2829027A93F94BE82A4C3C86612557BEC4FD10F92D93D9DF107

ssdeep (Context Triggered Piecewise Hashing)

Context-aware

Detects similar content even with modifications

768:11RWW4DF13kdevunKtfmSwJR1zuCI/pnxgl7joIkLvzxV0mcbiT605BlgIO1efMy:xWbDqBnKy7Beh4O7L2duSbEqjZedSy

sdhash (Similarity Digest Hashing)

High-precision

High-precision similarity detection for forensic analysis

sdhash:3:215896:gagsCJEsAEICMFIAQABAQJAADMSAQA5hIAGBjEwMEBARJAYQjAEoIAECoAAYCIggCCgMuaIUAgJZSRggUAIGRBClBIACAEAg

These hashes enable detection of similar websites and malware variants by comparing content similarity even when exact matches aren't found.
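
As an illustration of how such a comparison starts, an ssdeep digest has the form `block_size:chunk:double_chunk`, and two digests are only scored against each other when their block sizes are equal or differ by a factor of two. The sketch below parses the digest from this report and checks that precondition using only the standard library. It does not compute a similarity score, and the names `SsdeepDigest`, `parse_ssdeep`, and `comparable` are hypothetical helpers, not the ssdeep library's API.

```python
from typing import NamedTuple


class SsdeepDigest(NamedTuple):
    block_size: int
    chunk: str         # signature computed at block_size
    double_chunk: str  # signature computed at 2 * block_size


def parse_ssdeep(digest: str) -> SsdeepDigest:
    """Split a 'block_size:chunk:double_chunk' digest into its three fields."""
    block_size, chunk, double_chunk = digest.split(":", 2)
    return SsdeepDigest(int(block_size), chunk, double_chunk)


def comparable(a: SsdeepDigest, b: SsdeepDigest) -> bool:
    """True when block sizes are equal or one is exactly twice the other."""
    return (
        a.block_size == b.block_size
        or a.block_size == 2 * b.block_size
        or b.block_size == 2 * a.block_size
    )


# Digest copied from the report above
page = parse_ssdeep(
    "768:11RWW4DF13kdevunKtfmSwJR1zuCI/pnxgl7joIkLvzxV0mcbiT605BlgIO1efMy"
    ":xWbDqBnKy7Beh4O7L2duSbEqjZedSy"
)
print(page.block_size, comparable(page, page))
```

Checking the block sizes first is a cheap filter: digests that fail it cannot be meaningfully compared, so a scoring step (for example via an ssdeep binding, if one is installed) can be skipped for them.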

Image Hashes

Perceptual Hashes

Average Hash: N/A
Perceptual Hash: N/A
Difference Hash: N/A
Wavelet Hash: N/A
Color Hash: N/A

Other Hashes

Crop Resistant: N/A
