Security Scan Report: webarchive.nationalarchives.gov.uk

Site favicon
Submitted: Jan 18, 2026, 4:09:06 AMCompleted: Jan 18, 2026, 4:10:38 AMpubliccompleted
Loading additional data...

Summary

This website contacted 11 IPs in 1 country across 7 domains to perform 24 HTTP transactions. The main domain is webarchive.nationalarchives.gov.uk and was registered NaN years ago.

Submitted URL: https://webarchive.nationalarchives.gov.uk/ukgwa/*/http:/www.hefce.ac.uk/

The Cisco Umbrella rank of the primary domain is #150,668 of the top 1 million websites

AI Security Verdict

Safe Website

Confidence: 96%

0
Risk Score

Legitimate government archive site with no security concerns.

Safety Factors
Well-established domain (>20 years)
No forms or sensitive data collection
No malicious Indicators of Compromise
Official government archive service
Domain age information unavailable

Details

Page Title

Archive Timeline - UK Government Web Archive

Scan Type

public

Language

🇺🇸

English

(80% confidence)

Category

government

(48%)

Domain Information

The domain name 'webarchive.nationalarchives.gov.uk' uses the United Kingdom country-code top-level domain (.gov.uk), featuring subdomain 'webarchive'. The registrable portion 'nationalarchives' spans 16 characters with seven vowels and 9 consonants. Breaking it apart gives two words: national, archives. Average segment length settles at 8 characters. No strong language cues emerged from the frequency lists.

Screenshot

Security scan screenshot of https://webarchive.nationalarchives.gov.uk/ukgwa/*/http:/www.hefce.ac.uk/

Page Load Overview

2.58s
Total Load Time
24
HTTP Requests
7
Domains
1.0 MB
Total Size

Language Analysis

Primary Language

🇺🇸English
Code: en
Confidence:80%
Script:Latin
Direction:ltr

Detection Details

Language Code:en
Detection Confidence:80%
Script Type:Latin
HTML Lang Attribute:en-gb
Text Length:2,245 chars
Detector Agreement:100%

Website Classification

Primary Category

government48% confidence
Type: dynamic
Method: ml+structural

All Detected Categories

government
48%
healthcare medical
34%
government public service
34%
news media journalism
32%
adult content
27%

Detected Features

Articles

Domain & IP Information

RequestsIP AddressLocationAS Autonomous System
4142.251.38.74United States
AS15169GOOGLE
2142.251.38.67United States
AS15169GOOGLE
2142.251.38.72United States
AS15169GOOGLE
213.33.235.21New York, New York, United States
AS16509AMAZON-02
213.33.235.52New York, New York, United States
AS16509AMAZON-02
2151.101.2.137United States
AS54113FASTLY
213.33.235.110New York, New York, United States
AS16509AMAZON-02
213.33.235.56New York, New York, United States
AS16509AMAZON-02
2151.101.130.137United States
AS54113FASTLY
2216.239.32.36United States
AS15169GOOGLE
2411--

Content Similarity HashesFor malware variant detection

TLSH (Trend Micro Locality Sensitive Hash)

Security-focused

Specialized for malware detection and similarity analysis

T135249C956CF21726B3B2829027A93F94BE82A4C3C86612557BEC4FD10F92D93D9DF107

ssdeep (Context Triggered Piecewise Hashing)

Context-aware

Detects similar content even with modifications

768:w1RWW4DF13kdevunKtfmSwJR1zuCI/pnxgl7joIkLvzxV0mcbiT605BlgIO1efMy:WWbDqBnKy7Beh4O7L2duSbEqjZedSy

sdhash (Similarity Digest Hashing)

High-precision

High-precision similarity detection for forensic analysis

sdhash:3:215965:ACAFgLAECEFAGARDEiCYUABCiIQySAwkEIAACGQnCRIEFAkAjAEC8gCsAEQcMQwgCmgFNhAAIAEACCBgAQACgBQIggQCIGQA

These hashes enable detection of similar websites and malware variants by comparing content similarity even when exact matches aren't found.

Scan History

Scan history not available

Unable to load historical scan data