Security Scan Report: webcrawler-md.pages.dev

Site favicon
Submitted: May 10, 2026, 4:13:11 AMCompleted: May 10, 2026, 4:15:08 AMpubliccompleted
Loading additional data...

Summary

This website contacted 3 IPs in 1 country across 3 domains to perform 4 HTTP transactions. The main domain is webcrawler-md.pages.dev and was registered NaN years ago.

Submitted URL: https://webcrawler-md.pages.dev/

AI Security Verdict

Low Risk

Confidence: 80%

2
Risk Score

Low risk site; no malicious activity detected, but unknown subdomain age and high JS obfuscation suggest cautious use.

Risk Factors
Unranked domain
Unknown subdomain creation date
High JavaScript obfuscation score
Safety Factors
No credential or payment forms present
No malicious Indicators of Compromise detected
No JavaScript malware YARA matches
No network IDS alerts
Domain age information unavailable

Details

Page Title

Web Crawler to Markdown

Scan Type

public

Language

🇺🇸

English

(80% confidence)

Category

technology software

(52%)

Domain Information

Within the developer-focused generic top-level domain (.dev), 'webcrawler-md.pages.dev' is registered, featuring subdomain 'webcrawler-md'. The registrable portion 'pages' spans 5 characters with 2 vowels and 3 consonants. Tokenizing the label suggests 1 word: pages. No strong language cues emerged from the frequency lists.

Screenshot

Security scan screenshot of https://webcrawler-md.pages.dev/

Page Load Overview

0.74s
Total Load Time
6
HTTP Requests
3
Domains
165 KB
Total Size

Language Analysis

Primary Language

🇺🇸English
Code: en
Confidence:80%
Script:Latin
Direction:ltr

Detection Details

Language Code:en
Detection Confidence:80%
Script Type:Latin
HTML Lang Attribute:en
Text Length:985 chars
Detector Agreement:67%

Website Classification

Primary Category

technology software52% confidence
Type: static
Method: ml+structural

All Detected Categories

technology software
52%
documentation technical
35%
news media journalism
30%
government public service
28%
healthcare medical
27%

Detected Features

No structural features detected

Domain & IP Information

RequestsIP AddressLocationAS Autonomous System
2172.66.44.104United States
AS13335Cloudflare, Inc.
2104.26.2.143United States
AS13335Cloudflare, Inc.
2104.17.24.14United States
AS13335Cloudflare, Inc.
63--

Content Similarity HashesFor malware variant detection

TLSH (Trend Micro Locality Sensitive Hash)

Security-focused

Specialized for malware detection and similarity analysis

T18C03E92971F100376DA3C0F6F7DBB558B526A0C3EA1AC9A6BDCD4304AFC66B28553784

ssdeep (Context Triggered Piecewise Hashing)

Context-aware

Detects similar content even with modifications

384:7F9pagRhL7fUPzR9mzxfEVR96nepeiwPdeCKZ562tAy6Rzcrr1jSKo4:7F9pxhnfuR9ax4HqMh6Rzcrhdo4

sdhash (Similarity Digest Hashing)

High-precision

High-precision similarity detection for forensic analysis

sdhash:3:39559:IQAVMIiJoEQCaBgAYUJQwlXAXIREpsCGeA+BkEQRSNkgJXB8GsADxhi5CgwAAENKEQAACEYDWiQYqCAGF1eIBdAgAJIgIETK

These hashes enable detection of similar websites and malware variants by comparing content similarity even when exact matches aren't found.

Image Hashes

Perceptual Hashes

Average Hash:c3c3ffc3ffffffff
Perceptual Hash:f064656565649b9b
Difference Hash:8616199619080600
Wavelet Hash:c3c3ffc0ccfcc0c0
Color Hash:#79d2af

Scan History

Scan history not available

Unable to load historical scan data