Security Scan Report: codenote.pages.dev

Submitted: Jan 19, 2026, 5:53:21 AM
Completed: Jan 19, 2026, 5:54:27 AM
Scan Type: public
Status: completed

Summary

This website contacted 2 IPs in 1 country across 2 domains to perform 4 HTTP transactions. The main domain is codenote.pages.dev; its registration age was unavailable.

Submitted URL: https://codenote.pages.dev/ja/posts/update-cloudflare-robots-txt-for-ai-crawlers/

AI Security Verdict

Safe Website

Confidence: 96%

Risk Score: 0

Legitimate informational blog post with no security concerns.

Safety Factors
Established domain with over 5 years of registration
No malicious Indicators of Compromise detected
No credential or payment collection forms
Standard web hosting environment
Domain age information unavailable

Details

Page Title

Changed Cloudflare's "Manage robots.txt" setting from "Content Signals Policy" to "Instruct AI bot traffic with robots.txt" to allow crawling by AI bots

Scan Type

public

Language

🇯🇵 Japanese (80% confidence)

Category

education learning (65%)

Domain Information

The domain name 'codenote.pages.dev' uses the developer-focused generic top-level domain (.dev), with the subdomain 'codenote'. The second-level label 'pages' is 5 characters long, split between 2 vowels and 3 consonants. Word splitting yields 1 word: pages. The median word length is 5 characters. No strong language cues emerged from the frequency lists.
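
The label statistics above can be reproduced with a short sketch. This is an illustration only, not the scanner's actual method: the function names `split_hostname` and `label_stats` are hypothetical, and the split is deliberately naive (last label as TLD, next as second-level label, the rest as subdomain), so a Public Suffix List lookup could classify the name differently.

```python
from typing import Dict, Tuple


def split_hostname(hostname: str) -> Tuple[str, str, str]:
    """Naively split a hostname into (subdomain, second-level label, TLD).

    Assumption: the last label is the TLD and the one before it is the
    second-level label. No Public Suffix List is consulted.
    """
    parts = hostname.lower().rstrip(".").split(".")
    if len(parts) < 2:
        raise ValueError("expected at least two labels")
    tld = parts[-1]
    second_level = parts[-2]
    subdomain = ".".join(parts[:-2])
    return subdomain, second_level, tld


def label_stats(label: str) -> Dict[str, int]:
    """Count the length, vowels, and consonants of a single DNS label."""
    vowels = set("aeiou")
    letters = [c for c in label.lower() if c.isalpha()]
    n_vowels = sum(1 for c in letters if c in vowels)
    return {
        "length": len(label),
        "vowels": n_vowels,
        "consonants": len(letters) - n_vowels,
    }


subdomain, second_level, tld = split_hostname("codenote.pages.dev")
print(subdomain, second_level, tld)
print(label_stats(second_level))
```

For 'pages' this gives the same counts the report states: length 5, with 2 vowels and 3 consonants.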

Screenshot

Security scan screenshot of https://codenote.pages.dev/ja/posts/update-cloudflare-robots-txt-for-ai-crawlers/

Page Load Overview

Total Load Time: 0.40s
HTTP Requests: 5
Domains: 3
Total Size: N/A

Language Analysis

Primary Language

🇯🇵 Japanese
Code: ja
Confidence: 80%
Script: Mixed
Direction: ltr

Detection Details

Language Code: ja
Detection Confidence: 80%
Script Type: Mixed
HTML Lang Attribute: ja
Text Length: 2,082 chars
Detector Agreement: 100%

Website Classification

Primary Category

education learning (65% confidence)
Type: static
Method: ml+structural

All Detected Categories

education learning: 65%
technology software: 59%
blog personal website: 56%
news media journalism: 55%
documentation technical: 49%

Detected Features

OG: article

Domain & IP Information

Requests   IP Address     Location        AS (Autonomous System)
3          172.66.47.71   United States   AS13335 CLOUDFLARENET
2          104.16.79.73   United States   AS13335 CLOUDFLARENET
5 (total)  2 (total)      -               -
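
As a rough cross-check of the AS attribution, the two observed addresses can be tested against CIDR blocks using Python's standard `ipaddress` module. The two ranges below are an assumption: they are blocks I understand Cloudflare to publish for its network, but the authoritative list changes over time and should be fetched from Cloudflare rather than hard-coded. The names `ASSUMED_CLOUDFLARE_RANGES` and `in_any_range` are hypothetical.

```python
import ipaddress
from typing import Iterable

# Assumption: a partial snapshot of Cloudflare-published IPv4 blocks.
# Verify against the current official list before relying on it.
ASSUMED_CLOUDFLARE_RANGES = ["104.16.0.0/13", "172.64.0.0/13"]

# Request counts per IP, taken from the report.
observed = {"172.66.47.71": 3, "104.16.79.73": 2}


def in_any_range(ip: str, cidrs: Iterable[str]) -> bool:
    """Return True if `ip` falls inside at least one CIDR block."""
    addr = ipaddress.ip_address(ip)
    return any(addr in ipaddress.ip_network(cidr) for cidr in cidrs)


for ip, requests in observed.items():
    print(ip, requests, in_any_range(ip, ASSUMED_CLOUDFLARE_RANGES))

# The per-IP counts should add up to the report's request total.
total_requests = sum(observed.values())
print("total requests:", total_requests)
```

A membership match only shows that the address sits in the assumed block. It does not replace an AS lookup against routing data.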

Content Similarity Hashes

For malware variant detection

TLSH (Trend Micro Locality Sensitive Hash)

Security-focused

Specialized for malware detection and similarity analysis

T1D862B66EE2A08536113643EAA80477F9B05E9C07D9933935753EC266B9D2FDCF841E38

ssdeep (Context Triggered Piecewise Hashing)

Context-aware

Detects similar content even with modifications

384:46ues7ch6ulsTA6usshg/dFqsS7cDxAQCMGLMhCAM6uUzFognExay1loJ/s1B/Z5:ca/CqsS7cDxAQ/GLMhCFkLZHtE

sdhash (Similarity Digest Hashing)

High-precision

High-precision similarity detection for forensic analysis

sdhash:3:14729:RQBA4ITigDTIRAErdFIAIKQggAITpA2CQrgYMwEgFYcCUJWpAQH5gpIaEIIigkqApVSNWIo2Cq5AhWKZzBFosIIDEKigEMBI

These hashes enable detection of similar websites and malware variants by comparing content similarity even when exact matches aren't found.
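
To illustrate how such digests are handled before comparison, here is a minimal sketch that splits an ssdeep digest into its three colon-separated fields (block size, chunk signature, double-block-size signature) and checks whether two digests are eligible for comparison. It relies on the usual ssdeep convention that digests are only comparable when their block sizes are equal or differ by a factor of two. The actual similarity score requires the ssdeep library and is not computed here. The names `SsdeepDigest`, `parse_ssdeep`, and `comparable` are hypothetical.

```python
from typing import NamedTuple


class SsdeepDigest(NamedTuple):
    block_size: int
    chunk: str         # signature computed at block_size
    double_chunk: str  # signature computed at 2 * block_size


def parse_ssdeep(digest: str) -> SsdeepDigest:
    """Split 'block_size:chunk:double_chunk' into typed fields."""
    block_size, chunk, double_chunk = digest.split(":", 2)
    return SsdeepDigest(int(block_size), chunk, double_chunk)


def comparable(a: SsdeepDigest, b: SsdeepDigest) -> bool:
    """Digests can be scored only if block sizes match or differ by 2x."""
    return (
        a.block_size == b.block_size
        or a.block_size == 2 * b.block_size
        or b.block_size == 2 * a.block_size
    )


# The ssdeep digest from this report.
report_digest = parse_ssdeep(
    "384:46ues7ch6ulsTA6usshg/dFqsS7cDxAQCMGLMhCAM6uUzFognExay1loJ/s1B/Z5"
    ":ca/CqsS7cDxAQ/GLMhCFkLZHtE"
)
print(report_digest.block_size)
print(comparable(report_digest, report_digest))
```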

Image Hashes

Perceptual Hashes

Average Hash: dfc7e7e7efc7c7c3
Perceptual Hash: b1c63c98cc39ce6c
Difference Hash: 320c4c0e1a9c9e96
Wavelet Hash: 9ac2c6c3c7c3c3c3
Color Hash: #bf40a1

Other Hashes

Crop Resistant: 320c4c0e1a9c9e96
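
Perceptual hashes of this kind are typically compared by Hamming distance, that is, the number of bits that differ between two equal-length hashes. The sketch below shows that comparison on the 64-bit hex values from this report. The function name `hamming_distance` is hypothetical, and any threshold for calling two images "similar" is a tuning choice the report does not specify. Distances are meaningful only between hashes produced by the same algorithm on different images; the cross-algorithm comparison here just demonstrates the arithmetic.

```python
def hamming_distance(hex_a: str, hex_b: str) -> int:
    """Count differing bits between two equal-length hex-encoded hashes."""
    if len(hex_a) != len(hex_b):
        raise ValueError("hashes must have the same length")
    # XOR leaves a 1 bit wherever the two hashes disagree.
    return bin(int(hex_a, 16) ^ int(hex_b, 16)).count("1")


difference_hash = "320c4c0e1a9c9e96"
crop_resistant = "320c4c0e1a9c9e96"
average_hash = "dfc7e7e7efc7c7c3"
wavelet_hash = "9ac2c6c3c7c3c3c3"

# Identical values, as listed in the report, give a distance of 0.
print(hamming_distance(difference_hash, crop_resistant))
# Arithmetic demonstration only: different algorithms, same image.
print(hamming_distance(average_hash, wavelet_hash))
```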

Scan History

Scan history not available