Security Scan Report: catalog.data.gov

Submitted: Nov 2, 2025, 9:01:56 PM
Completed: Nov 2, 2025, 9:03:35 PM
Scan type: public
Status: completed

Summary

This website contacted 45 IPs in 2 countries across 8 domains to perform 54 HTTP transactions. The main domain is catalog.data.gov; its registration age could not be determined.

Submitted URL: https://catalog.data.gov/dataset/?tags=new-york-lottery

AI Security Verdict

Safe Website

Confidence: 95%

Risk Score: 0

Legitimate US government data catalog site.

Safety Factors
Official .gov domain
HTTPS connection
No credential or payment forms present
Well‑established domain with minimal risk
Domain age information unavailable

Details

Page Title

Dataset - Catalog

Scan Type

public

Language

🇺🇸 English (80% confidence)

Category

government public service (59% confidence)

Domain Information

The domain 'catalog.data.gov' uses the .gov top-level domain, which is restricted to United States government use, with the subdomain 'catalog'. The label 'data' has 4 characters: 2 vowels and 2 consonants. It segments into a single word, 'data', which the scanner reports as most common in Romanian usage and also present in Italian and Malay contexts.
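
The breakdown above can be reproduced with a few lines of Python. This is a minimal sketch, not the scanner's actual method: the function name `describe_hostname` is hypothetical, and it assumes a plain split on dots, which works for a name like this one but does not consult the Public Suffix List.

```python
def describe_hostname(hostname: str) -> dict:
    """Split a hostname into labels and count letters in the label left of the TLD."""
    labels = hostname.lower().split(".")
    if len(labels) < 2:
        raise ValueError("expected at least a name and a top-level domain")
    tld = labels[-1]                    # e.g. "gov"
    name = labels[-2]                   # label directly left of the TLD, e.g. "data"
    subdomain = ".".join(labels[:-2])   # everything further left, e.g. "catalog"
    vowels = sum(1 for ch in name if ch in "aeiou")
    consonants = sum(1 for ch in name if ch.isalpha() and ch not in "aeiou")
    return {
        "tld": tld,
        "name": name,
        "subdomain": subdomain,
        "length": len(name),
        "vowels": vowels,
        "consonants": consonants,
    }


info = describe_hostname("catalog.data.gov")
print(info)
```

For 'catalog.data.gov' this yields the same counts the report states: 4 characters, 2 vowels, 2 consonants.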

Screenshot

Security scan screenshot of https://catalog.data.gov/dataset/?tags=new-york-lottery

Page Load Overview

Total Load Time: 54.20s
HTTP Requests: 54
Domains: 8
Total Size: 1.0 MB

Language Analysis

Primary Language

🇺🇸 English
Code: en
Confidence: 80%
Script: Latin
Direction: ltr

Detection Details

Language Code: en
Detection Confidence: 80%
Script Type: Latin
HTML Lang Attribute: en
Text Length: 4,674 chars
Detector Agreement: 75%

Website Classification

Primary Category

government public service (59% confidence)
Type: dynamic
Method: ml+structural

All Detected Categories

government public service: 59%
gambling betting: 48%
government: 48%
documentation technical: 41%
education learning: 35%

Detected Features

Search
OG: website

Domain & IP Information

Requests  IP Address       Location                         Autonomous System (AS)
10        216.58.206.72    United States                    AS15169 GOOGLE
1         136.18.0.178     Boardman, Oregon, United States  AS8987 Amazon Data Services Ireland Ltd
1         99.84.152.56     United States                    AS16509 AMAZON-02
1         162.247.243.39   United States                    AS54113 FASTLY
1         136.18.0.180     Boardman, Oregon, United States  AS8987 Amazon Data Services Ireland Ltd
1         52.222.236.63    United States                    AS16509 AMAZON-02
1         104.18.10.207    United States                    AS13335 CLOUDFLARENET
1         216.239.32.36    United States                    AS15169 GOOGLE
1         52.222.236.58    United States                    AS16509 AMAZON-02
1         162.247.241.128  United States                    AS23467 NEWRELIC-AS-1
Total: 54 requests across 45 IPs
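
To see how the listed requests are distributed across networks, the rows can be grouped by autonomous system. The sketch below is illustrative only. It assumes the request counts and IP addresses in the listing separate as 10 requests for 216.58.206.72 and 1 request for each of the other addresses, and the helper name `requests_per_as` is hypothetical. Only the ten listed rows are included, so the sum is lower than the 54 total requests.

```python
import ipaddress
from collections import Counter

# (requests, ip, asn, as_name) for the ten rows listed in the report.
# Assumption: each fused "requests + IP" cell splits as shown here.
ROWS = [
    (10, "216.58.206.72", "AS15169", "GOOGLE"),
    (1, "136.18.0.178", "AS8987", "Amazon Data Services Ireland Ltd"),
    (1, "99.84.152.56", "AS16509", "AMAZON-02"),
    (1, "162.247.243.39", "AS54113", "FASTLY"),
    (1, "136.18.0.180", "AS8987", "Amazon Data Services Ireland Ltd"),
    (1, "52.222.236.63", "AS16509", "AMAZON-02"),
    (1, "104.18.10.207", "AS13335", "CLOUDFLARENET"),
    (1, "216.239.32.36", "AS15169", "GOOGLE"),
    (1, "52.222.236.58", "AS16509", "AMAZON-02"),
    (1, "162.247.241.128", "AS23467", "NEWRELIC-AS-1"),
]


def requests_per_as(rows):
    """Sum the request counts for each autonomous system number."""
    totals = Counter()
    for requests, ip, asn, _name in rows:
        ipaddress.ip_address(ip)  # raises ValueError if the address is malformed
        totals[asn] += requests
    return totals


totals = requests_per_as(ROWS)
for asn, count in totals.most_common():
    print(asn, count)
```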

Content Similarity Hashes

For malware variant detection

TLSH (Trend Micro Locality Sensitive Hash)

Security-focused

Specialized for malware detection and similarity analysis

T148D318E6B2F03436426355F2E2796B09AAA27117F0455C00B2BD5EF42FD6E84AD2373C

ssdeep (Context Triggered Piecewise Hashing)

Context-aware

Detects similar content even with modifications

1536:6fEYzhK8mKVob5KYgpDWlRkTcs0kz/cRV33nT3I/:YEyhMKVUvgIlR5sZz/cRV33nT3I/

sdhash (Similarity Digest Hashing)

High-precision

High-precision similarity detection for forensic analysis

sdhash:3:138778:wUtCIKwsoYgBKJZZRDRmodhpuoMIhXAK0sDoxAEYgOACIHxUGiUMjAxpNBEBBgECDBAdiQGPKBYXxkAAeUkwgllmswxoTEIl

These hashes enable detection of similar websites and malware variants by comparing content similarity even when exact matches aren't found.
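
As a small illustration of how such a comparison starts, an ssdeep digest has the form `block_size:chunk:double_chunk`, and two digests are only meaningfully comparable when their block sizes are equal or differ by a factor of two. The sketch below parses the digest from this report and checks that precondition. The helper names and the second and third digests are made up for illustration, and the code does not compute a similarity score, which would require a dedicated ssdeep implementation.

```python
def parse_ssdeep(digest: str):
    """Split an ssdeep digest into (block_size, chunk, double_chunk)."""
    block_size, chunk, double_chunk = digest.split(":", 2)
    return int(block_size), chunk, double_chunk


def comparable(digest_a: str, digest_b: str) -> bool:
    """Return True if the block sizes are equal or differ by a factor of two."""
    size_a = parse_ssdeep(digest_a)[0]
    size_b = parse_ssdeep(digest_b)[0]
    return size_a == size_b or size_a == 2 * size_b or size_b == 2 * size_a


# Digest taken from this report.
report_digest = (
    "1536:6fEYzhK8mKVob5KYgpDWlRkTcs0kz/cRV33nT3I/"
    ":YEyhMKVUvgIlR5sZz/cRV33nT3I/"
)
# Hypothetical digests, used only to exercise the block-size check.
other_close = "3072:abc:def"
other_far = "6144:abc:def"

print(parse_ssdeep(report_digest)[0])
print(comparable(report_digest, other_close))
print(comparable(report_digest, other_far))
```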

Image Hashes

Perceptual Hashes

Average Hash: 0000ffffefefefef
Perceptual Hash: b31e0623858dcded
Difference Hash: fc4d320a0a1a0a0a
Wavelet Hash: 0400dfcfc3c3e3e3
Color Hash: #c5879e
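
Hashes like the 16-digit hexadecimal values above are commonly compared by Hamming distance, the number of bits that differ between two hashes of the same kind; a small distance suggests visually similar screenshots. The following is a minimal sketch under that assumption, not a description of how this scanner compares images. The second hash is invented for the example, and the color hash is not comparable this way.

```python
def hamming_distance(hash_a: str, hash_b: str) -> int:
    """Count differing bits between two equal-length hexadecimal hashes."""
    if len(hash_a) != len(hash_b):
        raise ValueError("hashes must have the same length")
    return bin(int(hash_a, 16) ^ int(hash_b, 16)).count("1")


report_phash = "b31e0623858dcded"   # Perceptual Hash from this report
similar_phash = "b31e0623858dcdec"  # hypothetical hash differing in one bit

distance = hamming_distance(report_phash, similar_phash)
print(distance)  # 1
```

What counts as "similar" is a threshold the analyst chooses; it depends on the hash type and on how many false matches are tolerable.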

Scan History

Scan history not available

Unable to load historical scan data