SCAN INCOMPLETE - LIMITED DATA COLLECTED

There were problems collecting data from this website

The website may be blocking automated browsers (bot protection)
The site may be using geo-blocking or rate limiting
Network connectivity issues may have prevented access

Note: There were problems collecting data during this scan, and some information may be missing or incomplete. The security analysis below is based on limited information and may not be accurate. Consider trying the scan again.

Security Scan Report: www.sanjoseca.gov

Submitted: Nov 13, 2025, 5:45:35 AM
Completed: Nov 13, 2025, 5:47:03 AM
Scan Type: public
Status: completed

Summary

This website contacted 5 IPs across 1 domain to perform 2 HTTP transactions; no country could be determined for any of the IPs. The main domain is sanjoseca.gov; its registration date was unavailable.

Submitted URL: https://www.sanjoseca.gov/

AI Security Verdict

Safe Website

Confidence: 95%

Risk Score: 0

Legitimate government site; no security concerns detected.

Safety Factors
Official government domain (.gov)
Well-established domain age
No malicious Indicators of Compromise
No credential or payment collection
Domain age information unavailable

Details

Page Title: Access Denied
Scan Type: public
Language: 🇺🇸 English (56% confidence)
Category: government (48% confidence)

Domain Information

The hostname 'www.sanjoseca.gov' is registered under .gov, the top-level domain restricted to United States government entities, and is served from the subdomain 'www'. The second-level label 'sanjoseca' is 9 characters long, with four vowels and five consonants. Automated word splitting yields three tokens (sanjo, sec, a), with a median length of three characters. The frequency lists gave no strong language cues.
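
The label statistics quoted above can be reproduced with a few lines of Python. This is only a sketch of the arithmetic, not the scanner's actual implementation; the function names `label_stats` and `median_word_length` are hypothetical, and the token list is copied from the report rather than computed by a real word splitter.

```python
from statistics import median

VOWELS = set("aeiou")

def label_stats(label: str) -> dict:
    """Count length, vowels, and consonants in a domain label (hypothetical helper)."""
    letters = [ch for ch in label.lower() if ch.isalpha()]
    vowels = sum(ch in VOWELS for ch in letters)
    return {
        "length": len(label),
        "vowels": vowels,
        "consonants": len(letters) - vowels,
    }

def median_word_length(words):
    """Median of the token lengths (hypothetical helper)."""
    return median(len(w) for w in words)

# Tokens as reported by the scan; the splitting step itself is not reproduced here.
tokens = ["sanjo", "sec", "a"]

stats = label_stats("sanjoseca")
print(stats)                       # {'length': 9, 'vowels': 4, 'consonants': 5}
print(median_word_length(tokens))  # 3
```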

Screenshot

Security scan screenshot of https://www.sanjoseca.gov/

Page Load Overview

Total Load Time: 13.74s
HTTP Requests: 2
Domains: 1
Total Size: N/A

Language Analysis

Primary Language

🇺🇸 English
Code: en
Confidence: 56%
Script: Latin
Direction: ltr

Detection Details

Language Code: en
Detection Confidence: 56%
Script Type: Latin
Text Length: 162 chars
Detector Agreement: 100%

Website Classification

Primary Category

government (48% confidence)
Type: static
Method: ml+structural

All Detected Categories

government: 48%
technology software: 35%
documentation technical: 33%

Detected Features

No structural features detected

Domain & IP Information

Requests  IP Address                Location  AS (Autonomous System)
2         23.50.131.156             Unknown   Unknown
0         23.216.134.220            Unknown   Unknown
0         23.216.134.219            Unknown   Unknown
0         2a02:26f0:7100::210:138   Unknown   Unknown
0         2a02:26f0:7100::210:111   Unknown   Unknown
Total: 2 requests, 5 IPs
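
The contacted addresses are a mix of IPv4 and IPv6. As a rough illustration of how such a list could be validated and grouped, the sketch below uses Python's standard `ipaddress` module; the helper name `count_by_version` is an assumption for this example, and the addresses are copied from the report.

```python
import ipaddress
from collections import Counter

# Addresses as listed in the report.
contacted = [
    "23.50.131.156",
    "23.216.134.220",
    "23.216.134.219",
    "2a02:26f0:7100::210:138",
    "2a02:26f0:7100::210:111",
]

def count_by_version(addresses):
    """Parse each address (raises ValueError if malformed) and tally by IP version."""
    return Counter(f"IPv{ipaddress.ip_address(a).version}" for a in addresses)

counts = count_by_version(contacted)
print(dict(counts))  # {'IPv4': 3, 'IPv6': 2}
```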

Content Similarity Hashes

For malware variant detection

TLSH (Trend Micro Locality Sensitive Hash)

Security-focused

Specialized for malware detection and similarity analysis

T166E0E78FF00B000A0E113DD218713310F7BA34B0515517C4930BD4738C07EE4D905479

ssdeep (Context Triggered Piecewise Hashing)

Context-aware

Detects similar content even with modifications

6:qzxwyEr6VPWxxdGztAc41AHDFWgxkU5n+xM4a4dnkU5nnKqz:kxVRpedg6AHxCSZ4dnTj

sdhash (Similarity Digest Hashing)

High-precision

High-precision similarity detection for forensic analysis

sdhash:1:0:6a8783873baa44fc316f1fb22abc44e7

These hashes enable detection of similar websites and malware variants by comparing content similarity even when exact matches aren't found.
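
As a small illustration of how one of these digests is structured, an ssdeep digest has the form `blocksize:chunk:double_chunk`, and ssdeep only scores two digests against each other when their block sizes are equal or differ by a factor of two. The sketch below only splits the digest and checks that precondition; it does not compute a similarity score, and the names `SsdeepDigest`, `parse_ssdeep`, and `comparable` are hypothetical.

```python
from typing import NamedTuple

class SsdeepDigest(NamedTuple):
    block_size: int
    chunk: str         # signature computed at block_size
    double_chunk: str  # signature computed at 2 * block_size

def parse_ssdeep(digest: str) -> SsdeepDigest:
    """Split an ssdeep digest into its three colon-separated fields."""
    block_size, chunk, double_chunk = digest.split(":", 2)
    return SsdeepDigest(int(block_size), chunk, double_chunk)

def comparable(a: SsdeepDigest, b: SsdeepDigest) -> bool:
    """True when the block sizes are equal or one is double the other."""
    return (
        a.block_size == b.block_size
        or a.block_size == 2 * b.block_size
        or b.block_size == 2 * a.block_size
    )

# Digest copied from the report.
digest = parse_ssdeep(
    "6:qzxwyEr6VPWxxdGztAc41AHDFWgxkU5n+xM4a4dnkU5nnKqz:kxVRpedg6AHxCSZ4dnTj"
)
print(digest.block_size)  # 6
```

Computing an actual similarity score would require an ssdeep implementation, which is outside the scope of this sketch.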

Image Hashes

Perceptual Hashes

Average Hash: 1f3fffffffffffff
Perceptual Hash: 870707070707fbf9
Difference Hash: e040000000000000
Wavelet Hash: 1030f0f0f0f0f0f0
Color Hash: #e06caa

Other Hashes

Crop Resistant: e040000000000000
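
Image hashes of this kind are usually compared by Hamming distance: the number of bits that differ between two hashes of the same type, where a small distance suggests visually similar screenshots. The sketch below assumes both inputs are hex strings of equal length produced by the same algorithm; the function name `hamming_distance` and the second hash value are hypothetical and used only for illustration.

```python
def hamming_distance(hash_a: str, hash_b: str) -> int:
    """Count differing bits between two equal-length hex-encoded hashes."""
    if len(hash_a) != len(hash_b):
        raise ValueError("hashes must have the same length")
    return bin(int(hash_a, 16) ^ int(hash_b, 16)).count("1")

# Difference Hash from the report.
report_dhash = "e040000000000000"

# Hypothetical hash of another screenshot, differing in the two lowest bits.
other_dhash = "e040000000000003"

print(hamming_distance(report_dhash, report_dhash))  # 0
print(hamming_distance(report_dhash, other_dhash))   # 2
```

The threshold below which two screenshots count as similar depends on the hash type and the use case, so it is left out here.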
