SCAN INCOMPLETE - LIMITED DATA COLLECTED

There were problems collecting data from this website

The website may be blocking automated browsers (bot protection)
The site may be using geo-blocking or rate limiting
Network connectivity issues may have prevented access

LIMITED DATA

Note: There were problems collecting data during this scan, and some information may be missing or incomplete. The security analysis below is based on limited information and may not be accurate. Consider trying the scan again.

Security Scan Report: www.sanjoseca.gov

Site favicon
Submitted: Dec 10, 2025, 9:21:38 PMCompleted: Dec 10, 2025, 9:22:59 PMpubliccompleted
Loading additional data...

Summary

This website contacted 4 IPs in 1 country across 1 domain to perform 4 HTTP transactions. The main domain is sanjoseca.gov and was registered NaN years ago.

Submitted URL: https://www.sanjoseca.gov/

The Cisco Umbrella rank of the primary domain is #307,973 of the top 1 million websites

AI Security Verdict

Safe Website

Confidence: 95%

0
Risk Score

Site appears legitimate; likely a temporary connectivity issue.

Safety Factors
Official .gov top‑level domain
Long registration history
Well‑established government domain with no suspicious activity
Domain age information unavailable

Details

Page Title

www.sanjoseca.gov

Scan Type

public

Language

🇺🇸

English

(80% confidence)

Category

technology software

(85%)

Domain Information

You're looking at domain 'www.sanjoseca.gov' on the United States government-restricted top-level domain (.gov) and includes subdomain 'www'. The second-level label 'sanjoseca' is 9 characters long with four vowels and 5 consonants. Segmentation suggests three words: sanjo, sec, a. Median word length comes out to three characters. No strong language cues emerged from the frequency lists.

Screenshot

Security scan screenshot of https://www.sanjoseca.gov/

Page Load Overview

13.23s
Total Load Time
4
HTTP Requests
1
Domains
N/A
Total Size

Language Analysis

Primary Language

🇺🇸English
Code: en
Confidence:80%
Script:Latin
Direction:ltr

Detection Details

Language Code:en
Detection Confidence:80%
Script Type:Latin
HTML Lang Attribute:en
Text Length:747 chars
Detector Agreement:100%

Website Classification

Primary Category

technology software85% confidence
Type: static
Method: ml+structural

All Detected Categories

technology software
85%
documentation technical
63%
government public service
56%
adult content
54%
government
48%

Detected Features

No structural features detected

Domain & IP Information

RequestsIP AddressLocationAS Autonomous System
12.20.142.193Frankfurt am Main, Hesse, Germany
AS20940Akamai International B.V.
12.20.142.42Frankfurt am Main, Hesse, Germany
AS20940Akamai International B.V.
12a02:26f0:480:23::1726:6291Frankfurt am Main, Hesse, Germany
AS20940Akamai International B.V.
12a02:26f0:480:23::1726:62a4Frankfurt am Main, Hesse, Germany
AS20940Akamai International B.V.
44--

Content Similarity HashesFor malware variant detection

TLSH (Trend Micro Locality Sensitive Hash)

Security-focused

Specialized for malware detection and similarity analysis

T194048E77329A063986558498E05B830D9F20B543F506C9BC79BCBAD8BFDED06107BB78

ssdeep (Context Triggered Piecewise Hashing)

Context-aware

Detects similar content even with modifications

3072:ZfQho9PKBb9Js3q9Jzbs6tlg3SBKwdQWgceIszE2bMy8Oldd:OhoC9JSqzzbs6o3Sj3gcrs42eA/

sdhash (Similarity Digest Hashing)

High-precision

High-precision similarity detection for forensic analysis

sdhash:3:187142:+gFCCjowUBAGKApsYMIAHooQYAhgIJhIegphBjmAPSWxFQglgKETDncA96ALACQpxEEBHSgHL9QDAAT0IKQACHCY1JABAZgU

These hashes enable detection of similar websites and malware variants by comparing content similarity even when exact matches aren't found.

Image Hashes

Perceptual Hashes

Average Hash:ffc7c7c3d3ffffff
Perceptual Hash:b1339acccc93b364
Difference Hash:0018181616000000
Wavelet Hash:fcdcc4c4c0f8f0f0
Color Hash:#1f9327

Other Hashes

Crop Resistant:0018181616000000

Scan History

Scan history not available

Unable to load historical scan data