Security Scan Report: diffbot.com

Redirected to: https://www.diffbot.com/

Submitted: Mar 22, 2026, 12:10:22 AMCompleted: Mar 22, 2026, 12:11:37 AMpubliccompleted
Loading additional data...

Summary

This website contacted 4 IPs in 1 country across 6 domains to perform 72 HTTP transactions. The main domain is diffbot.com and was registered NaN years ago.

Submitted URL: https://diffbot.com

Effective URL: https://www.diffbot.com/Redirected

The Cisco Umbrella rank of the primary domain is #234,023 of the top 1 million websites

AI Security Verdict

Safe Website

Confidence: 95%

0
Risk Score

Site appears legitimate with no malicious activity detected.

Safety Factors
Well‑established domain with long registration history
Consistent branding in meta tags and page title
Absence of forms that collect sensitive data
No malicious JavaScript or known malware signatures
No credential exfiltration or cross‑origin credential submissions
Domain age information unavailable

Details

Page Title

Diffbot | Knowledge Graph, AI Web Data Extraction and Crawling

Scan Type

public

Language

🇺🇸

English

(80% confidence)

Category

unknown

(0%)

Domain Information

Domain 'diffbot.com' uses the commercial generic top-level domain (.com) without a subdomain. The core label 'diffbot' covers 7 characters split between 2 vowels and 5 consonants. It segments into two words: diff, bot. Median word length comes out to 3.5 characters. No strong language cues emerged from the frequency lists.

Screenshot

Security scan screenshot of https://diffbot.com

Page Load Overview

4.63s
Total Load Time
72
HTTP Requests
6
Domains
194 KB
Total Size

Language Analysis

Primary Language

🇺🇸English
Code: en
Confidence:80%
Script:Latin
Direction:ltr

Detection Details

Language Code:en
Detection Confidence:80%
Script Type:Latin
HTML Lang Attribute:en
Text Length:3,042 chars
Detector Agreement:100%

Website Classification

Primary Category

unknown0% confidence
Type: dynamic
Method: structural

All Detected Categories

No categories detected

Detected Features

No structural features detected

Domain & IP Information

RequestsIP AddressLocationAS Autonomous System
18104.16.174.226United States
AS13335Cloudflare, Inc.
1864.71.166.35Columbus, Ohio, United States
AS6939Hurricane Electric LLC
18216.218.191.197United States
AS6939Hurricane Electric LLC
18216.218.141.229United States
AS6939Hurricane Electric LLC
724--

Detected Technologies3

Content Similarity HashesFor malware variant detection

TLSH (Trend Micro Locality Sensitive Hash)

Security-focused

Specialized for malware detection and similarity analysis

T1A713327174DC0D7F015322CA3520BB89A0DFCF36D62745EAF2B7064927D7E8258AA366

ssdeep (Context Triggered Piecewise Hashing)

Context-aware

Detects similar content even with modifications

384:Sng/nUFWEsxM11t6cC7kuQ9adFlztiDcfjVK7ElrwtO6frwtOWprwtO9wKrwtO/v:ig/UFWEYWcfTEpPKFm1hXIbg+X5rOjW

sdhash (Similarity Digest Hashing)

High-precision

High-precision similarity detection for forensic analysis

sdhash:3:41763:joCNgKSGCRdgARQaAHHeCPgoMVawExgRCjRYEPIEk0YJFwZAABICIACksRmkCgUYCgCDIVABBx8BHQAJDFEuwAoJNBDquRHw

These hashes enable detection of similar websites and malware variants by comparing content similarity even when exact matches aren't found.

Image Hashes

Perceptual Hashes

Average Hash:bfe7c7c7c38383e7
Perceptual Hash:b1d9c72d981f9862
Difference Hash:78cc9e9e8e164e8c
Wavelet Hash:9ec2c3c3c38383e7
Color Hash:#623a78

Other Hashes

Crop Resistant:78cc9e9e8e164e8c

Scan History

Scan history not available

Unable to load historical scan data