Security Scan Report: toloka.ai

Submitted: Oct 26, 2025, 8:24:36 PM
Completed: Oct 26, 2025, 8:28:02 PM
Scan Type: public
Status: completed

Summary

This website contacted 99 IPs in 7 countries across 30 domains to perform 193 HTTP transactions. The main domain is toloka.ai; its registration age could not be determined.

Submitted URL: https://toloka.ai/blog/agi-vs-other-ai/

AI Security Verdict

Safe Website

Confidence: 95%

Risk Score: 0

No security concerns detected; the site appears legitimate.

Safety Factors
Well‑established domain (>5 years)
No malicious Indicators of Compromise
No suspicious forms or data collection
Domain age information unavailable

Details

Page Title

AGI vs. other types of AI: what's the difference?

Scan Type

public

Language

🇺🇸

English

(80% confidence)

Category

adult content

(52%)

Domain Information

The domain 'toloka.ai' sits under .ai, the country-code top-level domain of Anguilla, and has no subdomain. The core label 'toloka' has 6 characters: 3 vowels and 3 consonants. It splits into 2 words, 'to' and 'loka', for an average of 3 characters per word. 'to' is most common in Czech usage, with secondary signals in Slovak and Polish.
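The label statistics above can be reproduced with a few lines of Python. This is a minimal sketch, not the scanner's actual method: the function name `label_stats` is hypothetical, the word split is taken from the report rather than computed, and only a, e, i, o, u are treated as vowels.

```python
# Assumption: vowels are limited to a, e, i, o, u.
VOWELS = set("aeiou")


def label_stats(label, words):
    """Return basic character statistics for a domain label.

    `words` is the word split reported by the scanner; this sketch
    does not attempt to derive the split itself.
    """
    letters = [c for c in label.lower() if c.isalpha()]
    vowels = sum(1 for c in letters if c in VOWELS)
    return {
        "length": len(label),
        "vowels": vowels,
        "consonants": len(letters) - vowels,
        "avg_word_length": sum(len(w) for w in words) / len(words),
    }


stats = label_stats("toloka", ["to", "loka"])
print(stats)
```

For 'toloka' this gives 6 characters, 3 vowels, 3 consonants, and an average word length of 3.0, matching the figures in the report.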

Screenshot

Security scan screenshot of https://toloka.ai/blog/agi-vs-other-ai/

Page Load Overview

Total Load Time: 10.48s
HTTP Requests: 193
Domains: 30
Total Size: 1.1 MB

Language Analysis

Primary Language

🇺🇸 English
Code: en
Confidence: 80%
Script: Latin
Direction: ltr

Detection Details

Language Code: en
Detection Confidence: 80%
Script Type: Latin
HTML Lang Attribute: en
Text Length: 11,979 chars
Detector Agreement: 100%

Website Classification

Primary Category

adult content (52% confidence)
Type: spa
Method: ml+structural

All Detected Categories

adult content: 52%
technology software: 48%
government public service: 38%
education learning: 33%
documentation technical: 30%

Detected Features

OG: website

Domain & IP Information

Requests  IP Address       Location                            Autonomous System
133       18.173.205.84    United States                       AS16509 AMAZON-02
9         13.107.213.44    United States                       AS8075 MICROSOFT-CORP-MSN-AS-BLOCK
5         52.242.103.142   Boydton, Virginia, United States    AS8075 MICROSOFT-CORP-MSN-AS-BLOCK
5         3.174.46.52      United States                       AS16509 AMAZON-02
3         150.171.28.10    United States                       AS8075 MICROSOFT-CORP-MSN-AS-BLOCK
3         157.240.0.35     Frankfurt am Main, Hesse, Germany   AS32934 FACEBOOK
3         150.171.22.12    United States                       AS8075 MICROSOFT-CORP-MSN-AS-BLOCK
3         13.226.244.8     United States                       AS16509 AMAZON-02
3         142.250.186.72   United States                       AS15169 GOOGLE
2         146.75.89.140    Lisbon, Lisbon, Portugal            AS54113 FASTLY

Total: 193 requests across 99 IPs
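To see how the listed requests are distributed across networks, the rows above can be grouped by autonomous system. The sketch below is illustrative only: it reads each row as a request count followed by an IPv4 address, covers only the ten rows shown (not all 99 IPs), and the function name `requests_per_as` is hypothetical.

```python
from collections import Counter
import ipaddress

# (requests, IP address, AS number, AS name), transcribed from the rows above.
ROWS = [
    (133, "18.173.205.84", "AS16509", "AMAZON-02"),
    (9, "13.107.213.44", "AS8075", "MICROSOFT-CORP-MSN-AS-BLOCK"),
    (5, "52.242.103.142", "AS8075", "MICROSOFT-CORP-MSN-AS-BLOCK"),
    (5, "3.174.46.52", "AS16509", "AMAZON-02"),
    (3, "150.171.28.10", "AS8075", "MICROSOFT-CORP-MSN-AS-BLOCK"),
    (3, "157.240.0.35", "AS32934", "FACEBOOK"),
    (3, "150.171.22.12", "AS8075", "MICROSOFT-CORP-MSN-AS-BLOCK"),
    (3, "13.226.244.8", "AS16509", "AMAZON-02"),
    (3, "142.250.186.72", "AS15169", "GOOGLE"),
    (2, "146.75.89.140", "AS54113", "FASTLY"),
]


def requests_per_as(rows):
    """Sum request counts per autonomous system."""
    totals = Counter()
    for count, ip, asn, name in rows:
        ipaddress.ip_address(ip)  # raises ValueError on a malformed address
        totals[f"{asn} {name}"] += count
    return totals


totals = requests_per_as(ROWS)
for network, count in totals.most_common():
    print(f"{count:4d}  {network}")
```

The ten listed rows account for 169 of the 193 requests; the remainder belongs to IPs not shown in this excerpt.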

Content Similarity Hashes

For malware variant detection

TLSH (Trend Micro Locality Sensitive Hash)

Security-focused

Specialized for malware detection and similarity analysis

T1A974D553A259F5106CE32ABEF32DAA183C155102FF33C6DB61EC456F95CACE8129276C

ssdeep (Context Triggered Piecewise Hashing)

Context-aware

Detects similar content even with modifications

6144:srC3V8RlpSRSx9wCQhtg7YVvtu0KhpdSbh17lp80hO3G1Hh8zQZBX+Wtip:fV8RlpSRSxhQDg7YVvtu0KHdSbr7lp8N

sdhash (Similarity Digest Hashing)

High-precision

High-precision similarity detection for forensic analysis

sdhash:3:350399:QaoNSBAAkRtAoBhHVACAJuUPkAPgUZaTDoOIiAAgBrBC1yvrVOGCmFuigAoNgwC5QSe6bCAqRAJUCZAgQAM3gEIwTvgAQEWK

These hashes enable detection of similar websites and malware variants by comparing content similarity even when exact matches aren't found.
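As one illustration of how such a comparison starts, an ssdeep digest has the form `blocksize:chunk:double_chunk`, and two digests are only worth comparing when their block sizes are equal or differ by a factor of two. The sketch below parses the ssdeep value from this report and checks that precondition; it does not compute a similarity score (that would need an ssdeep library), and the helper names `parse_ssdeep` and `comparable` are hypothetical.

```python
from typing import NamedTuple


class SsdeepDigest(NamedTuple):
    block_size: int
    chunk: str         # signature computed at block_size
    double_chunk: str  # signature computed at 2 * block_size


def parse_ssdeep(digest: str) -> SsdeepDigest:
    """Split an ssdeep digest into its three colon-separated fields."""
    block_size, chunk, double_chunk = digest.split(":", 2)
    return SsdeepDigest(int(block_size), chunk, double_chunk)


def comparable(block_a: int, block_b: int) -> bool:
    """Digests can be scored only if block sizes match or differ by 2x."""
    return block_a == block_b or block_a == 2 * block_b or block_b == 2 * block_a


# ssdeep value as shown in this report.
REPORT_DIGEST = (
    "6144:srC3V8RlpSRSx9wCQhtg7YVvtu0KhpdSbh17lp80hO3G1Hh8zQZBX+Wtip:"
    "fV8RlpSRSxhQDg7YVvtu0KHdSbr7lp8N"
)

parsed = parse_ssdeep(REPORT_DIGEST)
print(parsed.block_size)
print(comparable(parsed.block_size, 12288))
```

A digest from another page with block size 3072, 6144, or 12288 could be scored against this one; any other block size would be skipped.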

Image Hashes

Perceptual Hashes

Average Hash: c3c3e783838383c3
Perceptual Hash: adc3873cda088f6c
Difference Hash: 9e8e4f1717161616
Wavelet Hash: c7c7e783838383c3
Color Hash: #9479d2
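Perceptual hashes like these are typically compared by Hamming distance, the number of bits that differ between two 64-bit values; a small distance suggests visually similar images. The sketch below shows the calculation with a hypothetical helper, `hamming_distance`. In practice the same hash type would be compared across two different screenshots; the two values from this report are used here only to exercise the function.

```python
def hamming_distance(hash_a: str, hash_b: str) -> int:
    """Count differing bits between two equal-length hex-encoded hashes."""
    if len(hash_a) != len(hash_b):
        raise ValueError("hashes must have the same length")
    return bin(int(hash_a, 16) ^ int(hash_b, 16)).count("1")


# Values taken from this report.
AVERAGE_HASH = "c3c3e783838383c3"
WAVELET_HASH = "c7c7e783838383c3"

distance = hamming_distance(AVERAGE_HASH, WAVELET_HASH)
print(distance)
```

The two values differ in 2 of 64 bits. What distance counts as "similar" depends on the application and is not specified by this report.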

Scan History

Scan history not available