LLM

Sockpuppeting: How a Single Line Can Bypass LLM Safety Guardrails

10 de abril de 2026

A jailbreak through sockpuppeting can be easily done as it requires no special tools nor optimization. It only takes a faulty prefill feature, and the gates are open. We tested 11 LLM-powered assistants against sockpuppeting and found varying levels of robustness across today’s leading LLMs.

Guarding LLMs With a Layered Prompt Injection Representation
31 de marzo de 2026
TrendAI™ Research has developed a model training procedure for learning an essential representation of prompt injection attacks. The resulting prompt representation exhibits approximately linear separability, allowing the specialized, small-scale classifier trained on features derived from the representation to achieve high classification performance.
Leer más
Stay Ahead of AI Threats: Secure LLM Applications With Trend Vision One
14 de octubre de 2025
Trend Vision One™ tackles 9 of OWASP’s Top 10 LLM vulnerabilities, offering comprehensive protection against prompt injection, data leakage, AI supply chain risks, and other critical flaws.
Leer más
LLM as a Judge: Evaluating Accuracy in LLM Security Scans
04 de agosto de 2025
As large language models (LLMs) become more capable and widely adopted, the risk of unintended or adversarial outputs grows, especially within a security-sensitive context. To identify and mitigate such risks, Trend Micro researchers ran LLM security scans that simulate adversarial attacks.
Leer más
The Mirage of AI Programming: Hallucinations and Code Integrity
25 de julio de 2024
The adoption of large language models (LLMs) and Generative Pre-trained Transformers (GPTs), such as ChatGPT, by leading firms like Microsoft, Nuance, Mix and Google CCAI Insights, drives the industry towards a series of transformative changes. As the use of these new technologies becomes prevalent, it is important to understand their key behavior, advantages, and the risks they present.
Leer más

LLM

Sockpuppeting: How a Single Line Can Bypass LLM Safety Guardrails

Guarding LLMs With a Layered Prompt Injection Representation

Stay Ahead of AI Threats: Secure LLM Applications With Trend Vision One

LLM as a Judge: Evaluating Accuracy in LLM Security Scans

The Mirage of AI Programming: Hallucinations and Code Integrity

Trend Vision Oneone-platform - Proactive Security Starts Here

Recursos

Soporte

Acerca de Trend

Sede en el país