Question 1

Does adding AI capabilities to an application automatically make it less secure than a traditional application?

Accepted Answer

Not automatically. AI components introduce specific risk categories, such as prompt injection, model inversion, and training data poisoning, that traditional applications do not face. However, these risks are manageable through established controls including input validation, output filtering, access restrictions on model endpoints, and supply chain verification of pretrained models. An application with AI components that are properly scoped, tested, and monitored is not inherently less secure than a comparable traditional application. The risk differential comes from whether AI-specific threats are accounted for in the threat model, not from the presence of AI itself.

Question 2

Can existing application security tools fully cover AI-specific threats without any changes to tooling or process?

Accepted Answer

No. Existing static analysis, dependency scanning, and DAST tools address the conventional application security surface of an AI-enabled application, such as injection flaws in surrounding code, vulnerable libraries, and insecure API configurations. However, they typically cannot detect AI-specific threats such as adversarial inputs crafted to manipulate model outputs, data poisoning in training pipelines, prompt injection through user-controlled inputs to language models, or model extraction attempts at runtime. Covering AI-specific threats generally requires additional controls, including runtime monitoring of model inputs and outputs, evaluation frameworks for model robustness, and threat modeling that explicitly addresses the machine learning pipeline.

Question 3

When evaluating AI-based security detection tools, what false positive and false negative risks should practitioners account for?

Accepted Answer

Practitioners should account for both directions of error with equal weight. False positives occur when the tool flags benign inputs or behaviors as threats, which can cause alert fatigue and erode confidence in the tooling. False negatives, which are cases where genuine threats are not flagged, are equally significant and often underemphasized. AI-based detection tools may fail to flag novel attack patterns that differ from training data distributions, adversarial inputs specifically crafted to evade the model, or low-and-slow attacks that individually fall below detection thresholds. Scope boundaries also matter: most AI-based detection tools operate on observable signals at a specific layer, such as network traffic or application logs, and cannot detect threats that do not produce observable signals at that layer.

Question 4

How should organizations integrate AI security considerations into an existing secure development lifecycle?

Accepted Answer

Organizations should extend existing SDLC phases rather than create a parallel process. In the design phase, threat modeling should explicitly include the machine learning pipeline, covering training data sources, model provenance, and inference endpoints as attack surfaces. In development, dependency and supply chain controls should extend to pretrained models and datasets, not only code libraries. In testing, in addition to conventional security testing, evaluation should include adversarial robustness testing and prompt injection testing where applicable. In deployment and operations, runtime monitoring should cover model input and output anomalies in addition to standard application telemetry. Existing change management and incident response procedures should be updated to account for model updates and dataset changes as events that may affect the security posture.

Question 5

What controls are most effective at reducing prompt injection risk in applications that use large language models?

Accepted Answer

Effective controls typically include strict separation of trusted system instructions from untrusted user inputs at the architectural level, output filtering and validation before any model-generated content is acted upon or rendered, privilege minimization so that the model's access to downstream systems or actions is restricted to what is necessary, and monitoring of inputs and outputs for patterns consistent with injection attempts. No single control eliminates prompt injection risk entirely. Defense in depth is the recommended approach because prompt injection can occur through indirect channels, such as content retrieved from external sources during retrieval-augmented generation, not only through direct user input. Controls must account for both direct and indirect injection paths.

Question 6

How should training data and pretrained models be treated from a supply chain security perspective?

Accepted Answer

Training data and pretrained models should be subject to supply chain controls comparable to those applied to third-party code dependencies. This typically includes verifying the provenance and integrity of datasets and model weights before use, maintaining an inventory of models and their sources analogous to a software bill of materials, assessing the trustworthiness of sources from which pretrained models are obtained, and scanning datasets for poisoned or adversarially manipulated samples where feasible. Pretrained models obtained from external repositories may contain embedded behaviors that are not detectable through static inspection of weights alone, so additional validation through behavioral testing is generally warranted before deployment. Updates to pretrained models or fine-tuning datasets should be treated as changes that require security review, not only functional review.

Artificial Intelligence Security

Why it matters

Who it's relevant to

Inside Artificial Intelligence Security

Common questions

Common misconceptions

Best practices