Research on LLM Output Drift in Financial Workflows: Quantifying Consistency Across Model Sizes

Financial institutions deploy Large Language Models (LLMs) for reconciliations, regulatory reporting, and client communications, but nondeterministic outputs (output drift) undermine auditability and…

Read the full article

raffisk8mo ago2 min readenInsight

technology finance ai research financial technology

You might also wanna read

FinGuard: Detecting Financial Regulatory Non-Compliance in LLM Interactions

FinGuard improves financial compliance detection by training a model on Chinese regulations, outperforming larger LLMs and safety models. It

arxiv.org·1mo ago

From Prompts to Contracts: Harness Engineering for Auditable Enterprise LLM Agents

arXiv:2607.08028v1 Announce Type: cross Abstract: Enterprise large language model (LLM) applications often begin as prototypes whose behavio

machinebrief.com·7d ago

When the Judge Changes, So Does the Measurement: Auditing LLM-as-Judge Reliability

arXiv:2607.08535v1 Announce Type: new Abstract: An LLM-as-judge score can move even when the candidate responses stay fixed, simply because

machinebrief.com·7d ago

BPE Tokenization Creates Exploitable Safety Gaps in LLM Alignment, Study Finds

Character-level perturbations bypass safety alignment in modern LLMs despite leaving prompts human-readable. We identify and test a central

arxiv.org·14d ago

Study Reveals Invisible Manipulation Vulnerability in AI Financial Advisory Systems That Evades All Current Detection Methods

AI systems are increasingly deployed for credit assessment and investment advisory in global financial markets, yet the integrity of their i

arxiv.org·1mo ago

Research Proposal: Measuring LLM Perplexity Scaling Laws Across Codebase Sizes for Safer Software

Research proposal for measuring how coding LLM perplexity scales with codebase context size, using Lean as a test case for whether formal la

gwern.net·10d ago

Comments

No comments yet. Be the first.