INSTRUCTION-GUIDED ALIGNMENT OF CHATBOTS BASED ON LARGE LANGUAGE MODELS FOR COMPLIANCE-CONSTRAINED FINANCIAL SCENARIOS

Main Article Content

Oleksandr PISKUN

Abstract

Introduction. This paper investigates whether chatbots based on large language
models (LLMs) can be safely used in regulated financial scenarios at the pre-sales stage of client
interaction without model fine-tuning, provided that only instruction-based constraints are applied.
Currency risk management is considered as a case study. Within the study, two chatbot configurations
are designed and analyzed: a baseline (unconstrained) version and an instruction-constrained version
oriented toward compliance requirements. For evaluation, a compact framework is proposed,
covering three key dimensions: compliance violations, informativeness, and prescriptiveness. Based
on a selected set of realistic user queries, it is demonstrated that instruction-based alignment can
significantly reduce the model’s recommendation behavior while preserving a substantial portion of
its explanatory value.
Purpose. The aim of this study is to analyze the feasibility of safely using LLM-based chatbots
in compliance-constrained financial scenarios, to develop an instruction-constrained chatbot
configuration for pre-sales interaction in the domain of currency risk management, and to evaluate its
effectiveness.
Results. Design principles for deploying LLM-based chatbots in regulated financial
environments are formulated, including role constraints, functional limitations, controlled response
generation, and structured escalation mechanisms. These principles operationalize the practical
application of instruction-based alignment without modifying model parameters.
A compact, domain-specific evaluation framework for compliance-oriented conversational
systems is proposed, combining binary safety indicators with graded measures of informativeness and
prescriptiveness, supplemented by human evaluation procedures and inter-annotator agreement
analysis.
An empirical evaluation of instruction-based alignment under compliance constraints is
conducted in the context of pre-sales financial interaction. In contrast to prior approaches that focus
on alignment at the model level (e.g., RLHF or fine-tuning), this study isolates control at the usage
stage, implemented through prompts and policy constraints.
The results indicate that instruction-based alignment can effectively reduce compliance risks in
the use of LLM-based chatbots in regulated financial scenarios. In particular, the application of
clearly defined role constraints, prohibitions on recommendations, and structured response
generation mechanisms substantially reduces recommendation behavior without significant loss of
informativeness.
Conclusion. The study demonstrates that instruction-based alignment is a practical approach to
reducing compliance risks when deploying LLM-based chatbots in financial pre-sales scenarios. The
proposed evaluation framework and empirical results show that a significant reduction in
recommendation behavior can be achieved without substantial loss of explanatory capability.At the
same time, instruction-based control should be considered as one component of a broader risk
management system that includes monitoring, auditing, and human oversight. Future research should
focus on expanding the empirical base, improving evaluation methodologies, and integrating
instruction-based alignment with other approaches to ensure the safe use of artificial intelligence.

Article Details

How to Cite
PISKUN, O. (2025). INSTRUCTION-GUIDED ALIGNMENT OF CHATBOTS BASED ON LARGE LANGUAGE MODELS FOR COMPLIANCE-CONSTRAINED FINANCIAL SCENARIOS. Cherkasy University Bulletin: Applied Mathematics. Informatics, (1). https://doi.org/10.31651/2076-5886-2025-1-58-72
Section
Інформатика
Author Biography

Oleksandr PISKUN, Bohdan Khmelnytsky National University of Cherkasy

Candidate of Technical Sciences, Associate Professor, Head of Department of Applied Mathematics
and Informatics, Bohdan Khmelnytsky National University of Cherkasy

References

MSCI. (2016). Currency hedging: Adapting to volatility. MSCI Research.

Meketa Investment Group. (2022). Currency hedging. White paper.

Huang, W., Krohn, I., & Sushko, V. (2025). Global FX markets when hedging takes centre stage. BIS

Quarterly Review.

Isabella, G., de Almeida, M. I. S., Duran, F. M., & Gabler, C. (2025). From static to conversational: The role

of landing pages and chatbots in B2B lead generation. Journal of Business Research, 201, 115681.

Maga, S., & Bodlaj, M. (2025). Drivers and outcomes of chatbot use in the business-to-business context.

Journal of Business & Industrial Marketing, 40(1), 250–264.

Bank of England, & Financial Conduct Authority. (2024). Artificial intelligence in UK financial services.

Bank of England. https://www.bankofengland.co.uk

International Organization of Securities Commissions. (2021). The use of artificial intelligence and machine

learning by market intermediaries and asset managers: Final report. IOSCO. https://www.iosco.org

Aldasoro, I., Gambacorta, L., Korinek, A., Shreeti, V., & Stein, M. (2024). Intelligent financial system: How

AI is transforming finance (BIS Working Papers No. 1194). Bank for International Settlements.

European Central Bank. (2024). The rise of artificial intelligence: Benefits and risks for financial stability.

Financial Stability Review.

Kang, H., & Liu, X.-Y. (2023). Deficiency of large language models in finance: An empirical examination

of hallucination. arXiv. https://doi.org/10.48550/arXiv.2311.15548

European Securities and Markets Authority. (2023). Guidelines on certain aspects of the MiFID II suitability

requirements (ESMA35-43-3172). ESMA. https://www.esma.europa.eu

Wu, S., Irsoy, O., Lu, S., Dabravolski, V., Dredze, M., Gehrmann, S., … Mann, G. (2023). BloombergGPT:

A large language model for finance. arXiv. https://doi.org/10.48550/arXiv.2303.17564

Christiano, P. F., Leike, J., Brown, T. B., Martic, M., Legg, S., & Amodei, D. (2017). Deep reinforcement

learning from human preferences. arXiv. https://doi.org/10.48550/arXiv.1706.03741

Ouyang, L., Wu, J., Jiang, X., Almeida, D., Wainwright, C., Mishkin, P., … Lowe, R. (2022). Training

language models to follow instructions with human feedback. In Advances in Neural Information Processing

Systems, 35, 27730–27744.

Wei, J., Bosma, M., Zhao, V. Y., Guu, K., Yu, A. W., Lester, B., … Le, Q. V. (2021). Finetuned language

models are zero-shot learners. arXiv. https://doi.org/10.48550/arXiv.2109.01652

Sanh, V., Webson, A., Raffel, C., Bach, S. H., Sutawika, L., Alyafeai, Z., … Rush, A. M. (2021). Multitask

prompted training enables zero-shot task generalization. arXiv. https://doi.org/10.48550/arXiv.2110.08207

Chung, H. W., Hou, L., Longpre, S., Zoph, B., Tay, Y., Fedus, W., … Wei, J. (2024). Scaling instructionfinetuned language models. Journal of Machine Learning Research, 25(70), 1–53.

Bai, Y., Jones, A., Ndousse, K., Askell, A., Chen, A., DasSarma, N., … Amodei, D. (2022). Constitutional

AI: Harmlessness from AI feedback. arXiv. https://doi.org/10.48550/arXiv.2212.08073

Miehling, E., Desmond, M., Natesan Ramamurthy, K., Daly, E. M., Varshney, K. R., Farchi, E., Dognin, P.,

Rios, J., Bouneffouf, D., Liu, M., & Sattigeri, P. (2025). Evaluating the prompt steerability of large language

models. In Proceedings of the 2025 Conference of the Nations of the Americas Chapter of the Association

for Computational Linguistics: Human Language Technologies (pp. 7874–7900). Association for

Computational Linguistics.

Yang, H., Liu, X.-Y., & Wang, C. D. (2023). FinGPT: Open-source financial large language models. arXiv.

https://doi.org/10.48550/arXiv.2306.06031.

Wallace, E., Xiao, K., Leike, R., Weng, L., Heidecke, J., & Beutel, A. (2024). The instruction hierarchy:

Training LLMs to prioritize privileged instructions. arXiv. https://doi.org/10.48550/arXiv.2404.13208/

Qin, Y., Liang, S., Ye, Y., Zhu, K., Yan, L., Lu, Y., Lin, Y., Cong, X., Tang, X., Qian, B., Zhao, S., Hong,

L., Tian, R., Xie, R., Zhou, J., Gerstein, M., Li, D., Liu, Z., & Sun, M. (2024). ToolLLM: Facilitating large

language models to master 16000+ real-world APIs. In Proceedings of the International Conference on

Learning Representations.

Wen, J., Zhong, R., Khan, A., Perez, E., Steinhardt, J., Huang, M., Bowman, S. R., He, H., & Feng, S.

(2024). Language models learn to mislead humans via RLHF. arXiv.

https://doi.org/10.48550/arXiv.2409.12822

Cao, B., Wang, S., Lin, X., Wu, X., Zhang, H., Ni, L. M., & Guo, J. (2025). From deep learning to LLMs: A

survey of AI in quantitative investment. arXiv. https://doi.org/10.48550/arXiv.2503.21422

Yu, Z., et al. (2020). AVA: A conversational assistant for financial services. In Proceedings of the AAAI

Conference on Artificial Intelligence.

Deng, Y., Liao, L., Lei, W., Yang, G. H., Lam, W., & Chua, T.-S. (2025). Proactive conversational AI: A

comprehensive survey of advancements and opportunities. ACM Transactions on Information Systems,

(3), 1–45. https://doi.org/10.1145/3715097

Takayanagi, T., Izumi, K., Sanz-Cruzado, J., McCreadie, R., & Ounis, I. (2025). Are generative AI agents

effective personalized financial advisors? In Proceedings of the 48th International ACM SIGIR Conference

on Research and Development in Information Retrieval (pp. 286–295).

https://doi.org/10.1145/3726302.3729897

Biyani, P., Xu, J., & Carenini, G. (2024). RUBICON: Rubric-based evaluation of domain-specific human–AI

conversations. Microsoft Research. https://www.microsoft.com/en-us/research/publication/rubicon/

Lizée, T., et al. (2024). Evaluating conversational AI systems in healthcare: A multi-stage validation

framework. npj Digital Medicine.

Lewis, P., Perez, E., Piktus, A., Petroni, F., Karpukhin, V., Goyal, N., … Riedel, S. (2020). Retrievalaugmented generation for knowledge-intensive NLP tasks. arXiv. https://doi.org/10.48550/arXiv.2005.11401

van der Lee, C., Gatt, A., van Miltenburg, E., Wubben, S., & Krahmer, E. (2019). Best practices for the

human evaluation of automatically generated text. In Proceedings of the 12th International Conference on

Natural Language Generation (pp. 355–368). Association for Computational Linguistics.

Liu, C.-W., Lowe, R., Serban, I. V., Noseworthy, M., Charlin, L., & Pineau, J. (2016). How NOT to

evaluate your dialogue system: An empirical study of unsupervised evaluation metrics for dialogue response

generation. In Proceedings of the 2016 Conference on Empirical Methods in Natural Language Processing

(pp. 2122–2132). Association for Computational Linguistics.

Efron, B., & Tibshirani, R. (1986). Bootstrap methods for standard errors, confidence intervals, and other

measures of statistical accuracy. Statistical Science, 1(1), 54–75. https://doi.org/10.1214/ss/1177013815

Braun, V., & Clarke, V. (2006). Using thematic analysis in psychology. Qualitative Research in Psychology,

(2), 77–101. https://doi.org/10.1191/1478088706qp063oa

van der Lee, C., Gatt, A., van Miltenburg, E., & Krahmer, E. (2021). Human evaluation of automatically

generated text: Current trends and best practice guidelines. Computer Speech & Language, 67, Article

https://doi.org/10.1016/j.csl.2020.101151

Khashabi, D., Stanovsky, G., Bragg, J., Lourie, N., Kasai, J., Choi, Y., Smith, N. A., & Weld, D. S. (2022).

GENIE: Toward reproducible and standardized human evaluation for text generation. In Proceedings of the

Conference on Empirical Methods in Natural Language Processing (pp. 11444–11458). Association

for Computational Linguistics. https://doi.org/10.18653/v1/2022.emnlp-main.787