This thesis presents a reward-oriented approach to autoregressive natural language generation (NLG), introducing a novel decoding strategy that reframes beam search as a reward optimization process. In this framework termed Reward Search, each candidate sequence (beam) acts as an autonomous decision unit, selecting its next token by maximizing a composite payoff function. This function integrates the language model's fluency, measured via log-probability, with external control signals such as toxicity reduction or sentiment promotion, computed through a fine-tuned auxiliary reward model. By embedding these constraints directly into the decoding process, Reward Search enables fine-grained control over generation while preserving fluency and coherence. Empirical evaluations across detoxification and sentiment-controlled tasks demonstrate that the proposed method effectively reduces harmful or off-target outputs without sacrificing generation quality, offering a principled alternative to conventional decoding algorithms for safer and more controllable NLG systems.
A reward-driven analysis of autoregressive natural language generation.
ALUKO, OLUWATOBILOBA TEMITOPE
2024/2025
Abstract
This thesis presents a reward-oriented approach to autoregressive natural language generation (NLG), introducing a novel decoding strategy that reframes beam search as a reward optimization process. In this framework termed Reward Search, each candidate sequence (beam) acts as an autonomous decision unit, selecting its next token by maximizing a composite payoff function. This function integrates the language model's fluency, measured via log-probability, with external control signals such as toxicity reduction or sentiment promotion, computed through a fine-tuned auxiliary reward model. By embedding these constraints directly into the decoding process, Reward Search enables fine-grained control over generation while preserving fluency and coherence. Empirical evaluations across detoxification and sentiment-controlled tasks demonstrate that the proposed method effectively reduces harmful or off-target outputs without sacrificing generation quality, offering a principled alternative to conventional decoding algorithms for safer and more controllable NLG systems.File | Dimensione | Formato | |
---|---|---|---|
Thesis Documentation.pdf
accesso aperto
Dimensione
912.24 kB
Formato
Adobe PDF
|
912.24 kB | Adobe PDF | Visualizza/Apri |
I documenti in UNITESI sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.
https://hdl.handle.net/20.500.14247/25802