Our new paper Retrieval-Guided Reinforcement Learning for Boolean Circuit Minimization has been accepted to appear at ICLR 2024. In this paper we proposed an OOD detection based strategy to inform the reward function for policies in RL, with specific applications to secure automated hardware design.
A huge shout-out to Animesh and the other co-authors.