Lessons on off-policy methods from a notification component of a chatbot (Q2071403)
From MaRDI portal
| This is the item page for this Wikibase entity, intended for internal use and editing purposes. Please use this page instead for the normal view: Lessons on off-policy methods from a notification component of a chatbot |
scientific article; zbMATH DE number 7465681
| Language | Label | Description | Also known as |
|---|---|---|---|
| English | Lessons on off-policy methods from a notification component of a chatbot |
scientific article; zbMATH DE number 7465681 |
Statements
Lessons on off-policy methods from a notification component of a chatbot (English)
0 references
28 January 2022
0 references
contextual bandits
0 references
off-policy training
0 references
off-policy evaluation
0 references
limited data
0 references
small data
0 references