Thanks for this. It is a crucial topic fundamental to wide-scale adoption in corporate settings. In the world of time series models, no measure is perfect, but we have many good ones that allow baselining and monitoring (that also include whether users added value by changing the outputs). Finding more rigorous approaches to LLMs will require some very thoughtful work.
Thanks for this. It is a crucial topic fundamental to wide-scale adoption in corporate settings. In the world of time series models, no measure is perfect, but we have many good ones that allow baselining and monitoring (that also include whether users added value by changing the outputs). Finding more rigorous approaches to LLMs will require some very thoughtful work.
Thanks for this. It is a crucial topic fundamental to wide-scale adoption in corporate settings. In the world of time series models, no measure is perfect, but we have many good ones that allow baselining and monitoring (that also include whether users added value by changing the outputs). Finding more rigorous approaches to LLMs will require some very thoughtful work.