UL Solutions will not evaluate the foundation models themselves, but will instead look at the value-add that an enterprise layers on top. That raises even more questions about the reliability of such third-party evaluations.
UL Solutions, part of the UL enterprise that grew out of Underwriters Laboratories, on Monday jumped into the crowded genAI third-party evaluation service market, joining Stanford University and Microsoft, among many others, but with a more customized approach. The UL team will be asking questions as well as analyzing code.
Some analysts and others in the AI space have questioned how reliable and precise such an effort would be. Would the workers handling the value-add genAI code know the answers to those questions? Even more cynically, they ask whether the workers (or contractors) would answer all questions honestly, or whether they would be more likely to tell the UL team what they think it wants to hear, …