K Pro

How do we measure/prevent hallucinations?

We have daily monitoring in place (e.g., using a metric like Tool Call Accuracy (TCA)) to track the percentage of times the correct tool is identified and called. This helps us quickly identify systemic issues where an agent might be "hallucinating" the need for a specific tool when another is more suitable, or failing to identify any tool when one is needed. Parameter Monitoring: Once a tool is called, it's important that the correct parameters are passed to it. Incorrect parameters can lead to wrong actions, all of which manifest as hallucinations in the agent's final response. We monitor the accuracy and completeness of parameters passed to tools via some automated tests on a set of evaluation questions.

Browse all FAQs

How is K Pro different from other LLMs?

Is the idea of Biological Artificial Superintelligence ethically responsible?

Can I use K Pro with my own data / can I upload my own data?

How do I start with K Pro?

How would different LLMs or versions of the same LLM affect final outcomes?

How can you be sure that K Pro will always be used for good? Is there potential for it to be used for nefarious purposes?

How do you make sure that the scientific conclusions generated by K Pro are evidence-based and accountable?

Can I access new datasets through K Pro?

How does Owkin prevent bias in K Pro’s recommendations?

Do you own your full end to end tech stack and if not, how do you vet your vendors to ensure they have the same commitment protecting data as Owkin does?

What can K Pro do?

Are external experts or the academic community involved in shaping the development of K Pro?

Can anyone see the data I upload to K Pro?

What is the list of public and private datasets available in K Pro?

What is a Katalyst session?

How does K Pro integrate into existing IT systems?

How can I report an issue with the tool?

How do we standardize/harmonize diverse datasets and turn them into a productized agentic system (not just prompts)?

How do we measure/prevent hallucinations?

Isn’t there a risk of misuse if the AI can act “agentically”? How does Owkin ensure responsible use of agentic capabilities in AI?

How does K Pro protect confidential data?

How can I analyze data from specific patient populations?

What do the MOSAIC and MOSAIC Window datasets include?

Doesn’t giving an AI this much scientific scope risk displacing whole research teams?

Can I access the codebase and decision making process of K Pro?

How do I know I can trust the results that K Pro produces?

How mature is Agentic AI for pharmaceutical use?