Resources and Annotated Bibliography

Bennett, Kristin P edited this page Sep 6, 2024 · 2 revisions

CTBench Benchmark

  1. Neehal, N., Wang, B., Debopadhaya, S., Dan, S., Murugesan, K., Anand, V. and Bennett, K.P., 2024. CTBench: A Comprehensive Benchmark for Evaluating Language Model Capabilities in Clinical Trial Design. arXiv preprint arXiv:2406.17888. The paper describing CTBench.

  2. GitHub repository for the CTBench benchmark. This is the repository given to people who want to use the benchmark. https://github.com/nafis-neehal/CTBench_LLM