Resources and Annotated Bibliography

CTBench Benchmark

Neehal, N., Wang, B., Debopadhaya, S., Dan, S., Murugesan, K., Anand, V. and Bennett, K.P., 2024. CTBench: A Comprehensive Benchmark for Evaluating Language Model Capabilities in Clinical Trial Design. arXiv preprint arXiv:2406.17888. Description of CTBench Paper Link
Github with CTBENCH benchmark. This is the Github given to people trying to use the benchmark. https://github.com/nafis-neehal/CTBench_LLM