Item request has been placed! ×
Item request cannot be made. ×
loading  Processing Request
Item request has been placed! ×
Item request cannot be made. ×
loading  Processing Request
Conference

TaxReasoning: Benchmarking Knowledge-Intensive Mathematical Reasoning with Evolving Tax Laws

  • Source: Hu, N, Wu, Y, Li, J, Hu, H, Qi, G, Zhai, S, Chen, Y, Wu, T, Wu, T, Chen, J & Pan, J Z 2026, TaxReasoning: Benchmarking Knowledge-Intensive Mathematical Reasoning with

Record details

×
Conference

Dynamic Intelligence Assessment:Benchmarking LLMs on the Road to AGI with a Focus on Model Confidence

Subjects: Artificial Intelligence; Large Language Models; Dynamic Benchmarking

  • Source: Tihanyi, N, Bisztray, T, Dubniczky, R A, Toth, R, Borsos, B, Cherif, B, Ferrag, M A, Muzsai, L, Jain, R, Marinelli, R, Cordeiro, L, Debbah, M, Mavroeidis, V & Josang, A 2024, Dynamic Intelligence

Record details

×
Academic Journal

Multi-centre benchmarking of deep learning models for COVID-19 detection in chest x-rays

Subjects: COVID-19; artificial intelligence; benchmarking

  • Source: Harkness, R, Frangi, A F, Zucker, K & Ravikumar, N 2024, 'Multi-centre benchmarking of deep learning models for COVID-19 detection in chest x-rays', Frontiers in

Record details

×
Conference

Towards Robust NILM: A Unified Seq2Point Benchmarking Framework with Balanced Model

Subjects: NILM; smart meter; Sequence2Point

  • Source: Mahajan, Y, Zhang, R & Mustafa, M 2025, Towards Robust NILM: A Unified Seq2Point Benchmarking Framework with Balanced Model. in IEEE PES ISGT (Innovative Smart Grid

Record details

×
Conference

Can LLMs Evaluate Complex Attribution in QA? Automatic Benchmarking using Knowledge Graphs

Subjects: Large Language Model; Attributed Question Answering; Knowledge Graph

  • Source: Hu, N, Chen, J, Wu, Y, Qi, G, Wang, H, Bi, S, Chen, Y, Wu, T & Pan, J Z 2025, Can LLMs Evaluate Complex Attribution in QA? Automatic Benchmarking using Knowledge Graphs.

Record details

×
Conference

CASTLE: Benchmarking Dataset for Static Code Analyzers and LLMs towards CWE Detection

Subjects: Security; Static Code Analysis; Security Analysis

  • Source: Dubniczky, R A, Horvát, K Z, Bisztray, T, Ferrag, M A, Cordeiro, L & Tihanyi, N 2025, CASTLE: Benchmarking Dataset for Static Code Analyzers and LLMs towards CWE

Record details

×
Academic Journal

Benchmarking small-variant genotyping in polyploids

Subjects: Benchmarking; Genotype; High-Throughput Nucleotide Sequencing

  • Source: Cooke, D P, Wedge, D C & Lunter, G 2022, 'Benchmarking small-variant genotyping in polyploids', Genome research, vol. 32, no. 2, pp. 403-408.

Record details

×
Academic Journal

Crowd-sourced benchmarking of single-sample tumor subclonal reconstruction

  • Source: PCAWG Evolution and Heterogeneity Working Group 2024, 'Crowd-sourced benchmarking of single-sample tumor subclonal reconstruction', Nature biotechnology.

Record details

×
Conference

Tidal turbine benchmarking project: Stage I - steady flow experiments

Subjects: ResearchInstitutes_Networks_Beacons/03/04; name=Energy

  • Source: Harvey, S W T, Chen, X, Rowe, D T, Mcnaughton, J, Vogel, C R, Bhavsar, K, Allsop, T, Gilbert, J, Mullings, H, Stallard, T, Benson, I, Young, A & Willden, R H J 2023, Tidal turbine

Record details

×
Academic Journal

Improving the benchmarking of ESG in real estate investment

  • Source: Newell, G, Nanda, A & Moss, A 2023, 'Improving the benchmarking of ESG in real estate investment', Journal of Property Investment and Finance.

Record details

×
  • 1-10 of  119 results for ""Benchmarking""