Maktaba

Item request has been placed!

Item request cannot be made.

Processing Request

Conference

TaxReasoning: Benchmarking Knowledge-Intensive Mathematical Reasoning with Evolving Tax Laws

Authors : Hu, Nan; Wu, Yike; Li, Jiaye

Source: Hu, N, Wu, Y, Li, J, Hu, H, Qi, G, Zhai, S, Chen, Y, Wu, T, Wu, T, Chen, J & Pan, J Z 2026, TaxReasoning: Benchmarking Knowledge-Intensive Mathematical Reasoning with

Record details

Read More Add to Saved list

Conference

Dynamic Intelligence Assessment:Benchmarking LLMs on the Road to AGI with a Focus on Model Confidence

Authors : Tihanyi, Norbert; Bisztray, Tamas; Dubniczky, Richard A.

Subjects: Artificial Intelligence; Large Language Models; Dynamic Benchmarking

Source: Tihanyi, N, Bisztray, T, Dubniczky, R A, Toth, R, Borsos, B, Cherif, B, Ferrag, M A, Muzsai, L, Jain, R, Marinelli, R, Cordeiro, L, Debbah, M, Mavroeidis, V & Josang, A 2024, Dynamic Intelligence

Record details

Read More Add to Saved list

Academic Journal

Multi-centre benchmarking of deep learning models for COVID-19 detection in chest x-rays

Authors : Harkness, Rachael; Frangi, Alejandro F; Zucker, Kieran

Subjects: COVID-19; artificial intelligence; benchmarking

Source: Harkness, R, Frangi, A F, Zucker, K & Ravikumar, N 2024, 'Multi-centre benchmarking of deep learning models for COVID-19 detection in chest x-rays', Frontiers in

Record details

Read More Add to Saved list

Conference

Towards Robust NILM: A Unified Seq2Point Benchmarking Framework with Balanced Model

Authors : Mahajan, Yash; Zhang, Ruichang; Mustafa, Mustafa

Subjects: NILM; smart meter; Sequence2Point

Source: Mahajan, Y, Zhang, R & Mustafa, M 2025, Towards Robust NILM: A Unified Seq2Point Benchmarking Framework with Balanced Model. in IEEE PES ISGT (Innovative Smart Grid

Record details

Read More Add to Saved list

Conference

Can LLMs Evaluate Complex Attribution in QA? Automatic Benchmarking using Knowledge Graphs

Authors : Hu, Nan; Chen, Jiaoyan; Wu, Yike

Subjects: Large Language Model; Attributed Question Answering; Knowledge Graph

Source: Hu, N, Chen, J, Wu, Y, Qi, G, Wang, H, Bi, S, Chen, Y, Wu, T & Pan, J Z 2025, Can LLMs Evaluate Complex Attribution in QA? Automatic Benchmarking using Knowledge Graphs.

Record details

Read More Add to Saved list

Conference

CASTLE: Benchmarking Dataset for Static Code Analyzers and LLMs towards CWE Detection

Authors : Dubniczky, Richard A.; Horvát, Krisztofer Zoltán; Bisztray, Tamas

Subjects: Security; Static Code Analysis; Security Analysis

Source: Dubniczky, R A, Horvát, K Z, Bisztray, T, Ferrag, M A, Cordeiro, L & Tihanyi, N 2025, CASTLE: Benchmarking Dataset for Static Code Analyzers and LLMs towards CWE

Record details

Read More Add to Saved list

Academic Journal

Benchmarking small-variant genotyping in polyploids

Authors : Cooke, Daniel P.; Wedge, David C.; Lunter, Gerton

Subjects: Benchmarking; Genotype; High-Throughput Nucleotide Sequencing

Source: Cooke, D P, Wedge, D C & Lunter, G 2022, 'Benchmarking small-variant genotyping in polyploids', Genome research, vol. 32, no. 2, pp. 403-408.

Record details

Read More Add to Saved list

Academic Journal

Crowd-sourced benchmarking of single-sample tumor subclonal reconstruction

Source: PCAWG Evolution and Heterogeneity Working Group 2024, 'Crowd-sourced benchmarking of single-sample tumor subclonal reconstruction', Nature biotechnology.

Record details

Read More Add to Saved list

Conference

Tidal turbine benchmarking project: Stage I - steady flow experiments

Authors : Harvey, S W Tucker; Chen, X; Rowe, D T

Subjects: ResearchInstitutes_Networks_Beacons/03/04; name=Energy

Source: Harvey, S W T, Chen, X, Rowe, D T, Mcnaughton, J, Vogel, C R, Bhavsar, K, Allsop, T, Gilbert, J, Mullings, H, Stallard, T, Benson, I, Young, A & Willden, R H J 2023, Tidal turbine

Record details

Read More Add to Saved list

Academic Journal

Improving the benchmarking of ESG in real estate investment

Authors : Newell, Graeme; Nanda, Anupam; Moss, Alex

Source: Newell, G, Nanda, A & Moss, A 2023, 'Improving the benchmarking of ESG in real estate investment', Journal of Property Investment and Finance.

Record details

Read More Add to Saved list

Search Results

Your Filters

TaxReasoning: Benchmarking Knowledge-Intensive Mathematical Reasoning with Evolving Tax Laws

Dynamic Intelligence Assessment:Benchmarking LLMs on the Road to AGI with a Focus on Model Confidence

Multi-centre benchmarking of deep learning models for COVID-19 detection in chest x-rays

Towards Robust NILM: A Unified Seq2Point Benchmarking Framework with Balanced Model

Can LLMs Evaluate Complex Attribution in QA? Automatic Benchmarking using Knowledge Graphs

CASTLE: Benchmarking Dataset for Static Code Analyzers and LLMs towards CWE Detection

Benchmarking small-variant genotyping in polyploids

Crowd-sourced benchmarking of single-sample tumor subclonal reconstruction

Tidal turbine benchmarking project: Stage I - steady flow experiments

Improving the benchmarking of ESG in real estate investment

Contact

Follow us