Hello,
I have a question regarding the data generation and quality control described in sections 2.4 and 2.5 of the paper.
Section 2.4 states "We utilized clinical reports from PTB-XL and MIMIC-IV-ECG as initial seed data and leveraged an advanced LLM (i.e., Llama-3-70B-Instruct) for data synthesis."
In section 2.5 it then states "LLM judge and scoring: an independent LLM (Llama 3 (Meta, 2024))."
Could you clarify exactly which LLM was used for judging and scoring? I would like to know whether the LLM used for synthesis and the LLM used for evaluation differed in architecture.
Thank you!