This readme file was generated on 2025-01-30 by Dr Joscelyn Harris

GENERAL INFORMATION

Title of Dataset: Rapid Evaporative Ionisation Mass Spectrometry can reliably age field caught malaria vectors (and has the potential to simultaneously also identify species and infection rate)

Author/Principal Investigator Information
Name: Hilary Ranson
ORCID: 0000-0003-2332-8247
Institution: Liverpool School of Tropical Medicine
Address: Pembroke Place, Liverpool L3 5QA, UK
Email: hilary.ranson@lstmed.ac.uk

Author/Alternative Investigator Information
Name: Joscelyn Harris
ORCID: 0000-0002-1942-6306
Institution: University of Liverpool
Address: Crown Street, Liverpool L69 7ZB, UK
Email: jsarsby@liverpool.ac.uk

Date of data collection: Approximate dates 2022-08-01 to 2023-01-30.

Geographic location of data collection:
Lab samples were collected at Liverpool School of Tropical Medicine, Liverpool, UK.
Remaining samples were collected from water sources in the towns of Tiefora, Sitena and Tengrela in Burkina Faso.
Infected samples were collected at the Department of Pathology, University of Cambridge, UK.
Analysis was conducted at the Centre for Proteome Research, Liverpool, UK.

Information about funding sources that supported the collection of the data: This work was funded by a grant from IVCC and MRC (MC_PC_19045). A.M.B thanks the MRC [MR/N00227X/1 and MR/W025701/1], Sir Isaac Newton Trust, Alborada Fund, Wellcome Trust ISSF and University of Cambridge JRG Scheme, GHIT, Rosetrees Trust (G109130) and the Royal Society (RGS/R1/201293) (IEC/R3/19302).

SHARING/ACCESS INFORMATION

Licenses/restrictions placed on the data: None

Links to publications that cite or use the data: Paper in preparation - will update once accepted.

Links to other publicly accessible locations of the data: N/A

Links/relationships to ancillary data sets: N/A

Was data derived from another source? No
If yes, list source(s):

Recommended citation for this dataset: Paper in preparation - will update once accepted.
DATA & FILE OVERVIEW

File List: List of folders and subfolders. Within each subfolder there is a list of .raw files, each containing the REIMS analysis of one mosquito. The .raw files were acquired by Rapid Evaporative Ionisation Mass Spectrometry on a Waters Synapt G2-Si instrument. The R code folder contains the in-house code used for the data analysis.

Laboratory reared with and without blood meals
    2 days old.zip
        Not Fed
    4-5 days old.zip
        Blood Fed
        Not Fed
    8-9 days old.zip
        Blood Fed
        Not Fed
    9-10 days old.zip
        Blood Fed
        Not Fed
    11-12 days old.zip
        Blood Fed
        Not Fed
    15-16 days old.zip
        Blood Fed
        Not Fed
    16-17 days old.zip
        Blood Fed
        Not Fed
    18-19 days old .zip
        Blood Fed
        Not Fed
    20-21 days old.zip
        Blood Fed
        Not Fed
    Additional 2 day old.zip
        Blood Fed
        Not Fed
    Blind samples.zip
        Blood Fed
        Not Fed

Larval collection Siniena_in classes
    Day 1.zip
    Day 3.zip
    Day 5.zip
    Day 9.zip
    Day 13.zip
    Day 15.zip
    Day 20.zip

Larval collection Tengrela_in classes
    Day 1.zip
    Day 3.zip
    Day 5.zip
    Day 9.zip
    Day 13.zip
    Day 15.zip
    Day 20.zip

Larval collection Tiefora_in classes
    Day 1.zip
    Day 3.zip
    Day 5.zip
    Day 9.zip
    Day 13.zip
    Day 15.zip
    Day 20.zip

Ovarian Age -> data in calssifications
    Blind Samples.zip
        Laid Once
        Laid Twice
        Not Laid
    Laid eggs once.zip
        Blood Fed
        Blood Fed Twice
    Laid eggs twice.zip
    Nulliparous.zip
        Not Fed
        Blood Fed Once

Adult mosquito collection from houses -> Adult collection_in classes
    Blind samples.zip
    Day 0.zip
    Day 4.zip
    Day 8.zip

Infected vs non-infected
    Infected.zip
    Non-infected .zip

Mosquito abdomens_in classes
    Day 1.zip
    Day 3.zip
    Day 5.zip
    Day 9.zip
    Day 13.zip
    Day 15.zip
    Day 20+.zip

Outside cages_in classes
    Day 1.zip
    Day 3.zip
    Day 7.zip
    Day 9.zip
    Day 14.zip
    Day 16.zip

Semi-field station_in classes
    Day 1.zip
    Day 3.zip
    Day 7.zip
    Day 9.zip
    Day 14.zip
    Day 16.zip

Rcode Packages.R
R code Notes.docx
Rcode Random Forest.R
Rcode_PCA + LDA plots.R

Relationship between files, if important: Each group of files is a discrete experiment.
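As a rough illustration of how the folder layout in the file list can be turned into class labels, the Python sketch below groups .raw entries by the folder directly above them. It is not part of the deposited R code. The function name group_raw_by_class, the sample names and the inner file name are hypothetical, and the sketch assumes that the class label (for example "Blood Fed") is the folder immediately containing each .raw entry.

```python
from collections import defaultdict
from pathlib import PurePosixPath


def group_raw_by_class(member_names):
    """Map each class label to the sorted .raw sample names found under it.

    Assumption: the class label is the folder directly above the .raw entry
    (for example "Blood Fed"); entries with no parent folder get the label "".
    """
    groups = defaultdict(set)
    for name in member_names:
        parts = PurePosixPath(name).parts
        for i, part in enumerate(parts):
            if part.lower().endswith(".raw"):
                label = parts[i - 1] if i > 0 else ""
                groups[label].add(part)
                break  # ignore anything nested inside a .raw entry
    return {label: sorted(samples) for label, samples in groups.items()}


# Hypothetical member names, in the form zipfile.ZipFile(path).namelist() returns.
example = [
    "Blood Fed/sample_01.raw/_FUNC001.DAT",
    "Blood Fed/sample_02.raw/_FUNC001.DAT",
    "Not Fed/sample_03.raw/_FUNC001.DAT",
]
print(group_raw_by_class(example))
```

For a real archive, the list of member names could be obtained with zipfile.ZipFile(path).namelist() from the Python standard library and passed to the same function.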
Additional related data collected that was not included in the current data package: None

Are there multiple versions of the dataset? No
If yes, name of file(s) that was updated: N/A
Why was the file updated?
When was the file updated?

METHODOLOGICAL INFORMATION

Description of methods used for collection/generation of data: Detailed methodology is included in the paper "Rapid Evaporative Ionisation Mass Spectrometry can reliably age field caught malaria vectors (and has the potential to simultaneously also identify species and infection rate)". Briefly, mosquitoes grown in the laboratory, or larvae collected from the wild and then hatched into cages, were killed by freezing and then analysed by REIMS. A link to the published paper will be included once accepted.

Methods for processing the data: Raw data are processed by the REIMS software Offline Model Builder, which aligns, normalises and bins the data to create one matrix per experiment. This matrix is interrogated using the R code provided.

Instrument- or software-specific information needed to interpret the data:
Instrument: Synapt G2-Si - Waters
Software: Offline Model Builder - Waters
Software: R and RStudio. RStudio Team (2020). RStudio: Integrated Development for R. RStudio, PBC, Boston, MA. URL http://www.rstudio.com/.

Standards and calibration information, if appropriate: At the start of each day the instrument was calibrated using sodium formate. LeuEnk was infused into the instrument for the duration of the experiment to help align spectra during data analysis.

Environmental/experimental conditions: Full details are included in the publication "Rapid Evaporative Ionisation Mass Spectrometry can reliably age field caught malaria vectors (and has the potential to simultaneously also identify species and infection rate)". Briefly, the experiments investigated how blood feeding, age, breeding cycles, rearing in a semi-natural habitat and infection state affect the REIMS analysis of mosquitoes. A link to the published paper will be included once accepted.
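To give a feel for what "bins and normalises" means in the processing step under "Methods for processing the data", the Python sketch below sums peak intensities into fixed-width m/z bins and scales each spectrum so its bins sum to 1. It is an illustration under stated assumptions only, not Offline Model Builder's algorithm. The function name bin_and_normalise, the bin width, the m/z range and the spectra are all made up, and the alignment step is left out entirely.

```python
import math


def bin_and_normalise(peaks, mz_min, mz_max, bin_width):
    """Sum intensities into fixed-width m/z bins, then scale the bins to sum to 1.

    peaks is a list of (mz, intensity) pairs; peaks outside [mz_min, mz_max)
    are dropped. All parameter values here are illustrative assumptions.
    """
    n_bins = math.ceil((mz_max - mz_min) / bin_width)
    row = [0.0] * n_bins
    for mz, intensity in peaks:
        if mz_min <= mz < mz_max:
            idx = min(int((mz - mz_min) / bin_width), n_bins - 1)
            row[idx] += intensity
    total = sum(row)
    return [value / total for value in row] if total > 0 else row


# Two made-up spectra; each becomes one row of a sample-by-bin matrix.
spectra = {
    "sample_01": [(100.2, 50.0), (100.4, 50.0), (101.7, 100.0)],
    "sample_02": [(100.9, 30.0), (102.1, 10.0)],
}
matrix = {
    name: bin_and_normalise(peaks, 100.0, 103.0, 1.0)
    for name, peaks in spectra.items()
}
print(matrix)
```

Each row of the resulting matrix corresponds to one mosquito and each column to one m/z bin, which is the general shape of input that classification code such as a random forest expects.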
Describe any quality-assurance procedures performed on the data: Models built from the data were tested using blind samples.

People involved with sample collection, processing, analysis and/or submission: