corems.mass_spectrum.output.export API documentation

HighResMassSpecExport(out_file_path, mass_spectrum, output_type='excel') View Source

80    def __init__(self, out_file_path, mass_spectrum, output_type="excel"):
81        Thread.__init__(self)
82
83        self.output_file = Path(out_file_path)
84
85        # 'excel', 'csv' or 'pandas'
86        self.output_type = output_type
87
88        self.mass_spectrum = mass_spectrum
89
90        # collect all assigned atoms and order them accordingly to the Atoms.atoms_order list
91        self.atoms_order_list = self.get_all_used_atoms_in_order(self.mass_spectrum)
92
93        self._init_columns()

This constructor should always be called with keyword arguments. Arguments are:

group should be None; reserved for future extension when a ThreadGroup class is implemented.

target is the callable object to be invoked by the run() method. Defaults to None, meaning nothing is called.

name is the thread name. By default, a unique name is constructed of the form "Thread-N" where N is a small decimal number.

args is the argument tuple for the target invocation. Defaults to ().

kwargs is a dictionary of keyword arguments for the target invocation. Defaults to {}.

If a subclass overrides the constructor, it must make sure to invoke the base class constructor (Thread.__init__()) before doing anything else to the thread.

output_file

output_type

Returns the output type of the mass spectrum.

mass_spectrum

atoms_order_list

def save(self): View Source

139    def save(self):
140        """Save the mass spectrum data to the output file.
141
142        Raises
143        ------
144        ValueError
145            If the output type is not supported.
146        """
147
148        if self.output_type == "excel":
149            self.to_excel()
150        elif self.output_type == "csv":
151            self.to_csv()
152        elif self.output_type == "pandas":
153            self.to_pandas()
154        elif self.output_type == "hdf5":
155            self.to_hdf()
156        else:
157            raise ValueError(
158                "Unkown output type: %s; it can be 'excel', 'csv' or 'pandas'"
159                % self.output_type
160            )

Save the mass spectrum data to the output file.

Raises

ValueError: If the output type is not supported.

def run(self): View Source

162    def run(self):
163        """Run the export process.
164
165        This method is called when the thread starts.
166        It calls the save method to perform the export."""
167        self.save()

Run the export process.

This method is called when the thread starts. It calls the save method to perform the export.

def get_pandas_df(self, additional_columns=None): View Source

169    def get_pandas_df(self, additional_columns=None):
170        """Returns the mass spectrum data as a pandas DataFrame.
171
172        Parameters
173        ----------
174        additional_columns : list, optional
175            Additional columns to include in the DataFrame. Defaults to None.
176            Suitable additional columns are: 'Aromaticity Index', 'NOSC', 'Aromaticity Index (modified)'.
177
178        Returns
179        -------
180        DataFrame
181            The mass spectrum data as a pandas DataFrame.
182        """
183        if additional_columns is not None:
184            possible_additional_columns = [
185                "Aromaticity Index",
186                "NOSC",
187                "Aromaticity Index (modified)",
188            ]
189            if additional_columns:
190                for column in additional_columns:
191                    if column not in possible_additional_columns:
192                        raise ValueError("Invalid additional column: %s" % column)
193            columns = (
194                self.columns_label
195                + additional_columns
196                + self.get_all_used_atoms_in_order(self.mass_spectrum)
197            )
198        else:
199            columns = self.columns_label + self.get_all_used_atoms_in_order(
200                self.mass_spectrum
201            )
202        dict_data_list = self.get_list_dict_data(
203            self.mass_spectrum, additional_columns=additional_columns
204        )
205        df = DataFrame(dict_data_list, columns=columns)
206        df.name = self.output_file
207        return df

Returns the mass spectrum data as a pandas DataFrame.

Parameters

additional_columns (list, optional): Additional columns to include in the DataFrame. Defaults to None. Suitable additional columns are: 'Aromaticity Index', 'NOSC', 'Aromaticity Index (modified)'.

Returns

DataFrame: The mass spectrum data as a pandas DataFrame.

def write_settings(self, output_path, mass_spectrum): View Source

209    def write_settings(self, output_path, mass_spectrum):
210        """Writes the settings of the mass spectrum to a JSON file.
211
212        Parameters
213        ----------
214        output_path : str
215            The output file path.
216        mass_spectrum : MassSpectrum
217            The mass spectrum to export.
218        """
219
220        import json
221
222        dict_setting = parameter_to_dict.get_dict_data_ms(mass_spectrum)
223
224        dict_setting["MassSpecAttrs"] = self.get_mass_spec_attrs(mass_spectrum)
225        dict_setting["analyzer"] = mass_spectrum.analyzer
226        dict_setting["instrument_label"] = mass_spectrum.instrument_label
227        dict_setting["sample_name"] = mass_spectrum.sample_name
228
229        with open(
230            output_path.with_suffix(".json"),
231            "w",
232            encoding="utf8",
233        ) as outfile:
234            output = json.dumps(
235                dict_setting, sort_keys=True, indent=4, separators=(",", ": ")
236            )
237            outfile.write(output)

Writes the settings of the mass spectrum to a JSON file.

Parameters

output_path (str): The output file path.
mass_spectrum (MassSpectrum): The mass spectrum to export.

def to_pandas(self, write_metadata=True): View Source

239    def to_pandas(self, write_metadata=True):
240        """Exports the mass spectrum data to a pandas DataFrame and saves it as a pickle file.
241
242        Parameters
243        ----------
244        write_metadata : bool, optional
245            Whether to write the metadata to a JSON file. Defaults to True.
246        """
247
248        columns = self.columns_label + self.get_all_used_atoms_in_order(
249            self.mass_spectrum
250        )
251
252        dict_data_list = self.get_list_dict_data(self.mass_spectrum)
253
254        df = DataFrame(dict_data_list, columns=columns)
255
256        df.to_pickle(self.output_file.with_suffix(".pkl"))
257
258        if write_metadata:
259            self.write_settings(self.output_file, self.mass_spectrum)

Exports the mass spectrum data to a pandas DataFrame and saves it as a pickle file.

Parameters

write_metadata (bool, optional): Whether to write the metadata to a JSON file. Defaults to True.

def to_excel(self, write_metadata=True): View Source

261    def to_excel(self, write_metadata=True):
262        """Exports the mass spectrum data to an Excel file.
263
264        Parameters
265        ----------
266        write_metadata : bool, optional
267            Whether to write the metadata to a JSON file. Defaults to True.
268        """
269
270        columns = self.columns_label + self.get_all_used_atoms_in_order(
271            self.mass_spectrum
272        )
273
274        dict_data_list = self.get_list_dict_data(self.mass_spectrum)
275
276        df = DataFrame(dict_data_list, columns=columns)
277
278        df.to_excel(self.output_file.with_suffix(".xlsx"))
279
280        if write_metadata:
281            self.write_settings(self.output_file, self.mass_spectrum)

Exports the mass spectrum data to an Excel file.

Parameters

write_metadata (bool, optional): Whether to write the metadata to a JSON file. Defaults to True.

def to_csv(self, write_metadata=True): View Source

283    def to_csv(self, write_metadata=True):
284        """Exports the mass spectrum data to a CSV file.
285
286        Parameters
287        ----------
288        write_metadata : bool, optional
289            Whether to write the metadata to a JSON file. Defaults to True.
290        """
291
292        columns = self.columns_label + self.get_all_used_atoms_in_order(
293            self.mass_spectrum
294        )
295
296        dict_data_list = self.get_list_dict_data(self.mass_spectrum)
297
298        import csv
299
300        try:
301            with open(self.output_file.with_suffix(".csv"), "w", newline="") as csvfile:
302                writer = csv.DictWriter(csvfile, fieldnames=columns)
303                writer.writeheader()
304                for data in dict_data_list:
305                    writer.writerow(data)
306            if write_metadata:
307                self.write_settings(self.output_file, self.mass_spectrum)
308
309        except IOError as ioerror:
310            print(ioerror)

Exports the mass spectrum data to a CSV file.

Parameters

write_metadata (bool, optional): Whether to write the metadata to a JSON file. Defaults to True.

def to_json(self): View Source

312    def to_json(self):
313        """Exports the mass spectrum data to a JSON string."""
314
315        columns = self.columns_label + self.get_all_used_atoms_in_order(
316            self.mass_spectrum
317        )
318
319        dict_data_list = self.get_list_dict_data(self.mass_spectrum)
320
321        df = DataFrame(dict_data_list, columns=columns)
322
323        # for key, values in dict_data.items():
324        #    if not values: dict_data[key] = NaN
325
326        # output = json.dumps(dict_data, sort_keys=True, indent=4, separators=(',', ': '))
327        return df.to_json(orient="records")

Exports the mass spectrum data to a JSON string.

def add_mass_spectrum_to_hdf5( self, hdf_handle, mass_spectrum, group_key, mass_spectra_group=None, export_raw=True): View Source

329    def add_mass_spectrum_to_hdf5(
330        self,
331        hdf_handle,
332        mass_spectrum,
333        group_key,
334        mass_spectra_group=None,
335        export_raw=True,
336    ):
337        """Adds the mass spectrum data to an HDF5 file.
338
339        Parameters
340        ----------
341        hdf_handle : h5py.File
342            The HDF5 file handle.
343        mass_spectrum : MassSpectrum
344            The mass spectrum to add to the HDF5 file.
345        group_key : str
346            The group key (where to add the mass spectrum data within the HDF5 file).
347        mass_spectra_group : h5py.Group, optional
348            The mass spectra group. Defaults to None (no group, mass spectrum is added to the root).
349        export_raw : bool, optional
350            Whether to export the raw data. Defaults to True.
351            If False, only the processed data (peaks) is exported (essentially centroided data).
352        """
353        if mass_spectra_group is None:
354            # Check if the file has the necessary attributes and add them if not
355            # This assumes that if there is a mass_spectra_group, these attributes were already added to the file
356            if not hdf_handle.attrs.get("date_utc"):
357                timenow = str(
358                    datetime.now(timezone.utc).strftime("%d/%m/%Y %H:%M:%S %Z")
359                )
360                hdf_handle.attrs["date_utc"] = timenow
361                hdf_handle.attrs["file_name"] = mass_spectrum.filename.name
362                hdf_handle.attrs["data_structure"] = "mass_spectrum"
363                hdf_handle.attrs["analyzer"] = mass_spectrum.analyzer
364                hdf_handle.attrs["instrument_label"] = mass_spectrum.instrument_label
365                hdf_handle.attrs["sample_name"] = mass_spectrum.sample_name
366
367        list_results = self.list_dict_to_list(mass_spectrum, is_hdf5=True)
368
369        dict_ms_attrs = self.get_mass_spec_attrs(mass_spectrum)
370
371        setting_dicts = parameter_to_dict.get_dict_data_ms(mass_spectrum)
372
373        columns_labels = json.dumps(
374            self.columns_label + self.get_all_used_atoms_in_order(mass_spectrum),
375            sort_keys=False,
376            indent=4,
377            separators=(",", ": "),
378        )
379
380        group_key = group_key
381
382        if mass_spectra_group is not None:
383            hdf_handle = mass_spectra_group
384
385        if group_key not in hdf_handle.keys():
386            scan_group = hdf_handle.create_group(group_key)
387
388            # If there is raw data (from profile data) save it
389            if not mass_spectrum.is_centroid and export_raw:
390                mz_abun_array = empty(shape=(2, len(mass_spectrum.abundance_profile)))
391
392                mz_abun_array[0] = mass_spectrum.abundance_profile
393                mz_abun_array[1] = mass_spectrum.mz_exp_profile
394
395                raw_ms_dataset = scan_group.create_dataset(
396                    "raw_ms", data=mz_abun_array, dtype="f8"
397                )
398
399            else:
400                #  create empy dataset for missing raw data
401                raw_ms_dataset = scan_group.create_dataset("raw_ms", dtype="f8")
402
403            raw_ms_dataset.attrs["MassSpecAttrs"] = json.dumps(dict_ms_attrs)
404
405            if isinstance(mass_spectrum, MassSpecfromFreq):
406                raw_ms_dataset.attrs["TransientSetting"] = json.dumps(
407                    setting_dicts.get("TransientSetting"),
408                    sort_keys=False,
409                    indent=4,
410                    separators=(",", ": "),
411                )
412
413        else:
414            scan_group = hdf_handle.get(group_key)
415
416        # if there is not processed data len = 0, otherwise len() will return next index
417        index_processed_data = str(len(scan_group.keys()))
418
419        timenow = str(datetime.now(timezone.utc).strftime("%d/%m/%Y %H:%M:%S %Z"))
420
421        processed_dset = scan_group.create_dataset(
422            index_processed_data, data=list_results
423        )
424
425        processed_dset.attrs["date_utc"] = timenow
426
427        processed_dset.attrs["ColumnsLabels"] = columns_labels
428
429        processed_dset.attrs["MoleculaSearchSetting"] = json.dumps(
430            setting_dicts.get("MoleculaSearch"),
431            sort_keys=False,
432            indent=4,
433            separators=(",", ": "),
434        )
435
436        processed_dset.attrs["MassSpecPeakSetting"] = json.dumps(
437            setting_dicts.get("MassSpecPeak"),
438            sort_keys=False,
439            indent=4,
440            separators=(",", ": "),
441        )
442
443        processed_dset.attrs["MassSpectrumSetting"] = json.dumps(
444            setting_dicts.get("MassSpectrum"),
445            sort_keys=False,
446            indent=4,
447            separators=(",", ": "),
448        )

Adds the mass spectrum data to an HDF5 file.

Parameters

hdf_handle (h5py.File): The HDF5 file handle.
mass_spectrum (MassSpectrum): The mass spectrum to add to the HDF5 file.
group_key (str): The group key (where to add the mass spectrum data within the HDF5 file).
mass_spectra_group (h5py.Group, optional): The mass spectra group. Defaults to None (no group, mass spectrum is added to the root).
export_raw (bool, optional): Whether to export the raw data. Defaults to True. If False, only the processed data (peaks) is exported (essentially centroided data).

def to_hdf(self): View Source

450    def to_hdf(self):
451        """Exports the mass spectrum data to an HDF5 file."""
452
453        with h5py.File(self.output_file.with_suffix(".hdf5"), "a") as hdf_handle:
454            self.add_mass_spectrum_to_hdf5(
455                hdf_handle, self.mass_spectrum, str(self.mass_spectrum.scan_number)
456            )

Exports the mass spectrum data to an HDF5 file.

def parameters_to_toml(self): View Source

458    def parameters_to_toml(self):
459        """Converts the mass spectrum parameters to a TOML string.
460
461        Returns
462        -------
463        str
464            The TOML string of the mass spectrum parameters.
465        """
466
467        dict_setting = parameter_to_dict.get_dict_data_ms(self.mass_spectrum)
468
469        dict_setting["MassSpecAttrs"] = self.get_mass_spec_attrs(self.mass_spectrum)
470        dict_setting["analyzer"] = self.mass_spectrum.analyzer
471        dict_setting["instrument_label"] = self.mass_spectrum.instrument_label
472        dict_setting["sample_name"] = self.mass_spectrum.sample_name
473
474        output = toml.dumps(dict_setting)
475
476        return output

Converts the mass spectrum parameters to a TOML string.

Returns

str: The TOML string of the mass spectrum parameters.

def parameters_to_json(self): View Source

478    def parameters_to_json(self):
479        """Converts the mass spectrum parameters to a JSON string.
480
481        Returns
482        -------
483        str
484            The JSON string of the mass spectrum parameters.
485        """
486
487        dict_setting = parameter_to_dict.get_dict_data_ms(self.mass_spectrum)
488
489        dict_setting["MassSpecAttrs"] = self.get_mass_spec_attrs(self.mass_spectrum)
490        dict_setting["analyzer"] = self.mass_spectrum.analyzer
491        dict_setting["instrument_label"] = self.mass_spectrum.instrument_label
492        dict_setting["sample_name"] = self.mass_spectrum.sample_name
493
494        output = json.dumps(dict_setting)
495
496        return output

Converts the mass spectrum parameters to a JSON string.

Returns

str: The JSON string of the mass spectrum parameters.

def get_mass_spec_attrs(self, mass_spectrum): View Source

498    def get_mass_spec_attrs(self, mass_spectrum):
499        """Returns the mass spectrum attributes as a dictionary.
500
501        Parameters
502        ----------
503        mass_spectrum : MassSpectrum
504            The mass spectrum to export.
505
506        Returns
507        -------
508        dict
509            The mass spectrum attributes.
510        """
511
512        dict_ms_attrs = {}
513        dict_ms_attrs["polarity"] = mass_spectrum.polarity
514        dict_ms_attrs["rt"] = mass_spectrum.retention_time
515        dict_ms_attrs["tic"] = mass_spectrum.tic
516        dict_ms_attrs["mobility_scan"] = mass_spectrum.mobility_scan
517        dict_ms_attrs["mobility_rt"] = mass_spectrum.mobility_rt
518        dict_ms_attrs["Aterm"] = mass_spectrum.Aterm
519        dict_ms_attrs["Bterm"] = mass_spectrum.Bterm
520        dict_ms_attrs["Cterm"] = mass_spectrum.Cterm
521        dict_ms_attrs["baseline_noise"] = mass_spectrum.baseline_noise
522        dict_ms_attrs["baseline_noise_std"] = mass_spectrum.baseline_noise_std
523
524        return dict_ms_attrs

Returns the mass spectrum attributes as a dictionary.

Parameters

mass_spectrum (MassSpectrum): The mass spectrum to export.

Returns

dict: The mass spectrum attributes.

def get_all_used_atoms_in_order(self, mass_spectrum): View Source

526    def get_all_used_atoms_in_order(self, mass_spectrum):
527        """Returns the list of assigned atoms in the order specified by Atoms.atoms_order list.
528
529        Parameters
530        ----------
531        mass_spectrum : MassSpectrum
532            The mass spectrum to export.
533
534        Returns
535        -------
536        list
537            The list of assigned atoms in the order specified by Atoms.atoms_order list.
538        """
539
540        atoms_in_order = Atoms.atoms_order
541        all_used_atoms = set()
542        if mass_spectrum:
543            for ms_peak in mass_spectrum:
544                if ms_peak:
545                    for m_formula in ms_peak:
546                        for atom in m_formula.atoms:
547                            all_used_atoms.add(atom)
548
549        def sort_method(atom):
550            return [atoms_in_order.index(atom)]
551
552        return sorted(all_used_atoms, key=sort_method)

Returns the list of assigned atoms in the order specified by Atoms.atoms_order list.

Parameters

mass_spectrum (MassSpectrum): The mass spectrum to export.

Returns

list: The list of assigned atoms in the order specified by Atoms.atoms_order list.

def list_dict_to_list(self, mass_spectrum, is_hdf5=False): View Source

554    def list_dict_to_list(self, mass_spectrum, is_hdf5=False):
555        """Returns the mass spectrum data as a list of dictionaries.
556
557        Parameters
558        ----------
559        mass_spectrum : MassSpectrum
560            The mass spectrum to export.
561        is_hdf5 : bool, optional
562            Whether the mass spectrum is being exported to an HDF5 file. Defaults to False.
563
564        Returns
565        -------
566        list
567            The mass spectrum data as a list of dictionaries.
568        """
569
570        column_labels = self.columns_label + self.get_all_used_atoms_in_order(
571            mass_spectrum
572        )
573
574        dict_list = self.get_list_dict_data(mass_spectrum, is_hdf5=is_hdf5)
575
576        all_lines = []
577        for dict_res in dict_list:
578            result_line = [NaN] * len(column_labels)
579
580            for label, value in dict_res.items():
581                label_index = column_labels.index(label)
582                result_line[label_index] = value
583
584            all_lines.append(result_line)
585
586        return all_lines

Returns the mass spectrum data as a list of dictionaries.

Parameters

mass_spectrum (MassSpectrum): The mass spectrum to export.
is_hdf5 (bool, optional): Whether the mass spectrum is being exported to an HDF5 file. Defaults to False.

Returns

list: The mass spectrum data as a list of dictionaries.

def get_list_dict_data( self, mass_spectrum, include_no_match=True, include_isotopologues=True, isotopologue_inline=True, no_match_inline=False, is_hdf5=False, additional_columns=None): View Source

588    def get_list_dict_data(
589        self,
590        mass_spectrum,
591        include_no_match=True,
592        include_isotopologues=True,
593        isotopologue_inline=True,
594        no_match_inline=False,
595        is_hdf5=False,
596        additional_columns=None,
597    ):
598        """Returns the mass spectrum data as a list of dictionaries.
599
600        Parameters
601        ----------
602        mass_spectrum : MassSpectrum
603            The mass spectrum to export.
604        include_no_match : bool, optional
605            Whether to include unassigned (no match) data. Defaults to True.
606        include_isotopologues : bool, optional
607            Whether to include isotopologues. Defaults to True.
608        isotopologue_inline : bool, optional
609            Whether to include isotopologues inline. Defaults to True.
610        no_match_inline : bool, optional
611            Whether to include unassigned (no match) data inline. Defaults to False.
612        is_hdf5 : bool, optional
613            Whether the mass spectrum is being exported to an HDF5 file. Defaults to False.
614
615        Returns
616        -------
617        list
618            The mass spectrum data as a list of dictionaries.
619        """
620
621        dict_data_list = []
622
623        if is_hdf5:
624            encode = ".encode('utf-8')"
625        else:
626            encode = ""
627
628        def add_no_match_dict_data(index, ms_peak):
629            """
630            Export dictionary of mspeak info for unassigned (no match) data
631            """
632            dict_result = {
633                "Index": index,
634                "m/z": ms_peak._mz_exp,
635                "Calibrated m/z": ms_peak.mz_exp,
636                "Peak Height": ms_peak.abundance,
637                "Peak Area": ms_peak.area,
638                "Resolving Power": ms_peak.resolving_power,
639                "S/N": ms_peak.signal_to_noise,
640                "Ion Charge": ms_peak.ion_charge,
641                "Heteroatom Class": eval("Labels.unassigned{}".format(encode)),
642            }
643
644            dict_data_list.append(dict_result)
645
646        def add_match_dict_data(index, ms_peak, mformula, additional_columns=None):
647            """
648            Export dictionary of mspeak info for assigned (match) data
649            """
650            formula_dict = mformula.to_dict()
651
652            dict_result = {
653                "Index": index,
654                "m/z": ms_peak._mz_exp,
655                "Calibrated m/z": ms_peak.mz_exp,
656                "Calculated m/z": mformula.mz_calc,
657                "Peak Height": ms_peak.abundance,
658                "Peak Area": ms_peak.area,
659                "Resolving Power": ms_peak.resolving_power,
660                "S/N": ms_peak.signal_to_noise,
661                "Ion Charge": ms_peak.ion_charge,
662                "m/z Error (ppm)": mformula.mz_error,
663                "Confidence Score": mformula.confidence_score,
664                "Isotopologue Similarity": mformula.isotopologue_similarity,
665                "m/z Error Score": mformula.average_mz_error_score,
666                "DBE": mformula.dbe,
667                "Heteroatom Class": eval("mformula.class_label{}".format(encode)),
668                "H/C": mformula.H_C,
669                "O/C": mformula.O_C,
670                "Ion Type": eval("mformula.ion_type.lower(){}".format(encode)),
671                "Is Isotopologue": int(mformula.is_isotopologue),
672                "Molecular Formula": eval("mformula.string{}".format(encode)),
673            }
674            if additional_columns is not None:
675                possible_dict = {
676                    "Aromaticity Index": mformula.A_I,
677                    "NOSC": mformula.nosc,
678                    "Aromaticity Index (modified)": mformula.A_I_mod,
679                }
680                for column in additional_columns:
681                    dict_result[column] = possible_dict.get(column)
682
683            if mformula.adduct_atom:
684                dict_result["Adduct"] = eval("mformula.adduct_atom{}".format(encode))
685
686            if mformula.is_isotopologue:
687                dict_result["Mono Isotopic Index"] = mformula.mspeak_index_mono_isotopic
688
689            if self.atoms_order_list is None:
690                atoms_order_list = self.get_all_used_atoms_in_order(mass_spectrum)
691            else:
692                atoms_order_list = self.atoms_order_list
693
694            for atom in atoms_order_list:
695                if atom in formula_dict.keys():
696                    dict_result[atom] = formula_dict.get(atom)
697
698            dict_data_list.append(dict_result)
699
700        score_methods = mass_spectrum.molecular_search_settings.score_methods
701        selected_score_method = (
702            mass_spectrum.molecular_search_settings.output_score_method
703        )
704
705        if selected_score_method in score_methods:
706            # temp set score method as the one chosen in the output
707            current_method = mass_spectrum.molecular_search_settings.score_method
708            mass_spectrum.molecular_search_settings.score_method = selected_score_method
709
710            for index, ms_peak in enumerate(mass_spectrum):
711                # print(ms_peak.mz_exp)
712
713                if ms_peak:
714                    m_formula = ms_peak.best_molecular_formula_candidate
715
716                    if m_formula:
717                        if not m_formula.is_isotopologue:
718                            add_match_dict_data(
719                                index,
720                                ms_peak,
721                                m_formula,
722                                additional_columns=additional_columns,
723                            )
724
725                            for (
726                                iso_mspeak_index,
727                                iso_mf_formula,
728                            ) in m_formula.mspeak_mf_isotopologues_indexes:
729                                iso_ms_peak = mass_spectrum[iso_mspeak_index]
730                                add_match_dict_data(
731                                    iso_mspeak_index,
732                                    iso_ms_peak,
733                                    iso_mf_formula,
734                                    additional_columns=additional_columns,
735                                )
736                else:
737                    if include_no_match and no_match_inline:
738                        add_no_match_dict_data(index, ms_peak)
739
740            if include_no_match and not no_match_inline:
741                for index, ms_peak in enumerate(mass_spectrum):
742                    if not ms_peak:
743                        add_no_match_dict_data(index, ms_peak)
744            # reset score method as the one chosen in the output
745            mass_spectrum.molecular_search_settings.score_method = current_method
746
747        else:
748            for index, ms_peak in enumerate(mass_spectrum):
749                # check if there is a molecular formula candidate for the msPeak
750
751                if ms_peak:
752                    # m_formula = ms_peak.molecular_formula_lowest_error
753                    for m_formula in ms_peak:
754                        if mass_spectrum.molecular_search_settings.output_min_score > 0:
755                            if (
756                                m_formula.confidence_score
757                                >= mass_spectrum.molecular_search_settings.output_min_score
758                            ):
759                                if m_formula.is_isotopologue:  # isotopologues inline
760                                    if include_isotopologues and isotopologue_inline:
761                                        add_match_dict_data(
762                                            index,
763                                            ms_peak,
764                                            m_formula,
765                                            additional_columns=additional_columns,
766                                        )
767                                else:
768                                    add_match_dict_data(
769                                        index,
770                                        ms_peak,
771                                        m_formula,
772                                        additional_columns=additional_columns,
773                                    )  # add monoisotopic peak
774
775                            # cutoff because of low score
776                            else:
777                                add_no_match_dict_data(index, ms_peak)
778
779                        else:
780                            if m_formula.is_isotopologue:  # isotopologues inline
781                                if include_isotopologues and isotopologue_inline:
782                                    add_match_dict_data(
783                                        index,
784                                        ms_peak,
785                                        m_formula,
786                                        additional_columns=additional_columns,
787                                    )
788                            else:
789                                add_match_dict_data(
790                                    index,
791                                    ms_peak,
792                                    m_formula,
793                                    additional_columns=additional_columns,
794                                )  # add monoisotopic peak
795                else:
796                    # include not_match
797                    if include_no_match and no_match_inline:
798                        add_no_match_dict_data(index, ms_peak)
799
800            if include_isotopologues and not isotopologue_inline:
801                for index, ms_peak in enumerate(mass_spectrum):
802                    for m_formula in ms_peak:
803                        if m_formula.is_isotopologue:
804                            if (
805                                m_formula.confidence_score
806                                >= mass_spectrum.molecular_search_settings.output_min_score
807                            ):
808                                add_match_dict_data(
809                                    index,
810                                    ms_peak,
811                                    m_formula,
812                                    additional_columns=additional_columns,
813                                )
814
815            if include_no_match and not no_match_inline:
816                for index, ms_peak in enumerate(mass_spectrum):
817                    if not ms_peak:
818                        add_no_match_dict_data(index, ms_peak)
819
820        # remove duplicated add_match data possibly introduced on the output_score_filter step
821        res = []
822        [res.append(x) for x in dict_data_list if x not in res]
823
824        return res

Returns the mass spectrum data as a list of dictionaries.

Parameters

mass_spectrum (MassSpectrum): The mass spectrum to export.
include_no_match (bool, optional): Whether to include unassigned (no match) data. Defaults to True.
include_isotopologues (bool, optional): Whether to include isotopologues. Defaults to True.
isotopologue_inline (bool, optional): Whether to include isotopologues inline. Defaults to True.
no_match_inline (bool, optional): Whether to include unassigned (no match) data inline. Defaults to False.
is_hdf5 (bool, optional): Whether the mass spectrum is being exported to an HDF5 file. Defaults to False.

Returns

list: The mass spectrum data as a list of dictionaries.

corems.mass_spectrum.output.export

Parameters

Attributes

Methods

Raises

Parameters

Returns

Parameters

Parameters

Parameters

Parameters

Parameters

Returns

Returns

Parameters

Returns

Parameters

Returns

Parameters

Returns

Parameters

Returns

Inherited Members