umami.input_vars_tools package#

Submodules#

umami.input_vars_tools.plotting_functions module#

Plots the given input variables of the given files and also a comparison.

umami.input_vars_tools.plotting_functions.check_kwargs_for_ylabel_and_n_ratio_panel(kwargs: dict, fallback_ylabel: str, n_datasets: int) dict#

Helper function to check the following keyword arguments + using fallback values if they are not set - ylabel - n_ratio_panels - norm (set to “True” if not provided)

Parameters:
  • kwargs (dict) – Keyword arguments handed to the plotting function

  • fallback_ylabel (str) – Fallback value for the ylabel

  • n_datasets (int) – Number of datasets that are plotted

Returns:

Updated keyword arguments

Return type:

kwargs

umami.input_vars_tools.plotting_functions.get_datasets_configuration(plotting_config: dict, tracks: bool = False)#

Helper function to transform dict that stores the configuration of the different datasets into lists of certain parameters.

Parameters:
  • plotting_config (dict) – Plotting configuration

  • tracks (bool, optional) – Bool if the function should look for the tracks_name variable in the dataset configurations.

Returns:

  • filepath_list (list) – List with the filepaths of all the datasets.

  • labels_list (list) – List with the ‘dataset label’ of each dataset.

  • class_labels_list (list) – List with the class labels for each dataset. If no dataset-specific class labels are provided, the globally defined class labels are used.

  • tracks_name_list (list) – List with the track names of the datasets. Only returned if tracks is True.

umami.input_vars_tools.plotting_functions.plot_input_vars_jets(datasets_filepaths: list, datasets_labels: list, datasets_class_labels: list, var_dict: dict, cut_vars_dict: dict, n_jets: int, xlabels_dict: dict | None = None, special_param_jets: dict | None = None, output_directory: str = 'input_vars_jets', plot_type: str = 'pdf', **kwargs)#

Plot the jet variable comparison for the given datasets. If multiple datasets are provided, a ratio plot comparing each flavour individually will be added below the main plot.

Parameters:
  • datasets_filepaths (list) – List of filepaths to the files.

  • datasets_labels (list) – Label of the dataset for the legend.

  • datasets_class_labels (list) – List with dataset-specific class labels, e.g. [[“ujets”, “cjets”], [“cjets”]] to plot light-jets and c-jets for the first but only c-jets for the second dataset

  • var_dict (dict) – Dict with all the variables you want to plot inside with their binning.

  • cut_vars_dict (dict) – Dict with cuts on variables applied to the input datasets.

  • n_jets (int) – Number of jets to use for plotting.

  • xlabels_dict (dict, optional) – Dict that stores the xlabels of the variables that are plotted. I.e. to specify a label for “pt_btagJes”, use {“pt_btagJes”: “$p_T$ [MeV]”}. Variables that do not appear in the dict will have the variable name as xlabel. By default None

  • special_param_jets (dict, optional) – Dict with special x-axis cuts for the given variable.

  • output_directory (str, optional) – Name of the output directory. Only the dir name not path! By default “input_vars_jets”

  • plot_type (str, optional) – File format for the output, by default “pdf”

  • **kwargs (dict) – Keyword arguments passed to the plot. You can use all arguments that are supported by the HistogramPlot class in the plotting API.

Raises:

ValueError – When operator log is chosen but more than one variable is given

umami.input_vars_tools.plotting_functions.plot_input_vars_trks(datasets_filepaths: list, datasets_labels: list, datasets_class_labels: list, datasets_track_names: list, n_jets: int, var_dict: dict, cut_vars_dict: dict, sorting_variable: str = 'ptfrac', xlabels_dict: dict | None = None, n_leading: list | None = None, output_directory: str = 'input_vars_trks', plot_type: str = 'pdf', track_origin: str = 'All', **kwargs)#

Plot the track variable in comparison to another model with ratio plot. If multiple datasets are provided, a ratio plot comparing each flavour individually will be added below the main plot.

Parameters:
  • datasets_filepaths (list) – List of filepaths to the files.

  • datasets_labels (list) – Label of the dataset for the legend.

  • datasets_class_labels (list) – List with dataset-specific class labels, e.g. [[“ujets”, “cjets”], [“cjets”]] to plot light-jets and c-jets for the first but only c-jets for the second dataset

  • datasets_track_names (list) – List with the track names of the files.

  • n_jets (int) – Number of jets to use for plotting.

  • var_dict (dict) – Dict with all the variables you want to plot inside with their binning.

  • cut_vars_dict (dict) – Dict with cuts on variables applied to the input datasets.

  • sorting_variable (str, optional) – Variable which is used for sorting, by default “ptfrac”

  • xlabels_dict (dict, optional) – Dict that stores the xlabels of the variables that are plotted. I.e. to specify a label for “pt_btagJes”, use {“pt_btagJes”: “$p_T$ [MeV]”}. Variables that do not appear in the dict will have the variable name as xlabel. By default None

  • n_leading (list) – n-th leading jet which is plotted. For all, = None.

  • output_directory (str) – Name of the output directory. Only the dir name not path!

  • plot_type (str, optional) – File format for the output, by default “pdf”

  • track_origin (str, optional) – Track set that is to be used for plotting, by default “All”

  • **kwargs (dict) – Keyword arguments passed to the plot. You can use all arguments that are supported by the HistogramPlot class in the plotting API.

umami.input_vars_tools.plotting_functions.plot_n_tracks_per_jet(datasets_filepaths: list, datasets_labels: list, datasets_class_labels: list, datasets_track_names: list, n_jets: int, cut_vars_dict: dict, output_directory: str = 'input_vars_trks', plot_type: str = 'pdf', track_origin: str = 'All', **kwargs)#

Plot the number of tracks per jet as a histogram. If multiple datasets are provided, a ratio plot comparing each flavour individually will be added below the main plot.

Parameters:
  • datasets_filepaths (list) – List of filepaths to the files.

  • datasets_labels (list) – Label of the dataset for the legend.

  • datasets_class_labels (list) – List with dataset-specific class labels, e.g. [[“ujets”, “cjets”], [“cjets”]] to plot light-jets and c-jets for the first but only c-jets for the second dataset

  • datasets_track_names (list) – List with the track names of the files.

  • n_jets (int) – Number of jets to use.

  • cut_vars_dict (dict) – Dict with cuts on variables applied to the input datasets.

  • output_directory (str) – Name of the output directory. Only the dir name not path!

  • plot_type (str, optional) – File format for the output, by default “pdf”

  • track_origin (str, optional) – Track set that is to be used for plotting, by default “All”

  • **kwargs (dict) – Keyword arguments passed to the plot. You can use all arguments that are supported by the HistogramPlot class in the plotting API.

Module contents#