File Status Data

Module for the base FileStatusDatabase class.

class varats.data.databases.file_status_database.FileStatusDatabase[source]

Bases: EvaluationDatabase

Provides access to file status data.

This data is not cached, as most of it would be computed for the cache- integrity check anyways.

classmethod get_data_for_project(project_name, columns, commit_map, *case_studies, **kwargs)[source]

Retrieve data for a given project and case study.

Parameters:
  • project_name (str) – the project to retrieve data for

  • columns (List[str]) – the columns the resulting dataframe should have; all column names must occur in the COLUMNS class variable

  • commit_map (CommitMap) – the commit map to use

  • case_studies (CaseStudy) – the case study to retrieve data for

  • kwargs (Any) –

    • experiment_type: the experiment type to compute the status for

    • tag_blocked: whether to include information about blocked

      revisions

Return type:

DataFrame

Returns:

a pandas dataframe with the given columns and the

CACHE_ID: str = 'file_status_data'
COLUMNS: List[str] = ['revision', 'time_id', 'file_status']
COLUMN_TYPES = {'file_status': 'str', 'revision': 'str', 'time_id': 'int32'}