`mt.sql.base`

Base functions dealing with an SQL database.

Functions 

frame_sql(): Undocumented.
indices(): Returns the list of named indices of the dataframe, ignoring any unnamed index.
run_func(): Attempt to run a function a number of times to overcome OperationalError exceptions.
conn_ctx(): Undocumented.
engine_execute(): Undocumented.
read_sql(): Read an SQL query with a number of trials to overcome OperationalError.
read_sql_table(): Read an SQL table with a number of trials to overcome OperationalError.
exec_sql(): Execute an SQL query with a number of trials to overcome OperationalError.
list_schemas(): Lists all schemas.
list_tables(): Lists all tables of a given schema.
list_views(): Lists all views of a given schema.
table_exists(): Checks if a table exists.
create_temp_id_table(): Creates a temporary table to containing a list of ids.

mt.sql.base.frame_sql(frame_name, schema: str | None = None)

mt.sql.base.indices(df): Returns the list of named indices of the dataframe, ignoring any unnamed index.

mt.sql.base.run_func(func, *args, nb_trials: int = 3, logger: IndentedLoggerAdapter | None = None, **kwargs)

Attempt to run a function a number of times to overcome OperationalError exceptions.

Parameters:

func (function) – function to be invoked
args (sequence) – arguments to be passed to the function
nb_trials (int) – number of query trials
logger (mt.logg.IndentedLoggerAdapter, optional) – logger for debugging
kwargs (dict) – keyword arguments to be passed to the function

mt.sql.base.conn_ctx(engine)

mt.sql.base.engine_execute(engine, sql, *args, **kwargs)

mt.sql.base.read_sql(sql, engine, index_col: str | List[str] | None = None, chunksize: int | None = None, nb_trials: int = 3, exception_handling: str = 'raise', logger: IndentedLoggerAdapter | None = None, **kwargs) → DataFrame

Read an SQL query with a number of trials to overcome OperationalError.

The function wraps pandas.read_sql_query(). However, when chunksize is not None, it iterates over chunks and concatenates them. In addition, if logger is not None, a progress bar is shown in that case.

A dataframe is always returned.

Parameters:

sql (str or object) – SQL query to be executed. The query can be a string or an sqlalchemy object that can be used for querying. Passed as-is to pandas.read_sql_query().
engine (sqlalchemy.engine.Engine) – connection engine to the server
index_col (string or list of strings, optional, default: None) – Column(s) to set as index(MultiIndex). Passed as-is to pandas.read_sql_query().
chunksize (int, default None) – If specified, iteratively reads a number of chunksize rows. In this case, a progress bar is also shown if logger is provided.
nb_trials (int) – number of query trials. If chunksize is provided, this is only effective before an iterator is returned from pandas.
exception_handling ({'warn', 'raise'}) – policy for handling SQL-raised exceptions when iterating over many chunks to completely download the result. Only valid when chunksize is provided. Right now there are only 2 policies. Either to raise the exception as-is (‘raise’), or to raise the exception as a warning (‘warn’) and return whatever has been downloaded.
logger (mt.logg.IndentedLoggerAdapter, optional) – logger for debugging
kwargs (dict) – other keyword arguments to be passed directly to pandas.read_sql_query()

Returns:

the output dataframe

Return type:

pandas.DataFrame

mt.sql.base

Functions

`mt.sql.base`

Functions 