import pandas as pd★Universal alias — alwayspd.import numpy as npAlmost always imported alongside.pd.set_option('display.max_columns', None)★Tune display / compute settings (accepts a dict in 3.0).with pd.option_context('display.max_rows', 100):Temporarily override options in a block.pd.__version__ pd.show_versions()Version / full environment report.
pd.read_csv('f.csv')★CSV → DataFrame; args:usecols,dtype,parse_dates,chunksize.pd.read_excel('f.xlsx', sheet_name='S1')★Read one Excel sheet (or a list / None for all).pd.read_parquet('f.parquet')★Columnar Parquet — fast & typed.pd.read_json / read_html / read_xmlJSON / every <table> on a page / XML.pd.read_sql(query, conn)SQL query result (alsoread_sql_table).pd.read_csv(..., dtype_backend='pyarrow')★Read straight into Arrow-backed dtypes.
df.to_csv('f.csv', index=False)★index=Falseskips writing row labels.df.to_parquet('f.parquet')★Columnar, compressed, preserves dtypes.df.to_excel('f.xlsx', sheet_name='S1')Write to an Excel sheet.df.to_sql('table', conn, if_exists='replace')Write to a SQL table.df.to_dict() df.to_numpy() df.to_markdown()Convert to other in-memory forms.
pd.Series([1,2,3], index=['a','b','c'])★One labeled 1D array.pd.DataFrame({'a':[1,2], 'b':[3,4]})★Dict keys → columns, values → rows.pd.DataFrame(data, columns=[...], index=[...])From records with explicit labels.pd.date_range('2024-01-01', periods=5, freq='D')★Build a DatetimeIndex.pd.DataFrame().convert_dtypes()Infer the best nullable dtypes for each column.
df.head(n) df.tail(n)★First / last n rows (default 5).df.shape df.info() df.describe()★Dimensions / dtypes+nulls+memory / summary stats.df.dtypes df.columns df.indexPer-column types / column & row labels.df.memory_usage(deep=True)Per-column memory (deep = true object cost).df.sample(n) df.nunique() df['c'].unique()Random rows / distinct counts / distinct values.