geodatabase
This modules currently has this unique class: Geodatabase.
The Geodatabase class makes it possible to interact with spatialite databases. Example of projects using this package: kad-buildings or kad-clustering.
Created on Wed Dec 23 17:31:33 2020
@author: Youri.Baeyens
- class geodatabase.Geodatabase(db: str = ':memory:', srid: int = 31370, recreate: bool = False)[source]
Geodatabase.
Available methods:
- attach(path: str, alias: str)[source]
Attach external db.
- Example
>>> attach('c:\path o\mydb.sqlite','mydb')
- checksForTable(table: str) bool [source]
Detects table without rowid or with shadowed rowid.
- Raises
Exception – Exception raised if either shadowed rowid or absence of rowid raises are detected.
- createSpatialIndex(table: str, geometry: str = 'geometry')[source]
Create a spatial index.
- Raises
exception – Exception is thrown if the creation of spatial index failed.
- executeScript(query: str, **queryParameters) list [source]
Execute a sql script. The script can contain many SQLs separated by ;.
- Parameters
query – Path to the sql query. Do not specify the .sql extension.
- QueryParameters
list of “parameter=value” parameters that can be used inside the query.
- findConnectedComponents(inputTable: str, edgesColumnNames: [<class 'str'>, <class 'str'>], outputTable: str) pandas.core.frame.DataFrame [source]
Finds connected components of an undirected graph.
- Parameters
inputTable (str) – DESCRIPTION
edgesColumnNames (list) – DESCRIPTION
outputTable (str) – DESCRIPTION
- Returns
DESCRIPTION
- Return type
TYPE
- Example
>>> db.findConnectedComponents('t01_edges',['elt_a','elt_b'],'t02_cc')
- getGeometriesMetadata() pandas.core.frame.DataFrame [source]
Get geometries metadata.
- Returns
DataFrame with available metadata on geometries.
- Example
>>> db.getGeometriesMetadata()
f_geometry_column geometry_type coord_dimension srid spatial_index_enabled f_table_name t01_zip geometry 3003 4 31370 1 t04_zip_sample geometry 3 2 31370 1
- Example
>>> db.getGeometriesMetadata().loc['t01_zip',:] f_geometry_column geometry geometry_type 3003 coord_dimension 4 srid 31370 spatial_index_enabled 1 Name: t01_zip, dtype: object
- getListOfColumns(table: str, withLengths: bool = False, withDistincts: bool = False) pandas.core.frame.DataFrame [source]
Documents the structure and content of a table. The returned pandas DataFrame has got this structure:
cd_column_type: [‘INTEGER’,’DOUBLE’,’TEXT’,’POLYGON’,…]
fl_notnull_constraint: 1=nulls are not allowed, 0=nulls allowed
default_value
fl_primary_key: 1=a primary key, 0=not a primary key
ms_max_length: max length of column expressed in characters or bytes
- Example
>>> db.getListOfColumns('t01_zip',withLengths=True)
cd_column_type fl_primary_key ms_max_length tx_column_name pk_uid INTEGER 1 4.0 join_count INTEGER 0 5.0 nouveau_po TEXT 0 4.0 frequency INTEGER 0 2.0 cp_speciau INTEGER 0 1.0 shape_leng DOUBLE 0 13.0 shape_area DOUBLE 0 13.0 geometry POLYGON 0 142708.0
- getListOfTables() pandas.core.frame.DataFrame [source]
Get list of tables in geodatabase (including indexes and count of rows).
- Example
>>> db.getListOfTables()
ms_countOf_geometries ms_countOf_spatial_indexes ms_countOf_rows f_table_name T01_zip 1 1 1268.0 T02_PIP 0 0 225263.0 t03_sample 0 0 225262.0 t04_sample 0 0 2.0 t04_zip 1 1 10.0
- getTableCountOfRows(table: str) int [source]
Counts the number of rows in table.
- Example
>>> db.getTableCountOfRows('T01_ZIP') 1268
- inspectGeometry(table: str, geometry: str = 'geometry') list [source]
Inspects the nature of a table geometry.
- Returns
List of (geomType, srid, coordDimension, ms_countof_rows) t-uples.
- Example
>>> db.inspectGeometry('T01_ZIP') [('POLYGON ZM', 31370, 'XYZM', 1268)]
Warning
a Geometry can contain various types of geometries (eg: both Polygons and Multipolygons). As a consequence the returned list can contain multiple elements.
- loadCsv(csvFile: str, table: str, if_exists: Literal['fail', 'replace', 'append'] = 'replace', **readParameters)[source]
Loads csv file and into a table.
- Parameters
csvFile – Path to csv file. File extension should be mentionned.
table – Table name.
if_exists – What to do if table already exists.
readParameters – Parameters for the pandas.read_csv function.
- Returns
void
- Example
>>> dtype={ >>> 'ADR_ID':'str', >>> 'ADR_CREAT':'str', >>> 'ADR_MODIF':'str', >>> 'X':np.float64, >>> 'Y':np.float64 >>> } >>> parse_dates=['ADR_CREAT', 'ADR_MODIF'] >>> db.loadCsv('myFile.csv', >>> 'myTable', >>> dtype=dtype, >>> parse_dates=parse_dates, >>> sep='|')
- loadEmbeddedCsv(file: str, table: str, if_exists: Literal['fail', 'replace', 'append'] = 'replace', **readParameters)[source]
Loads an embedded csv file and into a table.
- Parameters
file – Embedded csv file.
table – Table name.
if_exists – What to do if table already exists.
readParameters – Parameters for the pandas.read_csv function.
- Returns
void
- Example
>>> file=''' >>> cd_sex|tx_sex >>> M|Male >>> F|Female >>> ''' >>> db.loadEmbeddedCsv(file,'t_sex',sep='|')
- loadShp(shapeFile: str, table: str, encoding: str = 'CP1252', srid: Optional[int] = None)[source]
Loads a shapefile and spatially indexes the resulting table.
- Parameters
shapeFile – Path to shapefile. File extension should be .shp and cannot be mentionned in shapeFile.
table – Name of the table to be created. This table will be automatically spatially indexed.
encoding – The character encoding adopted by the DBF member, as e.g. UTF-8 or CP1252.
srid – EPSG SRID value.
- Returns
void
- Example
>>> import geodatabase as gdb >>> db=gdb.geodatabase() >>> db.loadShp('C:/.../postaldistricts','T01_zip') Loading shapefile at 'C:/.../postaldistricts' into SQLite table 'T01_zip' Inserted 1268 rows into 'T01_zip' from SHAPEFILE
- pointInPolygon(points: str, polygons: str, outputTable: str)[source]
Assigns point to polygons.
- Parameters
points – Name of the table containing points.
polygons – Name of the table containing polygons.
outputTable – Name of the output table.
queryParameters – List of “parameter=value”. Those parameters can be used inside the query.
- Returns
void.
Warning
Do not VACUUM your database ! If you do, you risk to break your link table.
- recoverGeometry(table: str, geometry: str = 'geometry')[source]
Recovers geometry, ie, registers the geometry in spatialite’s metadata.
- Parameters
table – Name of the table containing the geometry.
geometry – Name of the geometry column.
- Raises
exception – Exception is raised if geometry cannot be recovered.
- Returns
void
- recoverSpatialIndex(table: str, geometry: str = 'geometry')[source]
Recover Spatial Index.
A delete operation followed by a VACCUM operation could leave an index in an inconsistent state.
This method checks for the consistency of the spatial index. If the spatial index is inconsistent then try to recover the index (ie, rebuild). If index recovery failed then an exception is thrown.
- Raises
Exception – Raises an Exception if geometry is inconsistent.
Warning
Inconsistent indexes can have a dramatic impact. Read this note on SpatialIndexes carefully! Please, run this method if you have a doubt on the integrity of your spatial index.
- reproject(inputTable: str, srid: int, outputTable: Optional[str] = None, if_exists: Literal['fail', 'replace'] = 'fail')[source]
Reproject.
This method reprojects a geometry.
- Parameters
inputTable – Name of the table containing the geometry to reproject.
srid – Reproject into the Reference System identified by srid.
outputTable – Name of the output table containing the reprojected geometry.
- Returns
void
- toCsv(table: str, archive_name: Optional[str] = None, mode: Literal['w', 'a'] = 'w', outputDirectory: str = '.', columnsToExclude: list[str] = ['geometry'])[source]
Exports a table to a csv file.
- Parameters
table – Table name.
outputDirectory – Output directory.
columnsToExclude – Columns to exclude from export.
- Returns
void
- Example
>>> db.toCsv('T01_table','../output')
- toSas(table: str, archive_name: Optional[str] = None, outputDirectory: str = '.', columnsToExclude: list[str] = ['geometry'])[source]
Create as sas program to import the csv export of a table.
- Parameters
table – Table name.
outputDirectory – Output directory. The default is ‘.’.
columnsToExclude – Columns to exclude from sas import. The geometry column is excluded by default as geometries cannot be imported in sas.
- Returns
void
- Example
>>> db.toSas('t03_pip_sample')
Content of ./t01_zip.sas
filename myZip ZIP 't03_pip_sample.zip'; data t03_pip_sample(compress=YES); length id_rec 8 rowid_point 8 rowid_polygon 8 ; infile myZip(t03_pip_sample.csv) dsd delimiter='|' encoding='utf8' firstobs=2 missover lrecl=10000; input id_rec rowid_point rowid_polygon ; run;