ETL Options
  • 14 Oct 2023
  • PDF

ETL Options

  • PDF

Article Summary

ETL Edit Transform Load 

Tools that can be used to ETL your data into your system.

ESRI

ArcGIS 

Data Interoperability Extension: This extension provides ETL capabilities within ArcGIS. It supports reading CSV files and FGDBs and offers many toolboxes used for data extraction, manipulation, and loading.

ArcGIS Pro

ArcGIS Pro offers an array of geoprocessing tools for ETL operations. It supports CSV and FGDB formats and allows you to perform various transformations and load data into different formats or databases.

ArcMap

ArcMap also supports ETL operations. With tools like the Table to Table or Feature Class to Feature Class, you can extract and transform data from CSV files and FGDBs.


QGIS

QGIS Processing Toolbox 

QGIS provides a processing tools that can be utilized for data processing. You can use tools like CSV Layer and Field Mapper to extract data from either a CSV or FGDB

QGIS Python Plugins

Plugins such as Table Manager and Table Manager Extended offer functionalities that allow you manipulate SmartFabric data. 

QGIS Database Manager: 

QGIS has a built-in database manager that allows interaction with spatial databases. CSV data or FGDBs can be imported into spatial databases or perform transformations. 


FME (Feature Manipulation Engine)

 FME is a data integration platform that specializes in ETL operations. It supports reading a multitude of data types including CSV and FGDB. FME has a very user friendly visual interface that allows a user to be particular in the data transformation they are trying to perform. 


GDAL (Geospatial Data Abstraction Library)

ogr2ogr

Command-line tool provided by GDAL allows you to extract data from either format, perform transformations, and load the data into different formats or databases. It supports many different geospatial formats.

ogrinfo

Another GDAL command-line tool, ogrinfo, can be used to query information about data sources. It provides metadata and schema details, which can be useful for designing ETL processes.

GeoKettle

Open source geospatial ETL tool. It supports CSV files and FGDBs as data sources and provides a graphical interface for designing ETL processes. 


SQL Shell (psql)

This command can be utilized to ETL data stored in a CSV into a PostgreSQL database. You can configure the database you wish to connect to within the command prompt utilizing a command like: psql -h <hostname> -d <database_name> -U <username>. Once you have connected to the database you can use a COPY FROM command that will take a locally stored CSV file and import it into the PG database. COPY <table_name> FROM '/path/to/csv/file.csv' DELIMITER ',' CSV HEADER;. 


Oracle Database

Oracle Data Pump

SQL*Loader can be used to ETL data into an Oracle database. If the data is being received in an FGDB format it must be converted into a suitable like a CSV for migration. You can utilize FME or ogr2ogr to do so. 

In Oracle Spatial, you can utilize the extension SDO_GEOMETRY that enables the storage, indexing, and querying of spatial data. 


Google BigQuery

This is a service provided by Google Cloud and allows you to analyze large datasets using SQL queries. BigQuery can directly load a new table in CSV format so long as it is uploaded and referenced in a Google Cloud bucket. 


Was this article helpful?