Uploading your own data¶
Uploading your datasets to GeoHub is quite straightforward. You can follow the steps descibed in this section to upload them easily.
Data upload¶
Firstly, please open datasets page from this link.
You can first DATA UPLOAD button at the header of datasets page. If you cannot see upload button, that means you have not signed in. Please follow this steps to login first.
--
You will be redirected to data upload page. The basic operation flow is shown in the following figure.
--
The key elements on data upload page are as follows. Each number is corresponded to the number on the above figure.
- Drag and drop files to a box or select files from select files button.
- Check all selected files appears in the table.
- As default, GeoHub will split all datasets into each file. If you want to convert multiple datasets into a file, please tick the checkbox.
- Once everything is ready, you can click UPLOAD button to proceed.
- Click Clear All button if you want to reset all selected files.
Single layer vs Multi layer dataset
--
Please stay in this page until all uploads will be completed, you will be automatically redirected to datasets page once all uploads are done.
Note
Most of geospatial data formats are a single file. However, if you want to upload some specific formats like ESRI Shapefile, please make sure you have selected all essential files (.shp
, .prj
, .dbf
, .shx
, etc) or make a zip file.
ArcGIS FileGeoDatabase (.fgb
) also needs to be archived as a zip file to upload.
Warning
GeoHub does not limit the maximum file size to upload. But you maybe encounter some error when you upload extremely large geospatial dataset (generaly more than 1GB file size might have issues of uploading). This may occur due to our data pipeline spec (CPU and RAM) is limited. Please try to avoid uploading such large files. If you want to upload a large dataset and encounter some errors, please contact to our team through email or GitHub.
Check your data at My data¶
--
Once your file is uploaded, you will be able to see it in Uploaded Data tab at datasets page.
--
If status becomes Processed, your file is ready to publish.
--
The following status maybe appears.
- Downloaded: Pipeline received a task and downloaded your file.
- Processing: Pipeline started ingesting your file
- Processed: Pipeline finished ingesting, but not published yet. The table will be automatically refreshed.
- Partially done: Some files completed ingesting, but some files are failure.
- Error: Failed to ingest all files.
- Published: all datasets associated to this uploaded dataset were published
Uploaded dataset menu¶
You can open the operation menu for uploaded dataset.
--
- Download: You can download original dataset from here
- Show logs: See this section
- Cancel: You only can see this before ingesting is completed. You can cancel ingesting by the pipeline.
- Delete: You can delete this data. but you cannot delete it if one of ingested datasets are published.
Check data pipeline logs¶
Sometimes you might want to see logs from data pipeline.
--
The following screenshot shows how you can show logs for your dataset.
- Click three dotted button at the end of row of a dataset.
- Click Show logs menu.
Note. Each number of instruction is corresponded to the number on the above figure.
Note
All logs from data pipeline can be seen as a popup dialog. If your data failed to be ingested by the data pipeline, please send all logs to GeoHub team by sending email or opening an issue at GitHub.
Next step¶
In next section, you are going to learn how to register a remote file source other than uploading.