Creating Point Collections
Last updated
Last updated
Point collections refer to a number a different locations (points on a map) and are typically grouped by a specific subject matter. For example a dataset of schools in South Africa.
Collections are created by uploading a csv file and associating the file with a theme and sub-theme.
Prepare a csv file containing at least the following fields: Name, longitude, latitude. You can see an example below:
In addition to these fields, various other fields can also be included as attributes for the point. These are shown on the point tooltip. An example of a file with additional attributes shown below:
Include the best identifier or identifying information
It's a good idea to include an standard identifiers for points in your dataset. This can make future updates and cross-referencing much easier. For example, for official facilities like public schools in South Africa, use their EMIS number.
See more
The "Name" column of your point collection cannot be a number data type. It may be necessary to convert the column to a text/string format before uploading. This can be done easily in Microsoft Excel and is illustrated in the figure below.
NOTE: It may sometimes be more suitable to change which column is given the "Name" heading as this is what is displayed in the tooltip on the map. In the screenshot below, the "Description" column would be a more appropriate choice to be made the "Name" column.
Once the file is ready to be uploaded, navigate to Point Collections
and click Add
which will allow you to name the collection and upload the file.
Select the profile which should be associated with these points and select the theme they belong to. Provide a label and the source of the data and proceed to upload.
Uploads to existing point collections add data without replacing existing points.
To add additional points to a point collection, prepare the file with the new points to be added, as outlined in the Preparing the points dataset section.
Select the point collection you would like to update, upload the file with the new additional point data, and proceed to save the changes.
You can edit information for a specific point by selecting it under Points > Locations.
To rectify/revise/update previously updated data in bulk, delete the existing point collection and recreate the adjusted file containing all the data.
Long term maintenance to a point dataset often involves the following tasks:
removing points that are no longer relevant
adding points that are not already in the database
updating data about points already in the database (e.g. opening hours, services provided, correcting/improving a description)
To be able to do this, you need to be able to compare your updated data to the data you have already uploaded to a collection in Wazimap.
To match incoming data to your existing data, you need an identifier. Failing that, you will need to use whatever other identifying information you have available.
An identifier is a value which uniquely and consistently identifies an object - in this case, a point in your point collection.
It is important to include some kind of identifier in your point data to facilitate updates to the data. Downstream users may also need to be able to rely on your identifiers being unique and consistent to be able to incorporate your updates if they keep a copy of your point data.
If your data does not have an official identifier that is consistent over time, and unique per point, think about how you will check if you have any duplicates and check whether newly-provided points are already in your database - e.g. by name, address, and so on. Think about what your users will be able to provide you, and include that in what is shown to them as well.
Consider making up and maintaining your own unique consistent identifier. We suggest using UUIDs because they are globally unique, and not just unique within one table. This is important in case you want to combine tables later, e.g. if you want to merge "public schools" and "private schools" into one "schools" table and continue to use your unique identifier.
You can use the following formula in excel to create a UUID:
When filled down a column, these identifiers will look like
Make sure to copy and paste as text (not the formula) into a new column and use the text version of it going forward. You don't want it to calculate a new random UUID for existing points.
In Excel, see Conditionally format duplicate values
In Workbench
add a deduplication step
select a column whose values probably ought to be unique
look for rows where duplicate number is greater than 1
If you have a consistent unique identifier:
In Excel, use VLOOKUP()
In Workbench, perhaps Join Tabs might work
If you have messy data with different capitalisation and potential spelling mistakes:
Try using OpenRefine with CSV reconcilliation