I'm working with a 37GB dataset of about 200 million points. I have an empty geom field, that I need to update. Before doing things over and over, I wonder if anyone knows if using THREADS helps an UPDATE statement (part of me thinks it is I/O bound because it has to write the table). But, another part of me thinks it is embarrassingly parallel because each record is independent of one another. The base query would look like this: UPDATE TABLE [mergedtable] SET geom = GeomMakePoint(VectorMakeX2(lon, lat)); but, would using THREADS make it faster? UPDATE ( SELECT [mfd_id], [geom],(GeomMakePoint(VectorMakeX2(lon,lat))) AS [geom New] FROM CALL Selection([mergedtable], TRUE) THREADS SystemCpuCount() ) SET [geom] = [geom New]; I'm not being lazy, it just take a long time (obviously), so before running the second query as a test, I wondered if anyone had thoughts on it.
|