Kel
06/15/2021, 10:19 PMbattk
06/15/2021, 10:32 PMbattk
06/15/2021, 10:32 PMKel
06/15/2021, 10:42 PMbattk
06/15/2021, 11:00 PMbattk
06/15/2021, 11:01 PMstalbert
06/15/2021, 11:23 PMgetInputData
and map
?Kel
06/15/2021, 11:37 PMstalbert
06/15/2021, 11:43 PMmap()
That is, map() runs in parallel but not until getInputData() has finished entirely. Would love to hear advice to the contrary but that's been my empirical observcations.stalbert
06/15/2021, 11:45 PMKel
06/15/2021, 11:46 PMgetInputData
stage, there is a long delay between getInputData
and map
?stalbert
06/15/2021, 11:47 PMgetInputData
in a MR script so it never even reached map
.stalbert
06/15/2021, 11:49 PMmap()
data in parallel and incrementally while your data was being returned from getInputData()
but it doesn't seem to operate in that sort of 'streaming' fashion.stalbert
06/15/2021, 11:50 PMgetInputData
returned a search reference, behind the scenes it must have been executing the search and trying to load ALL the results from the entire search. Perhaps the same is happening with ALL the lines from the file you're iterating?Kel
06/15/2021, 11:52 PMKel
06/15/2021, 11:54 PMbattk
06/15/2021, 11:55 PMbattk
06/15/2021, 11:55 PMbattk
06/15/2021, 11:55 PMKel
06/15/2021, 11:56 PMbattk
06/15/2021, 11:57 PMbattk
06/15/2021, 11:58 PMKel
06/16/2021, 12:00 AMKel
06/16/2021, 12:00 AM