# general
m
the problem is i have a very large set of results i’m processing and each result may require significant governance units. apparently map/reduce has a hard per-result cap on governance and it won’t yield automatically if you exceed it, it’ll just throw an error. it only yields if governance is exceeded for the overall phase
b
what kind of hardcore processing are you doing that requires over 5000 points per result
and how much do you potentially need
s
structured right, you could potentially give each "result" quite a lot of governance units. For example, there is no reason that you are limited to just one Reduce phase per result. You could do some pre-processing of the result in the Map phase, and for each unit of work, create a Reduce context to deal with it. I have done this before, using complex keys (some id or value unique to each result + a sequence letter, for example). You might need conditional logic in the Reduce phase to perform different work depending upon the sequence letter in the key, but it is doable. However, if all of the processing for each result has to be performed sequentially, then it won't help, as you can't guarantee the ordering of the reduce phases.
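a rough sketch of the complex-key idea (untested; `splitIntoUnits` and the key format are made up for illustration):
```javascript
// map: break one big result into independent units of work, each written
// under its own compound key so each unit gets its own reduce context
// (and therefore its own governance budget)
function map(context) {
    var result = JSON.parse(context.value);
    var units = splitIntoUnits(result); // hypothetical pre-processing helper
    units.forEach(function (unit, i) {
        context.write({
            key: result.id + String.fromCharCode(65 + i), // "123A", "123B", ...
            value: JSON.stringify(unit)
        });
    });
}

// reduce: branch on the sequence letter if different units need different work
function reduce(context) {
    var sequenceLetter = context.key.slice(-1);
    if (sequenceLetter === 'A') {
        // ... one kind of processing
    } else {
        // ... another
    }
}
```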
m
Ok, thanks for the suggestion. Basically, i’m loading an advanced promotion discount search, going through each result, and if the result uses an item saved search for eligible items, i load that search and store the item names from the search into an array. the problem is there are potentially millions of items that could be returned for one item search per result of the promo discount search
b
how many advanced promotion discounts are there
if there are less than 5 million or so, you can make the getInputData step divide your results into search pages
your map step can get the results for each page
and your reduce can combine the results into your array
m
ok. maybe i’ll try that. thanks!
Is there a way to push all my map/reduce results into an array so i can export them to a file when they’re done? i tried pushing each processed result from the reduce phase to a global variable and then exporting that variable during the summarize phase, but that doesn’t work. i can’t export the processed result during the reduce phase because then that just gives me a separate file per result
b
global variables won’t work (at all honestly)
they don't share the same context
you are supposed to use the summaryContext output for that
my general warning: any of netsuite’s iterator functions require you to return true to continue iterating
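something like this (sketch; the file name and folder id are placeholders, getInputData/map omitted):
```javascript
/**
 * @NApiVersion 2.x
 * @NScriptType MapReduceScript
 */
define(['N/file'], function (file) {
    // getInputData and map omitted for brevity

    function reduce(context) {
        // whatever you write here is what shows up in summaryContext.output
        context.write({ key: context.key, value: JSON.stringify(context.values) });
    }

    function summarize(summary) {
        var all = [];
        summary.output.iterator().each(function (key, value) {
            all.push(JSON.parse(value));
            return true; // forget this and you only ever see the first entry
        });
        var jsonFile = file.create({
            name: 'promo_export.json',   // placeholder name
            fileType: file.Type.JSON,
            contents: JSON.stringify(all)
        });
        jsonFile.folder = 123;           // placeholder folder internal id
        jsonFile.save();
    }

    return { reduce: reduce, summarize: summarize };
});
```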
m
yeah i was having trouble retrieving the output property. i’m new to map/reduce, so i’m sure i’m probably doing something wrong. can i pass stuff from one phase into the summary context and access it through the output property?
b
you can only pass the keys and values you write
what does your attempt to use the output look like
m
i think i couldn’t log it at all
i’m going to take another look though
b
keep in mind that a lot of the stuff in ss2 use getter and setter functions
not everything will log
m
i’ve got a more serious problem at the moment. my reduce phase timed out because i guess the result was too big. i thought i might run into this, which is why i was trying to figure out how to yield in a scheduled script using 2.0. during the reduce phase, if my promo object has an item saved search, i’m running the search and storing the results in an array. but some of these searches pull 2-3 million results. once you hit that per result cap with a map/reduce, the script just fails
i don’t think i can spread out the process in the map phase either. i mean the logic is pretty minimal. i’m passing this promo object from the map phase to reduce. the reduce phase just checks if there’s a saved search id. if there is, it runs the search and returns the results. that’s all i’m having it do. i don’t see how i can break that logic up
b
well, there is no yielding in ss2 scheduled scripts
m
*can’t
yeah that’s what i hear
makes it tough though if one result happens to be huge. might have to write this in 1.0
b
i still say do the promo search in getInputData and have it get all the searches you have to run
you can make it so that the object you return has keys of the search to run and values of an array of pages for the search
your map writes the results of each page
and your reduce has 5000 points to combine your search results
m
can getInputData handle that much though? the logic from the top down is: run a promo search that may have around 100 results, push the fields i need into an object, check if it’s got an item saved search, if it does, push all the items into an array and store that in the original object, then at the end push all the results into an array and create a json file
if i search for my promos, and each result of that search may have an item search … that just seems like a lot of searching for the getInput phase
b
100 searches at 5 points each is 500 points
you don’t need to fetch the results
sorry
m
right. but what if one promo result has an item search that contains a few million
b
bad math
you don’t run the item search
m
no?
b
use Search.runPaged to get a PagedData object
it costs 5 points
it tells you meta information about the search results
you would be interested in the pageRanges, which tell you how to fetch the results
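e.g. (sketch; the search id is a placeholder and it assumes N/search is imported as `search`):
```javascript
var itemSearch = search.load({ id: 'customsearch_items' }); // placeholder id
var pagedData = itemSearch.runPaged({ pageSize: 1000 });    // 5 points, no matter how many results

log.debug('total results', pagedData.count);
log.debug('page count', pagedData.pageRanges.length);
// each entry in pagedData.pageRanges has an index you can later
// hand to pagedData.fetch({ index: ... }) to pull that one page
```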
m
ok. so you’re saying get pagedData objects for all my item searches during the getInput phase, then fetch the results in another phase?
b
correct
m
gotcha. ok, i may try that. thanks!
b
you would need to split the data so that each key processed by a map would represent one page of data
i guess you could do multiple pages per key if you really want
but you would want to fit within the 1000 points a map invocation gets
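so getInputData would look something like this (sketch; the search and field ids are placeholders):
```javascript
function getInputData() {
    var keys = {};
    search.load({ id: 'customsearch_promo_discounts' }) // placeholder promo search
        .run()
        .each(function (result) {
            var itemSearchId = result.getValue({ name: 'custrecord_item_search' }); // placeholder field
            if (itemSearchId) {
                // 5 points to load + 5 to runPaged, per promo that has an item search
                var pagedData = search.load({ id: itemSearchId }).runPaged({ pageSize: 1000 });
                pagedData.pageRanges.forEach(function (range) {
                    // one page of one search per map key
                    keys[itemSearchId + '|' + range.index] = '';
                });
            }
            return true; // keep iterating
        });
    return keys; // each key becomes one map invocation
}
```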
m
fyi i don’t think getting the paged data for the item searches ahead of time during getInput will work, because when you pass that data into the other phases it gets stringified. you can still read plain properties like the page ranges off the object, but methods like fetch don’t survive serialization, so you can’t use them to retrieve fields and such
b
plan on getting a new PagedData each time and using the stringified pageRange to tell which index to fetch
should work almost the same, unless search results changing between getInputData and map is a real concern
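map would then be something like this (sketch; same placeholder ids and key format as above):
```javascript
function map(context) {
    var parts = context.key.split('|');
    var searchId = parts[0];
    var pageIndex = parseInt(parts[1], 10);

    // the PagedData from getInputData lost its methods when it was
    // stringified, so get a fresh one here: 5 points to load, 5 to run
    var pagedData = search.load({ id: searchId }).runPaged({ pageSize: 1000 });
    var page = pagedData.fetch({ index: pageIndex }); // 5 points for this one page

    page.data.forEach(function (result) {
        // group every page of the same search under one reduce key
        context.write({
            key: searchId,
            value: result.getValue({ name: 'itemid' }) // placeholder column
        });
    });
}
```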
m
Yeah i don’t think map/reduce is going to work. The reduce phase has a 5k per result limit. Let’s say all i’m doing in that phase is fetching all my paged data. Some of these results may have 2-3 million lines in the search. Take 2.5 million as an example … at 1k lines per page that’s 2,500 pages i’ll have to fetch. A single fetch() is 5 units. That means to fetch 2,500 pages it will cost me 12.5k units, far exceeding the reduce phase limit. I’m thinking i may have to script this in 1.0
b
i’ve been trying to steer you to fetch individual pages in the map phase instead of getting all the pages in the reduce phase, specifically to avoid that problem
m
Yeah but the map phase has a significantly lower limit, only 1k
per result
that means i can only do 200 fetches before i hit the limit
i may need 2,500 fetches
b
individual pages
fetch 1 page per map key
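and then the reduce barely costs anything (sketch):
```javascript
function reduce(context) {
    // context.values already holds every item from every page of this
    // search; no search calls needed here, just combine and pass along
    context.write({
        key: context.key,                     // the item saved search id
        value: JSON.stringify(context.values) // full combined item list
    });
}
```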