# suitescript
m
Any suggestions for dealing with governance in a map/reduce? I'm getting a lot of
SSS_USAGE_LIMIT_EXCEEDED
which I believe is due to the various actions performed in a reduce context.
The reduce loops through 1 journal record for all journal lines with a certain account, does a search to find a purchase order based on a custom field value on the journal line, and then sets the resulting value on the journal line. Doesn't seem like it should be too much governance to me? Though it can run hundreds of times for a single journal entry (meaning there might be hundreds of lines with the relevant expense account on that journal)
reduce code
Assuming I'm right and the hundreds of searches are what's hitting the SSS_USAGE_LIMIT_EXCEEDED (seems to coincide with the error stack), the only thing I can think of is to loop through the journal twice: once to get all the PO ids, then do a single search to build a big hash map, and then a second loop through the lines to check against the hash map?
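The two-pass idea can be sketched with plain helpers outside NetSuite. Everything here is hypothetical — the `poTranid` line shape stands in for whatever `getSublistValue` would return, and the single `N/search` call is only indicated in a comment:

```javascript
// Sketch of the "loop twice" approach: pass 1 collects every PO number
// referenced on the journal lines, one search resolves them all, and
// pass 2 checks each line against the resulting hash map.

// Pass 1: gather unique PO tranids from the lines (line shape is hypothetical).
function collectPoTranids(lines) {
  const tranids = new Set();
  for (const line of lines) {
    if (line.poTranid) tranids.add(line.poTranid);
  }
  return [...tranids];
}

// The single search for all tranids would go here (sketch only), e.g.
//   search.create({ type: 'purchaseorder', filters: [...], columns: ['tranid'] })
// Its results are assumed to arrive as rows of { tranid, id }.

// Build the tranid -> internal id hash map from the one search's rows.
function buildPoLookup(searchRows) {
  const lookup = new Map();
  for (const row of searchRows) lookup.set(row.tranid, row.id);
  return lookup;
}

// Pass 2: resolve each line's PO internal id (null when no match found).
function resolveLines(lines, lookup) {
  return lines.map(line => ({
    ...line,
    poInternalId: lookup.get(line.poTranid) ?? null
  }));
}
```

This trades N searches for one search plus two in-memory loops, which is where the governance saving comes from.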
The reduce runs... something like 90 times (once per relevant journal entry) though this will increase by 1 every month I think
b
do one search for all your purchase orders instead of one per line
your implementation will require you to use a bunch of "OR"s in your filter expression, though you can make the search easier for yourself if you can get actual internal ids of purchase orders instead of po number
m
Yeah unfortunately the only data on the journal line is the tranid
I'd make a big array and do anyof i suppose
pull all the records with internalid and tranid columns into a big array and then loop through the entries a second time and check the array
b
anyof is for select fields, tranid is not a select
m
looks like there's hopefully an Any option though
Hmm that didn't save right, I guess Any is just the default dropdown option?
Maybe a query instead of a search at that point? I assume there'll be some governance issue with 1000 tranids
b
no real difference between a query or search here
you have the same options
a bunch of ors, or a in condition
with the in condition being limited to 1000 ids before you need to start making multiple of them for the ors
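battk's point — "a bunch of ORs, or an IN condition limited to 1000 ids" — can be sketched as a small helper that chunks the id list and ORs the IN conditions together. The `PO.TRANID` column name is taken from the queries later in this thread; treat the helper as a sketch:

```javascript
// Chunk a list into groups of `size`.
function chunk(arr, size) {
  const out = [];
  for (let i = 0; i < arr.length; i += size) out.push(arr.slice(i, i + size));
  return out;
}

// Build "PO.TRANID IN (...) OR PO.TRANID IN (...)" with at most
// `maxPerIn` values per IN condition (Oracle caps IN lists at 1000).
// Single quotes are doubled so a tranid can't break out of the literal.
function buildTranidCondition(tranids, maxPerIn = 1000) {
  const quoted = tranids.map(t => `'${String(t).replace(/'/g, "''")}'`);
  return chunk(quoted, maxPerIn)
    .map(group => `PO.TRANID IN (${group.join(', ')})`)
    .join(' OR ');
}
```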
m
Okay, I just assumed the search would have tighter governance there
b
again, usually you just start with a bunch of 'or's
m
Unfortunately I don't know the maximum number of tranids i need to plan for here
b
usual limit is 10000, so either query or search isn't going to run out of points if you write it correctly
m
okay thanks battk i'll start working on figuring this out
a
My humble advice is to understand how Map Reduces work; once you fully understand them you can start to properly and efficiently design/architect your solution. If I'm not mistaken, NetSuite Map Reduces are an implementation or fork of this: https://static.googleusercontent.com/media/research.google.com/en//archive/mapreduce-osdi04.pdf If you want to skip that, then this: https://docs.oracle.com/en/cloud/saas/netsuite/ns-online-help/section_4387799161.html#bridgehead_1518486832
A properly designed Map Reduce will never run out of governance.
💯 1
l
@mrob to get all internal ids from tranid in one search you can use a formula like this: CASE WHEN {tranid} IN ('PO1234', 'PO4567') THEN 1 ELSE 0 END, with operator EQUAL TO 1. I usually use Array.join(',') to create the group
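Luiz's CASE WHEN formula can be generated from an array rather than hand-joined. This is a hypothetical helper; the comment shows roughly how a `formulanumeric` filter is written in an N/search filter expression, and note that search formulas have a length cap, so a very long list may still need chunking:

```javascript
// Build the formula text for a formulanumeric search filter that flags
// lines whose {tranid} is in the given list. Quotes are doubled so PO
// numbers containing apostrophes can't break the formula.
function buildTranidFormula(tranids) {
  const list = tranids.map(t => `'${String(t).replace(/'/g, "''")}'`).join(',');
  return `CASE WHEN {tranid} IN (${list}) THEN 1 ELSE 0 END`;
}

// Rough usage inside an N/search filter expression (sketch):
//   filters.push(['formulanumeric: ' + buildTranidFormula(poNumbers), 'equalto', '1']);
```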
also, you could make your input data return only the lines that needs to be updated, on map you find the PO, then on reduce you update all lines together
w
```sql
SELECT *
FROM transactionLines TL
   LEFT JOIN transaction PO_T ON PO_T.[documentnumber] = TL.[memo/wherever the reference is]
WHERE TL.transaction = ? -- use your starting journal as a starting point
  AND PO_T.id IS NOT NULL
```
I still feel like this could be resolved with a query in getInputData that only returns the lines that you need to work on.
☝🏻 1
Pass all transaction lines to Map. Group them by transaction id as key and keep all line data as values; in Reduce, process the lines that are included in values. Preferably you will include lineuniquekey or line id from getInputData. You will also retrieve all relevant values from the PO in getInputData as well, so that when you actually update the journal, it will only require the governance units needed for load and save on the journal (perhaps some other small stuff)
m
Thanks Watz and Luiz! I'll dig into those comments
Re: only passing journal lines to map/reduce

Currently the getInputData does 2 things:
1. finds journals with lines with expense account 20003
2. finds inactive vendors associated with those journals and temporarily re-activates them (writes internal ids to a file in the file cabinet, which is then read by the summarize)

Then the reduce:
1. loops through the journal, identifying only the relevant lines, searches for an associated PO for those lines, then sets the value on the line

Summarize:
1. re-inactivates the vendors that were activated in getInputData (read from the file in the file cabinet)

So I don't understand how finding the lines themselves in getInputData makes a difference in regards to governance if I kept this same structure? Meaning, from my perspective, what I need to do is not change what's being passed to the reduce method, but simply figure out a way to perform only a single search after having looped through all the journal lines. I do understand how I could potentially find this information in getInputData, but it seems 'cleaner' or better segmented to me to do it in the reduce (and either loop through the journal twice or use a query between the journal and the POs)?
I already have this suiteql for identifying inactive vendors-
```javascript
var suiteQL = `SELECT V.ID
               FROM Transaction T
               JOIN TransactionLine TL ON T.ID = TL.TRANSACTION
               JOIN Transaction PO ON PO.TRANID = TL.CUSTCOL_AD_PO_NUM_JOURNALS
               JOIN Vendor V ON V.ID = PO.ENTITY
               WHERE T.TYPE = 'Journal' AND V.ISINACTIVE = 'T' AND TL.EXPENSEACCOUNT IN ('${expenseAccountId}')`;
```
so I assume I can just cut it short and do something like...
```sql
SELECT PO.TRANID
FROM Transaction T
JOIN TransactionLine TL ON T.ID = TL.TRANSACTION
JOIN Transaction PO ON PO.TRANID = TL.CUSTCOL_AD_PO_NUM_JOURNALS
WHERE T.TYPE = 'Journal' AND PO.TYPE = 'Purchase Order' AND T.ID IN ('${journal.id}')
```
l
I'd split this processing into: 1. getInputData: return your query so each line will be one execution of map. 2. Map: make the vendors active, map the PO id to an object with line number, PO internal id and any other data needed in Reduce, and write it with the journal entry internal id as key. 3. Reduce: all lines of the same journal will be on the same execution of reduce; load the JE, iterate all lines to update the journal entry, save it, then inactivate all vendors again. 4. Summarize: just return execution errors.
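The shuffle behaviour this plan relies on — every `context.write` with the same key landing in one reduce call — can be simulated with a tiny grouping helper. The map-stage comment is a sketch; real code would use `context.write` with JSON-stringified values:

```javascript
// In the real map stage you would write one pair per journal line, e.g.:
//   context.write({ key: journalId, value: JSON.stringify({ line, poId }) });
// The shuffle stage then groups values by key, so each reduce invocation
// receives all lines of exactly one journal. Simulated grouping:
function shuffle(mapOutput) {
  const grouped = new Map();
  for (const { key, value } of mapOutput) {
    if (!grouped.has(key)) grouped.set(key, []);
    grouped.get(key).push(value);
  }
  return grouped;
}
```

So map may run thousands of times (once per line) while reduce runs only once per distinct journal id.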
m
Interesting, so Map would run 1000s of times but since it's writing context with the same key the Reduce would only run 10s of times? (this would be done by the shuffle stage essentially)
For an idea on total performance, for now the script will be reviewing/updating ~50,000 journal lines, but every month that number will grow (ideally the number that actually are changed dramatically drops after first pass but all 50K+ need to be reviewed for changes every month)
w
You should minimize the "reviewing" by using suiteql to only get the lines that you can update.
In Luiz's proposed division of work across the stages, watch out for two map instances modifying the same vendor. Also, in reduce, make sure that you don't have too many vendors to activate again. It needs to fit inside the 5000 points.
m
Yeah I think I'd have to do the vendors in getinput/summary still
I do think just doing a query in the reduce statements so it runs once/journal will work but I do agree that journal lines in map and journals in reduce is a better structure
w
How many vendors could a single journal contain?
m
hundreds or thousands
They are sort-of a custom type thing built for a revenue recognition process
w
Doing it in getinput and summarize isn't really that much better as you're limited to 10000 points. For all journals.
m
Yeah, for the moment I believe i'm activating 350 vendors
which is a single submitfields for each vendor
So I guess it's 350 x 5 which you're right, is still a notable governance but I think still works in my 10K limit for the moment... I don't expect the inactive vendors to dramatically increase in the near future either. And I do need to get these 4 map/reduce done asap also so I think even if a re-factor for vendor activation is needed it can happen after these go live.
n
As long as you know the 350 records up front, in theory you could chunk this and call a Suitelet one or more times to process the submitFields part. A Suitelet has a governance of 1k; let's say you process 175 in each Suitelet (175*5 = 875), so that's within the 1k limit, and the cost to call the Suitelet is 10 governance, so that's a total of 20 governance in your example ((350/175)*10). I'm saying this without fully considering the order you're doing things. You may need to create an array of promises and await the completion if you need to know they have all been updated before moving on.
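The governance arithmetic in this suggestion can be made explicit. The unit costs below (5 per `record.submitFields`, 10 per Suitelet call, 1000-unit Suitelet limit) are the ones quoted in the thread; verify them against your record types before relying on them:

```javascript
// Plan how many Suitelet calls are needed to submitFields N records in
// chunks, and what it costs the caller. Unit costs are parameters so the
// thread's assumptions are easy to swap out.
function planChunks(totalRecords, perCall, opts = {}) {
  const submitFieldsCost = opts.submitFieldsCost ?? 5;   // per record.submitFields
  const suiteletCallCost = opts.suiteletCallCost ?? 10;  // per external Suitelet call
  const suiteletLimit = opts.suiteletLimit ?? 1000;      // Suitelet governance limit

  const perCallCost = perCall * submitFieldsCost;
  if (perCallCost > suiteletLimit) {
    throw new Error('chunk size exceeds Suitelet governance limit');
  }
  const calls = Math.ceil(totalRecords / perCall);
  return { calls, perCallCost, callerCost: calls * suiteletCallCost };
}
```

`planChunks(350, 175)` reproduces the thread's numbers: 2 calls at 875 units each, 20 units charged to the caller.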
w
300 s / 175 records ≈ 1.7 seconds allowed per record.submitFields(). You wouldn't need to use promises, right? But it might take 2*5 minutes for the two calls to complete instead of 5 minutes.
n
I don't know if you would need to, I'm just proposing that if you need to track the success/failure, and the SuiteLet was returning something meaningful, you may need to use an array of promises if you fired off 2 x 175 requests 😉
Of course time is also a factor not just the raw governance of the api calls. YMMV
w
But you can still track the success/failure if you're doing them synchronously. The only upside with promises would be parallel processing of suitelet-calls. Maybe I'm misunderstanding you. 🙂
n
Ah, sorry yes, I see what you mean, doing it asynchronously is not definitively relevant to tracking success / failure. 👍🏻 My thought was that if you need to know if all calls were successful it'd be more efficient with an array of promises as you can process the .success of each as they land and not wait. Anyhooooow this could all be moot, I just came back from a few days off, catching up with messages and thought it worth throwing it out as a potential option and may not even be relevant 🙂
🌴 1
m
Sorry for the delay but I believe I was successful with my query-
```javascript
var suiteQL = `SELECT PO.TRANID, PO.ENTITY
               FROM Transaction T
               JOIN TransactionLine TL ON T.ID = TL.TRANSACTION
               JOIN Transaction PO ON PO.TRANID = TL.CUSTCOL_AD_PO_NUM_JOURNALS
               WHERE T.TYPE = 'Journal'
                 AND PO.TYPE = 'PurchOrd'
                 AND T.ID IN ('${journalRecord.id}')
                 AND TL.EXPENSEACCOUNT IN ('${expenseAccountId}')
                 AND PO.ENTITY IS NOT NULL`;
```
for my vendor script (runs once per reduce/journal entry)
And
```javascript
var suiteQL = `SELECT PO.TRANID, PO.ENTITY AS PO_ENTITY, SO.ENTITY AS SO_ENTITY
               FROM Transaction J
               JOIN TransactionLine JL ON J.ID = JL.TRANSACTION
               JOIN Transaction PO ON PO.TRANID = JL.CUSTCOL_AD_PO_NUM_JOURNALS
               JOIN Transaction SO ON SO.ID = PO.CUSTBODY_RSM_SO_REFERENCE
               WHERE J.TYPE = 'Journal'
                 AND PO.TYPE = 'PurchOrd'
                 AND SO.TYPE = 'SalesOrd'
                 AND J.ID IN ('${journalRecord.id}')
                 AND JL.EXPENSEACCOUNT IN ('${expenseAccountId}')
                 AND PO.ENTITY IS NOT NULL`;
```
for my customer script which is running now fingers crossed
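One hedged note on the queries above: interpolating `${journalRecord.id}` directly into the SQL string works, but `query.runSuiteQL` also accepts `?` placeholders via its `params` option, which avoids quoting problems. The NetSuite call below is commented out so the result-mapping helper stands alone; the row shape assumes `asMappedResults()`:

```javascript
// Sketch: run the vendor query with a bind parameter, then index the
// mapped results by PO tranid.
//
// const rows = query.runSuiteQL({
//   query: `SELECT PO.TRANID AS tranid, PO.ENTITY AS entity
//           FROM Transaction T ... WHERE T.ID = ? AND ...`,
//   params: [journalRecord.id]
// }).asMappedResults(); // assumed shape: [{ tranid: 'PO123', entity: 42 }, ...]

// Turn the mapped rows into a tranid -> entity (vendor id) lookup.
function toEntityLookup(rows) {
  const lookup = new Map();
  for (const row of rows) lookup.set(row.tranid, row.entity);
  return lookup;
}
```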