PCD switch statements

jdean · Joined: 28 Jun 2018 Posts: 10

Greetings all!

I'm looking at using a large switch statement for a binary parser using the PCD 5.081 compiler. After reading about some of the tricks to get the PCH compiler to generate jump tables for switch statements here on the forum, I was hopeful I'd have similar luck with PCD.

I've run tests trying different types in the switch statement (8/16bit, signed/unsigned), omitting the default case, different numbers of cases (12, and 500), and tried substituting breaks for the return statements I was initially using.

Every single switch statement I've seen generated uses the same XOR ###, BZ ### approach for every case. Fine for a small number of cases, but I'm hoping for something better suited to a large set of cases.

Has anyone been able to get any other behavior out of switch/case on PCD?

Any observations or thoughts are appreciated

jdean · Joined: 28 Jun 2018 Posts: 10

*** note, the case values in the previous post are sequential enum values, placed in order. Presently both the enum and switch statement are generated by a separate tool from a specification.

Alright! So, proceeding under the assumption that this is the only output PCD will generate for switch statements, I'm now trying to generate a table-based switch construct.

Since the full set of cases are known by the generator, the idea is to produce a define which will goto a label corresponding to the enum.

Easy right? Well... it appears that relative branches aren't supported by the #asm directive. Absolute branches compile fine, but a relative branch in the form of "BRA W4" produces "Expecting an opcode mnemonic"

Surprising, since this particular form is both supported by my chip, and in the CCSC documentation under the "#asm" topic.

Am I on my own?

Ttelmah · Joined: 11 Mar 2010 Posts: 19962

Remember there is nothing to stop you having an array of functions.

Call the one specified by an index. Instant call table. Call the functions something like case10 etc., and it can be constructed very quickly probably with a macro.

Talk to CCS about PCD not using a table. I can understand them not switching to this, for a lot more cases than in PCH/PCM, since the PIC24/30/33 are much more efficient at handling the test and branch, but they should still switch to the table form at a larger number of cases.

If you study the branch they use, they don't individually test every case. The tests are done in 'segments', and so only a limited number of tests will be used for any value. The actual speed difference to a table will be very small indeed.
Attached a crude demo of a 'case' table:

jdean · Joined: 28 Jun 2018 Posts: 10

Ttelmah: It looks like the array of function pointers is probably going to win out here.

I've resumed testing the switch statements as I've never seen the compiler produce either a jump table, or even a segmented approach to identifying a match. Additionally, CCS support has assured me that the compiler will choose a jump table when cases are "relatively consecutive" and large enough in number. However, with enough cases, then overall function size becomes a limit as the compile will fail once the function exceeds a segment which appears to be 64K (bytes or words, haven't confirmed which)

Independent functions should avoid that specific limitation. Many thanks for the suggestion.

Finishing my avenue of switch investigation, I've now tested up to 2964 cases, which was the most my particular test case could fit into the 64K segment. What's interesting is the compiler stopped using XOR/BRA Z once 1025 cases were hit. Above that point, it switched to CP/BRA Z, which is actually worse. The first 32 cases are CP/BRA for 2 instructions each, and remaining cases get MOV/CP/BRA for 3 each (CP only allows up to 0x1F for the integer literal) In all cases, BRA Z is used to test for equality, no range checks appear to be generated.

For both XOR and CP, there is exactly one test per case, and all branch targets are the case statements. Segmenting the tests should require additional tests, which is fine, but I'm simply not seeing any generated.

I've also tested this behavior across 5.081, 5.078 and 5.065 and the results are identical. At this point, I'm guessing maybe this is due to the specific chip family, dsPIC33E.

In case it is of help to anyone, I did have some success implementing a workaround. Though with the function size limit, and issues due to differences between dsPIC33E and dsPIC33F series, I'll be abandoning it for now. My workaround wound up looking like this:

Ttelmah · Joined: 11 Mar 2010 Posts: 19962

Only 'caveat' with this is your PARSER_BRANCH is a define, so not actually called. Where will the returns take you?.

jdean · Joined: 28 Jun 2018 Posts: 10

To the site which called 'validate_symargs'

The define was created as a way to reuse the construct in different functions. Since these are goto/label pairs, they are scoped to the function where the PARSER_BRANCH define was applied.

I'm playing with the func ptr array you suggested, is PSV the only way to make that array const? I tried enabling it and went from 20% ram used to "Not enough RAM for all variables", and that was with a single entry in the array.

*edit* looks like the error about ram for all variables is due to other const & rom arrays that exist in the project. I came across another post of yours discussing PSV which explained that fairly well. I'm currently looking at using a const array without PSV, which is giving me errors, but I'm hopeful it's just my type signature. Will report back if I get stuck.

Ttelmah · Joined: 11 Mar 2010 Posts: 19962

PSV should not reduce your RAM.
It allows a ROM window to be created into the RAM address space. It doesn't use any RAM itself.
What is your chip?.
You need to enable it right at the start (immediately after the processor include). Adding it later can cause issues.
It costs a little ROM, since every ROM construct then uses an instruction word for every two bytes.
It does though allow you to use const data in places requiring pointers.

jdean · Joined: 28 Jun 2018 Posts: 10

right, the chip is a dsPIC33EP512MU810, my bad

I haven't dug too far into why enabling PSV gave me issues, but I was able to use a const array by first copying the function pointer into ram. I'm happy enough with that, so I'm off to the races.

PSV looks like a very interesting feature, and while I likely won't dig into it right away, I'm sure I'll get around to it. Thanks for bringing it to my attention.

Ttelmah · Joined: 11 Mar 2010 Posts: 19962

It may be a compiler issue, which depends on the exact 'nature' of what you have declared with rom and const.
I had some significant problems earlier this year, and after a lot of investigation CCS fixed these. 5.081, should have the fixes, but it's possible that other data patterns may still be giving problems. CCS said to me only a few days agao: