|
Can someone please help with the thinning algorithm used to identify a character? I have the whole logic and code written in C, but I am not able to code the algorithm in C#. Can someone please help me?
The C code is as follows. How do I code the same in C#?
http://pages.cpsc.ucalgary.ca/~parker/thin.c[^]
|
|
|
|
|
After a quick look through the code I think it just translates straight into C#.
The only tricky bit is the b parameter to t1a, which must be passed using ref.
I suggest you pick up an introductory C# book. If you understand the C you should have no problems translating it to C#.
As a side issue it's generally not a good idea to provide links to code. People will be wary of following them. Why not just copy and paste it - if it's not too long.
Regards
David R
|
|
|
|
|
Hello,
I did not run the C version of the algorithm you've pointed to, but there are already-made implementations of thinning in C#, available in the AForge.NET Framework[^]. One option is to use math morphology filters for this: [^]. Another is the simple skeletonization filter[^].
|
|
|
|
|
Hi. I'm currently using AES (Rijndael) cryptography throughout my system. I was making some tests and found an error: when I try to encrypt/decrypt some specific texts, I lose some data at the end of the resulting string. For instance:
15ª Instância
13º lugar ªªAB
The above text, when encrypted/decrypted, results in losing the last two chars, "AB". I made some tests and figured out that any 4 characters after the "ªª" are lost. If you put 5 characters ("ABCDE"), then the algorithm creates a new data block to hold the last letter and nothing is lost.
Apparently it has something to do with the "ª" and "º" characters. If the last block of information (a block has 16 chars, 128 bits, in Rijndael) contains any of these special chars, then the last chars of the block are lost in the round trip. If there is an "ª" or "º" in a block that is not the last one, then everything runs fine.
I'm using the code provided in MSDN site:
http://msdn.microsoft.com/en-us/library/system.security.cryptography.rijndaelmanaged.aspx[^]
and padding the string block myself (using PaddingMode.None on the Rijndael object). Does anyone know why these special chars cause this error?
|
|
|
|
|
Hi,
my guess would be that you are mixing up character counts and byte counts somehow. In UTF-8, your special characters take two bytes each.
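To make the point concrete, here is a small Python sketch (illustrative only, not the poster's C# code): the character count and the UTF-8 byte count of a string diverge as soon as non-ASCII characters such as "ª" appear, so any padding or block math done by character count will miscount the bytes actually being encrypted.

```python
# Character counts vs. UTF-8 byte counts (Python sketch).
text = "15ª Instância"

chars = len(text)              # number of characters
data = text.encode("utf-8")    # encryption works on bytes, not chars
byte_count = len(data)         # "ª" and "â" take 2 bytes each in UTF-8

print(chars, byte_count)       # 13 characters, but 15 bytes
# Padding to a 16-byte block by *character* count therefore miscounts
# the final block and silently drops data on the round trip.
```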
Luc Pattyn [Forum Guidelines] [My Articles]
- before you ask a question here, search CodeProject, then Google
- the quality and detail of your question reflects on the effectiveness of the help you are likely to get
- use the code block button (PRE tags) to preserve formatting when showing multi-line code snippets
|
|
|
|
|
Thanks for the advice, that was the problem. In fact, the real problem is that PaddingMode.PKCS7 wasn't working properly for me: using that mode, in the roundtrip I always got the error "PKCS7 padding is invalid and cannot be removed.". That's why I tried to do the padding myself, but it is not possible to do it on the string, because of the special characters that take more than one byte.
So the solution was: I implemented PKCS7 myself, on the byte array, right before the encryption, using PaddingMode.None for the RijndaelManaged. That worked fine. However, in the decryption it was not necessary to implement it; PaddingMode.PKCS7 worked correctly.
Thanks to this article, "Notes On Padding" section:
http://www.codeproject.com/KB/security/Cryptor.aspx[^]
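For reference, PKCS#7 padding applied to the byte array (after UTF-8 encoding, before encryption) can be sketched like this in Python. The function names and the 16-byte block size are illustrative, not the poster's actual code:

```python
def pkcs7_pad(data: bytes, block_size: int = 16) -> bytes:
    """Append n bytes of value n so the length is a multiple of block_size.
    A full extra block is added when the data is already aligned."""
    n = block_size - (len(data) % block_size)
    return data + bytes([n] * n)

def pkcs7_unpad(padded: bytes) -> bytes:
    """Strip and validate PKCS#7 padding after decryption."""
    n = padded[-1]
    if n < 1 or n > len(padded) or padded[-n:] != bytes([n] * n):
        raise ValueError("PKCS7 padding is invalid and cannot be removed.")
    return padded[:-n]

# Pad the *bytes*, not the string: "ªª" is 4 bytes in UTF-8, not 2 chars.
raw = "13º lugar ªªAB".encode("utf-8")
padded = pkcs7_pad(raw)
assert len(padded) % 16 == 0
assert pkcs7_unpad(padded).decode("utf-8") == "13º lugar ªªAB"
```

The key design point is that padding operates entirely in the byte domain; the string only exists before encoding and after decoding.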
|
|
|
|
|
Hello there,
I'm starting up a little personal project and I'm not too sure of the best way to tackle it. I'm hoping someone could give me a few pointers and/or some ideas on questions I should answer before I dig in.
The problem I'm trying to solve is determining file and project dependencies in a big, sprawling codebase with tens of thousands of files in it. This codebase has evolved over time and can be quite unwieldy. There are many sub-projects inside the build tree and lots of interdependencies between these projects. I want to be able to map these dependencies out.
I found a library that will allow me to monitor file activity, for example file opens and creates. So my thought is that if I run a full clean build while monitoring and recording all of the file activity, I can generate a dependency graph of the entire project. This will also let me determine what I should build, and in which order, when I want to build a tiny piece of the tree. I'm sure there will be other interesting things I can do with this information. I'd also like to try to visualize the entire project and maybe create a file-change heat map from it.
My quandary is how best to record the file activity so that I can build a dependency tree. Ideally I'd like to do this in a multi-threaded way, since our build system can utilize multi-processor machines and build multiple files at once, and still be able to tell which files need to be built before others, which files are grouped in a project, and so on.
My current proposed approach is to record all file activity to a logfile and then post-process it to generate the dependency graphs. I'm still a little hazy about all the data I need to record; I'm currently thinking I'll figure that out as I go, when I find I'm missing some important information.
Any pointers or thoughts would be greatly appreciated.
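As a sketch of that post-processing step, here is a minimal Python illustration with a made-up log format (the record shape, step names, and file names are all hypothetical): if each log record says which build step read or wrote which file, then every file a step writes depends on every file that step read.

```python
from collections import defaultdict

# Hypothetical log records: (build_step, operation, path).
log = [
    ("cl_1",   "read",  "util.h"),
    ("cl_1",   "read",  "util.c"),
    ("cl_1",   "write", "util.obj"),
    ("link_1", "read",  "util.obj"),
    ("link_1", "write", "app.exe"),
]

reads = defaultdict(set)
writes = defaultdict(set)
for step, op, path in log:
    (reads if op == "read" else writes)[step].add(path)

# Each output depends on everything its producing step read.
deps = {out: sorted(reads[step]) for step in writes for out in writes[step]}
print(deps["app.exe"])   # ['util.obj']
print(deps["util.obj"])  # ['util.c', 'util.h']
```

A real log would also need a way to attribute file handles to the right concurrent build step (e.g. by process ID), which is exactly the multi-threading concern raised above.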
Thanxx,
Adam
|
|
|
|
|
Wouldn't static code analysis be a better choice, simply parsing all of the includes, assuming it's a C or C++ program?
Another option is to have your project output the result of the preprocessor; in Visual Studio it will tell you which files it included.
a programmer trapped in a thug's body
|
|
|
|
|
Hmm, I'm not sure how hard it would be to create something that would properly process it statically. There is some thinking that needs to be done around tracking genealogy, but once I figure that out I would think this shouldn't be too hard. Then again, I always get blindsided by small details I didn't see when I was thinking about a problem at a higher level.
Also, I don't think all of the files that need to be processed are C/C++ files; there are resource files as well as other types of files.
And these projects aren't built in VS; they're built in our own build system based on nmake.
Adam
|
|
|
|
|
Since you are using nmake I'm going to assume you are using gcc. While this won't give you your resource files, it will let you see your source dependencies. How are you using the resource files? If they are linked into some type of executable, shouldn't you be able to see what you're linking in from your makefile?
This is from the gcc online manual:
-M
Instead of outputting the result of preprocessing, output a rule suitable for make describing the dependencies of the main source file. The preprocessor outputs one make rule containing the object file name for that source file, a colon, and the names of all the included files, including those coming from -include or -imacros command line options.
Unless specified explicitly (with -MT or -MQ), the object file name consists of the basename of the source file with any suffix replaced with object file suffix. If there are many included files then the rule is split into several lines using \-newline. The rule has no commands.
This option does not suppress the preprocessor's debug output, such as -dM. To avoid mixing such debug output with the dependency rules you should explicitly specify the dependency output file with -MF, or use an environment variable like DEPENDENCIES_OUTPUT (see Environment Variables). Debug output will still be sent to the regular output stream as normal.
Passing -M to the driver implies -E, and suppresses warnings with an implicit -w.
a programmer trapped in a thug's body
|
|
|
|
|
Actually we build using the VC compiler, just not inside VS or a VS project; that system doesn't scale well for our needs. While this is an interesting approach, I'm still favoring monitoring the file activity externally to the compiler, for several reasons:
1. We have C/C++ and C# projects in our tree, and possibly other languages as well that I haven't had to deal with yet.
2. We deal with a number of different building tools that may not have output options like the more mature C/C++ compilers do.
3. Since I have a number of different build tools to track, dealing with each one separately and maintaining the output-processing code for each tool sounds like a big, fragile task.
I feel that taking a more build-tool-agnostic approach to gathering this information will make the tool more reliable and require less maintenance work. I won't have to track down all of the different build tools we use and the command-line options needed to gather the information, and then figure out how to process each tool's output.
Adam
|
|
|
|
|
I need some error correction for a 32-bit integer.
I've found several solutions, like Reed-Solomon, FEC, and parity bits, but all of those are aimed at much longer streams of bits.
I just need error correction for 32 bits, and the error-correction data should be at most 24 bits.
Do any of you have an idea how to solve this problem and which of the error-correction schemes I should use (or should I even create one myself)?
|
|
|
|
|
Hello,
http://en.wikipedia.org/wiki/Hamming_code - that one is pretty easy and scalable (you decide the length of the data chunk; the longer the chunk, the smaller the fraction of bits taken for correction purposes).
Why do you say a parity bit is for a lot of bits? You can use 1 parity bit for 2 bits of data, for 20 bits, for 200... (of course it will be less effective).
Btw - Hamming codes can actually correct error bits, not only check whether the data is corrupted.
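A minimal Hamming(7,4) sketch in Python (illustrative, not production code): 3 parity bits protect 4 data bits, and the syndrome directly names the position of a single flipped bit.

```python
def encode(nibble):
    """Hamming(7,4): return 7 bits [p1, p2, d1, p3, d2, d3, d4]."""
    d = [(nibble >> i) & 1 for i in range(4)]   # d1..d4, LSB first
    p1 = d[0] ^ d[1] ^ d[3]                     # covers positions 1,3,5,7
    p2 = d[0] ^ d[2] ^ d[3]                     # covers positions 2,3,6,7
    p3 = d[1] ^ d[2] ^ d[3]                     # covers positions 4,5,6,7
    return [p1, p2, d[0], p3, d[1], d[2], d[3]]

def decode(c):
    """Correct up to one flipped bit, then return the data nibble."""
    s1 = c[0] ^ c[2] ^ c[4] ^ c[6]
    s2 = c[1] ^ c[2] ^ c[5] ^ c[6]
    s3 = c[3] ^ c[4] ^ c[5] ^ c[6]
    syndrome = s1 + 2 * s2 + 4 * s3             # 1-based error position
    if syndrome:
        c = list(c)
        c[syndrome - 1] ^= 1
    return c[2] | (c[4] << 1) | (c[5] << 2) | (c[6] << 3)

# Every single-bit error in every codeword is corrected.
for nibble in range(16):
    for pos in range(7):
        corrupted = encode(nibble)
        corrupted[pos] ^= 1
        assert decode(corrupted) == nibble
```

For a 32-bit word you would either apply this per nibble (8 codewords, 24 check bits total, one correctable bit per nibble) or use a longer Hamming code with fewer check bits but only one correctable bit overall.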
|
|
|
|
|
I see, a Hamming code would be great indeed (thank you), but it only does single-error correction. Ideally I would like to be able to correct every bit, but that's impossible, so I would like the highest efficiency possible.
I'm now busy trying parity: horizontal + vertical + diagonal, and then checking whether the parities are OK.
But is there another way besides Hamming? Because I think this one is not really efficient.
|
|
|
|
|
Hello,
Actually, I think that Hamming is very efficient for what it offers (especially for longer streams: r parity bits protect up to 2^r - r - 1 data bits, so the relative overhead shrinks as blocks grow); if you need more bits corrected, use a shorter version, etc.
If I have understood you well, and by
Deresen wrote: trying parity, horizontal + vertical + diagonal
you mean putting the input into a matrix and adding parity bits to every row/column, then you won't be able to fix any bits if you have 2 errors (sometimes you can, sometimes not, but you can't guarantee correcting 2 bits), and the cost of the parity bits is much higher than using Hamming.
You can also google for convolutional codes, but I don't know much about them, so I can't guarantee that's what you've been looking for.
|
|
|
|
|
What's wrong with Reed-Solomon?
If I understand your requirements, you need an RS(4,1) code over GF(2^8) or RS(8,2) over GF(2^4), and because the codeword size is small you can use Euclid's algorithm to efficiently compute the key equation rather than Berlekamp-Massey (modified).
check this link out:
http://en.wikipedia.org/wiki/Reed%E2%80%93Solomon_error_correction[^]
|
|
|
|
|
To be honest, I did not really understand Reed-Solomon error correction.
And I've read "2 byte errors per 32-byte block"; this also means 2 bits per 32 bits, and that's too little for me.
The big problem is that I also have to check whether the error-correction data itself is right, so I have to correct that stream as well. Is this possible with Reed-Solomon? And could you please give a small example of how Reed-Solomon works, for instance with a byte?
|
|
|
|
|
Reed-Solomon does provide the ability to detect errors and also correct them. RS is actually quite trivial to understand and implement; it operates on a basic unit called a symbol, which can be any size from 2 bits and up.
My suggestions were based on two symbol sizes, either 8-bit or 4-bit. The 4-bit symbol size would be recommended in your case, as it gives 8 symbols per data block (32 bits). The proposed code of RS(8,2) over GF(2^4) means any two of the 8 symbols can contain any number of bit errors (a 1-4 bit burst error each), and up to both erroneous symbols can be accurately DETECTED and CORRECTED.
Furthermore, if you happen to know which symbols are in error, you can double the correction capability via erasure-correction methods.
If you're familiar with C++ the following library has RS code examples for various bit sizes: http://www.schifra.com/downloads.html[^]
|
|
|
|
|
Thank you very much for your time.
This will give me enough homework for a while.
Let's dive into C++ again.
|
|
|
|
|
What is the expected nature of errors? Is there a significant likelihood of having multiple independent bit errors, or would bit errors more likely be concentrated?
Correcting multiple independent bit errors is very hard and costly in check bits: using 42 bits to code a 32-bit word and allowing for zero, one, or two errors, each code word must account for 1 + 42 + (42*41)/2 = 904 received patterns, which only barely fits within the 1,024 patterns that 10 bits of ECC can distinguish; in practice, known two-error-correcting codes (such as a shortened BCH code) spend about 12 check bits on 32 bits of data.
On the other hand, if all bit errors will be localized in some particular fashion, error detection and correction becomes much simpler. For example, if all errors will be within a single byte, one can store a parity byte along with enough parity information to detect errors in each byte. If a problem is indicated with any byte, the parity byte will allow it to be corrected.
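One such localized scheme can be sketched in Python (an illustrative example, not necessarily the exact construction the post has in mind): a parity bit per byte locates a corrupted byte, and an XOR of all data bytes reconstructs it. Note the stated limitation in the comments: a single parity bit only flags a byte with an odd number of flipped bits.

```python
def protect(data: bytes):
    """Per-byte parity locates a corrupted byte; the XOR byte rebuilds it."""
    parities = [bin(b).count("1") & 1 for b in data]
    xor_byte = 0
    for b in data:
        xor_byte ^= b
    return parities, xor_byte

def repair(data: bytearray, parities, xor_byte):
    """Fix one corrupted byte. A single parity bit per byte only detects an
    odd number of flipped bits; a stronger per-byte check (e.g. a CRC)
    would catch all in-byte error patterns."""
    bad = [i for i, b in enumerate(data)
           if (bin(b).count("1") & 1) != parities[i]]
    if len(bad) == 1:
        rest = 0
        for i, b in enumerate(data):
            if i != bad[0]:
                rest ^= b
        data[bad[0]] = rest ^ xor_byte   # recover the byte from the XOR
    return data

data = bytearray(b"ECC!")
parities, x = protect(bytes(data))
data[2] ^= 0b00000111                    # burst error inside one byte
assert repair(data, parities, x) == bytearray(b"ECC!")
```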
|
|
|
|
|
Hi everyone!
I'm working on a project and I need to do the following:
I have an array of strings, for example:
house3
house2
green23
green.5
H6
H01
G19
G78
..and so on.
I have to extract only the recurrent strings: house, green, H, G.
Any idea about it? (The language is Java, but it doesn't matter; I just need the idea!)
Thanks in advance,
Enrico.
Program your life ^^
|
|
|
|
|
The following should give you some ideas:
http://en.wikipedia.org/wiki/Patricia_tree[^]
...cmk
The idea that I can be presented with a problem, set out to logically solve it with the tools at hand, and wind up with a program that could not be legally used because someone else followed the same logical steps some years ago and filed for a patent on it is horrifying.
- John Carmack
|
|
|
|
|
What exactly are you going to do?
Do you want to identify the strings that contain house, green, H, G?
|
|
|
|
|
Not exactly. I have some strings and I have no idea in advance what they are.
I have to identify the recurrent substrings such as house, green... but I don't have them as input.
Program your life ^^
|
|
|
|
|
Some of the problem parameters aren't clear: Do the recurrent strings always start at the beginning of the line? Do you just ignore non-letters (numbers and punctuation)?
If the answer to both of these questions is yes, you can do this:
1. For each line
2. Extract the string of just the letters from the beginning of the line.
3. See if this string is in a hash table. If so, it's a recurrent string.
4. Else insert this string into the hash table.
If the strings don't always start at the beginning of the line, you have to do multiple insertions for each string: the whole string, the substring starting at the 2nd letter, 3rd letter, etc...
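The four steps above can be sketched in Python for brevity (the poster's language is Java, so this is illustrative only), assuming the recurrent strings start at the beginning of each line:

```python
import re

lines = ["house3", "house2", "green23", "green.5",
         "H6", "H01", "G19", "G78"]

seen = set()
recurrent = set()
for line in lines:
    m = re.match(r"[A-Za-z]+", line)   # step 2: letters at the line start
    if not m:
        continue
    prefix = m.group(0)
    if prefix in seen:                 # step 3: already in the table
        recurrent.add(prefix)
    else:                              # step 4: first sighting, remember it
        seen.add(prefix)

print(sorted(recurrent))   # ['G', 'H', 'green', 'house']
```

Each line is processed once with O(1) hash-table lookups, so the whole pass is linear in the input size.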
|
|
|
|
|