Introduction
This article aims to present a framework for porting source code from one language or platform to another, using XML as the intermediate format in the conversion process.
Background
The concept of this article came to my mind last weekend or so while I was thinking about some programming and design related things. I don't know about you, the reader, but I have ported some amount of code from one language to another for work and personal projects. In most cases, I either ported the code due to design/usability issues (something lacking in source language/platform) or to support a larger target audience (who use a particular language or platform, usually due to convenience or their skill set).
Porting code, depending on what you are porting it to, can be trivial or complex. But even in the trivial case, it can still be tedious and time consuming to do. And porting to more than one target language or platform will multiply that factor some more. This can be alleviated with some form of automation or tools that handle this.
I don't know if there is any good solution for this out there, but this article describes a possible solution. Due to the complexity of this solution, no code or real examples are presented. This article only describes a theoretical solution that is yet to be implemented.
The framework for porting code
The process of porting source code consists of two steps:
- Port source code (of the source language/platform) to the defined XML schema.
- Translate the intermediate XML to the target language/platform.
Therefore, the framework consists of three parts:
- Source language/platform code parser that converts the code to XML.
- An XML schema that defines things like general programming language constructs such as loops, branches, variables, etc.
- An XML parser that converts the XML generated by step 1 to the target language/platform.
The XML schema would be like a subset of XML, like SVG, XSL, etc., something yet to be defined (and maybe even standardized).
Ideally, to make the conversion effort simple and standardized for users/developers, the following are suggested when using this framework:
- Parser for the source language/platform should be written in the source language or built on the source platform. This will make it easier for the user to tweak the parser, when needed.
- Parser for the target language/platform should be written in XSLT. Since the intermediate format is XML, XSLT provides the easiest language and platform agnostic way of parsing it. Additionally, with an XSL stylesheet template, you can easily derive customized stylesheets for each target language/platform by replacing the placeholder sections with target code and adding/removing some XSLT code snippets as needed.
Ideally, if this framework did exist (with defined XML schema and pre-built parsers), I would like it to cover much of the general programming and scripting languages out there like C, C++, C#, Java, JavaScript, Perl, PHP, Python, VBScript, Visual Basic, Visual Basic .NET, etc.
Though I am thinking of using this framework mostly to port between different languages, I believe it is also possible to extend the framework to support porting between platforms like Windows and Linux/Unix as well as porting from x86 to embedded platforms, etc.
Disclaimer: No framework is perfect. I would not expect this framework to provide flawless conversion between languages or platforms, but it would at least simplify the porting of 50-75 percent of the code since most code is usually of the general programming construct type as mentioned earlier in the article. The parts that aren't well ported can then be custom ported as needed.
Theoretical example of framework in action
Assume you have the following code snippet in VBScript:
For i = 1 To 100
WScript.Echo "Code ran " & i & " times."
Next
Run the code through the source parser and you should get some XML like the following:
<loop type="for">
<condition><var id="i"/><compare type="lessthan"/>100</condition>
<output>Code ran <var id="i"/> times.</output>
</loop>
Run the code through the target parser (for C++) and you should get something like the following:
for(int i = 1; i < 100; i++)
{
cout << "Code ran " << i << " times." << endLine;
}
Disclaimer: This isn't a perfect example, code-wise, but is given just to illustrate the idea. The XML in the example probably is not the optimal schema, I just came up with it real quick. A real schema would take time to design and fine-tune.
Intentions of the framework
The framework is intended to be a platform and language independent solution that only relies on XML and XSLT. The language/platform choice for the source code parser and processing the XML with XSLT would be up to the user.
Conclusion
That is all, any questions or comments? I would really love to work on such a framework along with the code parsers, but I don't have the time for it right now. So, I thought I would post my idea out for others to see and try.
If someone does pick up work on such a framework, please inform me as I would like to assist with such a project.
If such a framework is already in progress or exists, again, please inform me. I may delete this article, or this article might just provide another alternative implementation to such a framework.
History
- July 14th 2007 - Initial post of the article.