Hooks & Seeds
the holy grail
of data management
----------
Throughout the last few years of coding research I've gained a lot of useful skills and even dived fairly deep into mathematics, involving algebra and trigonometry algorithms, and customising formulas to coordinate with the lesser simple text-processing and translation. However, there came a time where none of this is any match (quite literally) for attacking complex code and arbitrary values and variables nested inside subroutines within subroutines within subroutines, for pages and pages and pages, or randomly scattered throughout punctuation-heavy scripts making it like some junkyard pinball torture dungeon, especially those constructed from front-end designer software like MS-Office, and not coded by hand. It's easy when the original data is organised in a logical and predictable way, but that just isn't reality, even when it's you who has authored it to the best of your abilities.
A more robust overall solution is necessary - Hooks & Seeds.
This technique uses a regular expression to capture any arbitrary data matched between two definitions, usually a "key" and then the data's following delimiter such as a semicolon. It then replaces it with a "hook", which is a unique identifier, an iterated number generated by a for-loop, recursively completely throughout the file. At the same time, these "hooks" are likewise generating a list of "seeds", the desired values or variables, with the index of the file matching the number assigned as the "hook". Once finished, it will also generate a "barrow" file, which is a separate utility used to later recombine them, refactor and clean the file, with the "seeds" (after you're done with them) matching back with every "hook" it belongs to. The file will become as it was originally with the same functionality after using the "barrow" and the list of "seeds" ready for anything. I can admit it's the most ugly monstrous code I've ever written, or even seen, but it does work as evidenced by a test-run of both the Quran and Bible combined into a single input and processed by it. As useful as these books are ever going to be for me. Not very "use case" but inarguably certainly impressively competent.
The reason goes just beyond dealing with chunky complex data in a straight-forward way, but turns the exact values and variables into a contiguous dataset called a "datastream", which suddenly opens everything up to some amazing potential. One example is to turn every "color" value from a CSS file into a PPM image, allowing an full CSS file to be edited as an image in its entirety, from CSS to PPM and back again. Even if it was collapsed and jumbles the formatting of the values. Entire thorough colour themes for entire websites can be achieved in mere minutes, something worthy of a business all of its own. Another example is to add noise to derived values, such as "nudging" colours, positions or rotations to give natural subtle variance to otherwise overly-neat and clinical arrangements and visuals. Thousands of lights and millions of vertices can be randomly colourised procedurally. Another example is to use audio volume data as an animation influenced as to be automatically synchronised, such as a lamp flickering to matching audio of faulty electronics, perfectly timed. Another example even involves using smartphone gyroscopic and accelerometer metrics to conduct things like animations, particles and special effects. The limitation is imagination, as once something is an indexed datastream, it can be influenced, manipulated, translated, calculated and outright combobulated, whether mathematically, against another datastream or as itself its own other datastream, artistically. Once the hook numbers are in as assigned by the indices in the seed file, it too can be reordered in any conceivable way, since those hooks can be placed before or after the seeds, and flipped around at any time and even temporarily separated if necessary.
This might all sound like reinventing the wheel, because a hundred different pieces of software can (might, or don't) do these hundred different things. If Excel comes to mind, it ought to. There are some important things to keep in mind, because this is data management, not data processing, ala Excel. This is all command-line, cross-compatible and mostly native Linux, all legacy system requirements and all freeware. This particularly is a little AWK and mostly SED, performing like a drunk clown juggling knives on a unicycle, convoluted but tremendously flexible. It is cascading code, portable scripts which in themselves can construct utilities of their own using data management, such is the case with how it creates a custom barrow-file from within itself. It also keeps all of this stuff "art-centric", controlled by artists, residing in the base and ported to engines from the backend, by artists. It's part of the IDE and can be deployed from the context menu in Windows Explorer to working folders, and perhaps most importantly, instantly and easily deployed to a cloud service in a plug and play manner. I don't think there is any upper limit, since it uses memory very piecemeal with a tradeoff on thrashing the storage to shreds, which makes it a little slower than it could be, but also allows astronomical amounts of "big data" to be handled. The nature of these old Linux tools being run on modern PC hardware more than makes up for any sluggishness, and the cloud is the perfect place to thrash the drives on "someone else's computer" if you don't want to sacrifice a dedicated drive of your own to an early grave. A small price to pay for the avalanche of benefits from doing it all this way. After all, a candle that burns twice as bright also only burns half as long, and a sacrifice of dedicated SSDs is likely inevitable for snappy local results.
Some editing of hexadecimal as binary data was necessary, which turned out a blessing as it was both much easier than I'd anticipated, and exactly concerns my next logical step from here which involves automating materials in a 3D Studio MAX file, a binary proprietary and non-portable format, by using the same for-loop iterations and incrementing material IDs, which can then be used to automatically assign them to matching IDs of geometry from Radiant, another "holy grail" scenario, but for mappers, since it will drastically minimise the barriers to having renders in Mental Ray straight from Radiant, with materials derived from scripts used by the engine itself and all automatically taken care of. In some ways, it's a lot like these seeds and hooks but, again, uses a completely art-centric approach that is both portable and mostly automated. All things considered, file format restraints between software has become all but meaningless anymore, even complex compiled binary data once thought untouchable.
The final interesting thing about all of this and quite incidental, is not only the procedural things that can be done, but also actually making use of artificial intelligence. I thought it was awesome enough already all by itself, but I realised something. While everyone else seems to be sitting on their thumbs waiting for it to just do something for them, of any use, I was (for hopefully the last time ever) in total mad-scientist mode and not paying much attention anyway. If it will cut the lunch of any chump, that doesn't seem to be me, instead of waiting for miracles or frozen in fear, I've simply kept busy, gotten the hardest parts out of the way first, to allow the rest to be easy from here. It didn't really even occur to me that this universe I've existed in, the datasets, datastreams, algorithms, procedural-generation and self-modifying self-generating code - holy cow, it's just pure destiny for any API that is actually going to do anything worth a damn. That may take some years yet, because it's a overhyped stink-ass puppetshow at the moment, but there are interesting crumbs some people are doing with it so far, here and there, and once all farts dissipate and dust settles, then all of this evergreen future-proof backend groundwork I've now laid, that which has almost killed me, will have arms wide open, loving and ready to embrace whatever actual genuine substance the future may bring.
...
(without the labels and comments,
just looks like a sack of grenades
exploded on top of my keyboard)
.
.
.