Please note that FlexCDC still needs to have source and destination instances defined in the configuration, even when using plugins. This is because the FlexCDC state (how much into which binary log has FlexCDC progressed, and what tables are being captured) is stored in the "dest". Normally when using a plugin, the source and destination instances will be the same. FlexCDC will create a 'flexviews' database with a number of state tables in the destination instance. This also means you still have to use the create_mvlog.php or Flexview's create_mvlog(...) procedure to mark which tables to capture! See the previous blog post about FlexCDC.
When you create the mvlog, there will still be a changelog table created in the dest, just like when not using a plugin. This is because the INFORMATION_SCHEMA is used to get column datatypes and additional information (such as if an int is signed or unsigned) and this lookup is done against the table in the dest. The reason this is needed, is because mysqlbinlog, the utility used to scrape the binlog, produces strange output for large signed integers (it provides the signed and unsigned version), thus FlexCDC must figure out the right one to choose from the actual DDL of the changelog table. FlexCDC can't look at the DDL of the source table though, because the consumer may be behind, and the current structure may not match the structure of the rows in the log.
The new plugin system allows you to do a lot of nifty things like:
The lastest version of Swanhart-Tools includes an Example plugin (in flexcdc/include/example_plugin.php) that simply prints the events that come through it, not logging them into the changelog table at all. There is an example of the output at the end of the post.
The example plugin looks like this:
<?php class FlexCDC_Plugin { static function begin_trx($uow_id, $gsn) { echo "START TRANSACTION: trx_id, $uow_id, Prev GSN: $gsn"; } static function commit_trx($uow_id, $gsn) { echo "COMMIT: trx_id, $uow_id, Last GSN: $gsn"; } static function rollback_trx($uow_id) { echo "ROLLBACK: trx_id, $uow_idn"; } static function insert($row, $db, $table, $trx_id, $gsn) { echo "TRX_ID: $trx_id, Schema:$db, Table: $table, DML: INSERT, AT: $gsn"; print_r($row); } static function delete($row, $db, $table, $trx_id, $gsn) { echo "TRX_ID: $trx_id, Schema:$db, Table: $table, DML: DELETE, AT: $gsn"; print_r($row); } static function update_before($row, $db, $table, $trx_id, $gsn) { echo "TRX_ID: $trx_id, Schema:$db, Table: $table, DML: UPDATE (OLD), AT: $gsn"; print_r($row); } static function update_after($row, $db, $table, $trx_id, $gsn) { echo "TRX_ID: $trx_id, Schema:$db, Table: $table, DML: UPDATE (NEW), AT: $gsn"; print_r($row); } }Important Note: You must define all seven of these functions in your plugin, even if you do not plan to have actions for each of the change types - just leave the function body empty to do no action (the call is simply a noop that case.) Note that the plugin functions must be declared STATIC. This is due to the way that FlexCDC calls the functions.
Transaction state callbacks There are three callback functions which notify the plugin of changes in transaction state. Before I go into what they do, I want to note the $trx_id and $gsn parameters which are present in every callback. Each transaction is assigned a monotonically increasing transaction identifier. This sequence uniquely identifies each transaction that FlexCDC processes. In addition, each row change is assigned a unique sequence number which FlexCDC calls the Generic Sequence Number (or GSN).
As you can see, the start_trx(...) callback (called when a transaction starts) is passed both the new transaction number and also the highest GSN used in the previous transaction. This is called the GSN high water mark (GSNHWM). At transaction commit, the commit_trx(...) callback is called and the transaction id and the last GSN assigned in the transaction are passed into the callback. This same value will appear as the GSNHWM in the next start_trx(...) callback. Finally, at rollback any sequence numbers assigned in that transaction will be re-used, so no GSN is passed to the rollback callback, but a transaction id is, which lets you determine exactly which transaction is rolling back.
Row change callbacks
Each of the four row change callback functions capture a particular change to the data set. Each of the functions take five parameters. The first ($row) is an array which contains the row being acted upon. The second ($db) is the schema which contains the row. The third ($table) is the table that contains the row. Each callback also receives the transaction identifier, and of course, each row change is assigned a unique GSN.
For example:
An update will fire both update_before(...) and update_after(...) callbacks with the row images before and after the change, respectively. There is an example of this at the end of the post.
Configuring FlexCDC to use a plugin
plugin=plugin_file.php
The plugin must be in the FlexCDC include/ directory. You will find example_plugin.php in this directory, to serve as an example.
How it works
Here is the output from the example plugin, for an update that affected 3 rows (update test.t3 set c1 = c1 - 1):
START TRANSACTION: trx_id, 36, Prev GSN: 77 TRX_ID: 36, Schema:test, Table: t3, DML: UPDATE (OLD), AT: 78 Array ( [0] => 2 [1] => 1 ) TRX_ID: 36, Schema:test, Table: t3, DML: UPDATE (NEW), AT: 79 Array ( [0] => 1 [1] => 1 ) TRX_ID: 36, Schema:test, Table: t3, DML: UPDATE (OLD), AT: 80 Array ( [0] => 0 [1] => 2 ) TRX_ID: 36, Schema:test, Table: t3, DML: UPDATE (NEW), AT: 81 Array ( [0] => -1 [1] => 2 ) TRX_ID: 36, Schema:test, Table: t3, DML: UPDATE (OLD), AT: 82 Array ( [0] => 0 [1] => 2 ) TRX_ID: 36, Schema:test, Table: t3, DML: UPDATE (NEW), AT: 83 Array ( [0] => -1 [1] => 2 ) COMMIT: trx_id, 36, Last GSN: 83