If you have no budget, are mostly a software guy, and are doing this because it might be useful if you can get the performance you are looking for, then consider an AWS cloud FPGA: Amazon EC2 F1 Instances. No messing with drivers, installing tools, and all the other stuff that you have to do before you can do what you want to do. Nicely documented, with low-slope paths for people coming from a software background. Probably the fastest and cheapest way to get your algorithm actually running on an FPGA. And if it does turn out that your new FPGA design is useful, you can make an image and deploy it on effectively an infinite number of hardware units with a click of the mouse.
-josh