So we don't have an out of the box solution for executing pipelines in a web app at the moment. The reason for this is because the side effect behaviors you would like from a batch prediction pipeline are dramatically different from a real time service that passes an input through a prediction execution graph so we would need to build a solution that is mindful of these differences. Assuming that the data available during real time prediction is exactly the same as the data you trained on (which is often times not true because ETL does a lot of stuff to data which raw production systems that your app will be talking to haven't), you would be producing different side effects and you would also have different requirements around things like alerting, model prediction (a lot of runtimes e.g. sklearn BaseEstimator predictions are optimized for batch inputs but not for transactions), and most importantly model artifacts. The last one is particularily interesting because often times the model you train is dependent on the runtime it will be used in e.g. tweaking n_jobs when doing RandomizedCV with any sklearn model. Given all these complexities, we haven't really bit off the real time serving problem, but would love to see contributions that tackle it!