Minor:
- Add JOSS paper of version 0.12.5
- Update Julia reference DiffEqJump -> JumpProcesses (name change)
- Unfix jinja2
- Update flake8 dependencies after new issues
Minor:
- Document outdated Google Colab version (Python 3.7)
Minor:
- Move near-zero population weight to warning (#563)
- Improve test installation instructions; fix for updates of flake8-print and black (#567)
- Document custom priors (#559)
- Update citation hints, add logo license (#554)
- Fix double logging in grid search
Major changes compared to 0.11.0:
New features:
- Add Silk acceptance rate curve based optimal threshold scheme (#539)
- Interface Copasi model simulators via BasiCO (thanks to Frank Bergmann) (#531)
- Interface simulators in the Julia language via pyjulia (#514)
- Add Wasserstein and Sliced Wasserstein optimal transport distances (#500)
- Finalize sensitivity weighted distance functions using inverse regression models and augmented regression targets (#478, 484)
Technical changes:
- Support python>=3.8 (#543)
Internals:
- Automatic code formatting via black, isort, and nbqa (#506, #508, #544)
Random:
- The logo is pink (#549)
- Add Silk acceptance rate curve based optimal threshold scheme (#539)
- Fix Julia version in tests (#542)
- Apply nbqa black and isort to notebooks (#544)
- Update to stable tox
- Require python>=3.8 (#543)
- Implement color grouping by summary statistics and parameters in sankey plots (#536)
- Add BasiCO-Copasi model interface (thanks to Frank Bergmann) (#531)
- CI: Run Amici without sensitivity calculation (#533)
- Allow pickling of AMICI models (#527)
- Remove external handler cleanup as not used (#529)
- Allow timeout of external handlers (#530)
- Interface Julia simulators via pyjulia (#514)
- Refactor PCA distance, add tests (#518)
- Remove pyarrow as hard dependency for pandas storage (#523)
- Hierarchically structure examples, update "Parameter Inference" introduction (#524)
- Add minimum epsilon difference stopping condition (#525)
- Decompose ABCSMC.run for easier outer loop (#510)
- Unfix sphinx version for documentation (#509)
- Streamline function wrapper objects (#511)
- Remove rpy2 warning upon import of pyabc.external (#512)
- Move ot distance to scipy due to bug in pot 0.8.0 (#512)
- Regularly scheduled CI (#504)
- Fix Dask for Windows (#503)
- Apply the uncompromising Python code formatter black (#506)
- Apply automatic import sorting via isort (#508)
- Implement Wasserstein and Sliced Wasserstein distances (#500)
- Add env variable to bound population size in tests (#501)
- Update to amici 0.11.19 for scaled residual support (#491)
- Add links for online execution of notebooks on Google Colab and nbviewer (#492)
- Tests: Fix early stopping test for first generation (#494)
- Remove codacy due to excessive permission requests
- Tidy up example titles
Summary statistics:
- Allow transformed parameters as regression targets via ParTrafo (#478)
- Add Sankey flow plot (#484)
- Add "informative" notebook to document regression-based summary statistics and weights (#484)
Sampler:
- Speed up redis done-list checking by atomic operations (#482)
Diverse:
- Shorten date-time log (#456)
- Add look-ahead example notebook (#461)
- Fix decoration of plot_acceptance_rates_trajectory (#465)
- Hot-fix redis clean-up (#475)
Breaking changes:
- API of the (Adaptive)PNormDistance was altered substantially to allow cutom definition of update indices.
- Internal weighting of samples (should not affect users).
Semi-automatic summary statistics:
- Implement (Adaptive)PNormDistance with the ability to learn summary statistics from simulations.
- Add sumstat submodule for generic mappings (id, trafos), and especially a PredictorSumstat summary statistic that can make use of Predictor objects.
- Add subsetting routines that allow restricting predictor model training samples.
- Add predictor submodule with generic Predictor class and concrete implementations including linear regression, Lasso, Gaussian Process, Neural Network.
- Add InfoWeightedPNormDistance that allows using predictor models to weight data not only by scale, but also by information content.
Outlier-robust adaptive distances:
- Update documentation towards robust distances.
- Add section in the corresponding notebook.
- Implement PCMAD outlier correction scheme.
Changes to internal sample weighting:
- Do not normalize weights of in-memory particles by model; this allows to more easily use the sampling weights and the list of particles for adaptive components (e.g. distance functions)
- Normalization of population to 1 is applied on sample level in the sampler wrapper function
- In the database, normalization is still by sample to not break old db support; would be nicer to also there only normalize by total sum -- requires a db update though.
Changes to internal object instruction from samples:
- Pass sample instead of weighted_sum_stats to distance function. This is because thus the distance can choose on its own what it wants -- all or only accepted particles; distances; weights; parameters; summary statistics.
Visualization:
- Function to plot adaptive distance weights from log file.
- Allow color customization for plot_credible_intervals plots (#414)
- pyABC logo to grey to fit with both black and white backgrounds (#453)
- Add style set to global figure parameters, enabling dark mode (#454)
Sampler:
- Allow redis dynamical sampler to only wait for relevant particles after a generatio, giving a speed-up without drawbacks (#448)
- Add option to limit number of delayed look-ahead samples to limit memory usage (#428)
Logging:
- Standardize output of floats (#450)
- Use hierarchical logging (ABC.Submodule) (#417)
General:
- Refactor: Remove deprecated nr_samples_per_parameter, internal simplifications (#422)
- Tidy up and minimize dependencies (#436, #441)
- External: Remove simulation files after collecting results (#434)
- Make feather/pyarrow dependency optional for older hardware (#442)
Documentation:
- Add description of JupyterHub to documentation (#439)
CI:
- Test webserver basic functionality
- Rerun stochastically failing tests (all #436)
- Test whether dataframe storage routines work properly (#442)
General:
- Speed up parameter handling (30% internal speed-up) (#387)
- Streamline testing via tox and pre-commit hooks, add flake8 config file (#408)
- Update to python 3.9 (#411)
- Extract PEtab bounds and nominal parameters (#416)
- Allow specifying parameter names in density plots (#416)
- Normalize look-ahead sampling by subpopulation ESS (#418, #421)
Documentation:
- Update contribution and issue guidelines (#408)
- Add example on yaml2sbml usage (#416)
- Clean up user docs on parallelization and storage (#419)
Fixes:
- Fix redis for later started workers (#410)
- Fix PEtab parameter scale import, support all PEtab prior distributions (#413)
Database:
- Add database migration scheme based on alembic (#419)
- Store proposal ids, increment database version to 0 (#419)
- Update branch name master -> main in agreement with https://github.com/github/renaming (#406).
- Notebook improvements:
- Add ground truth to noise notebook.
- Rename notebook "quickstart" -> "model_selection".
- Split notebook tests in 2, extend, reduce test matrix (python 3.7).
- Improve output at the beginning and end (e.g. no end_time at beginning; show duration).
- Add walltime plots (plot_total_walltime, plot_walltime, plot_eps_walltime).
- Make sure ABCSMC.run() is always properly finished (sampler, history) by a wrapper (all #401).
- Redis sampler with look-ahead mode:
- Fix insufficient logging of look-ahead samples.
- Log all accepted particles.
- Add plot_lookahead_... plots for look-ahead mode diagnostics.
- Add global figure parameter settings for pyABC (all #405).
- Check components for their adaptivity for correct application of look-ahead mode (#397).
- Require pandas >= 1.2.0 for PEtab.
- Uniquely identify redis runs via a uuid.
- Secure proper termination of processes for redis and dask (all #338).
- Rework redis sampler, implement a look-ahead mode for pre-defined (#338) and adaptive algorithms (#374) for near-perfect parallel efficiency.
- Allow subprocessing in multiprocessed redis workers (#377).
- Add a static-scheduling redis sampler (#379).
- Check whether epsilon is nan before sampling (#382).
- Remove useless IdentityFakeDistance (#390).
- Add stopping criterion for total walltime (#370).
- Allow to normalize acceptance rate plots by ESS (#346).
- Add a list of pyABC references (#348).
- Update to petabtests 0.0.0a5 (#362).
- Add stopping criterion for total number of samples (#364).
- Remove dill dependency, thus fixing a cloudpickle error, and run selected tests also with python 3.7 (#367).
- Move progress usage to attribute level (#336).
- Add check for whether redis is up already (#337).
- Add a self-tuned finite-space discrete transition (#341).
- Refactor and modularize ABCSMC inference module (#333).
- Make fast random choice function robust across dimensions (#333).
- Remove bkcharts dependency (#328).
- Add optional progress bar to various samplers (#330).
- Refactor package metadata (#330).
- Refactor CI build, add code quality tests (#331).
- Add warning when many zero-prior samples are generated (#331).
- Refactor __all__ imports and docs API build (#312).
- Fix json export of aggregated adaptive distances (#316).
- Apply additional flake8 checks on code quality (#317).
- Assert model input is of type pyabc.Parameter (#318).
- Extend noise notebook to estimated noise parameters (#319).
- Implement optional pickling for multicore samplers; add MacOS pipeline tests (#320).
- Speed up multivariate normal multiple sampling (#299).
- Set default value for OMP_NUM_THREADS=1, stops warnings (#299).
- Base default number of parallel cores on PYABC_NUM_PROCS (#309).
- Update all notebooks to the latest numpy/scipy (#310).
- Update CI test system: latest Ubuntu, python 3.8, simplify R build (#296).
- Add weights logging to adaptive distances (#295).
- Migrate CI tests to GitHub Actions for speed-up, reliability and maintainability (#297, #298).
- Allow separate calibration population sizes, slightly reformulate PopulationStrategy class (#278).
- Allow specifying initial weights for adaptive distances, then without sampling from the prior (#279).
- Check PEtab test suite in tests (#281).
- Exact inference via stochastic acceptor finalized and tested (developed throughout the 0.9 series).
- Support basic PEtab functionality using AMICI ODE simulations (#268).
- Various error fixes (#265, #267).
- Log number of processes used by multiprocessing samplers (#263).
- Implement pyabc.acceptor.ScaledPDFNorm (#269).
- Implement list population size (#274, #276).
- On history loading, automatically find an id of a successful run (#273).
- Add optional check whether database is non-existent, to detect typos.
- Set lower bound in 1-dim KDEs to <= 0 to not wrongly display near-uniform distributions. (both #257)
- Implement redis password protection for sampler and manage routine (#256).
- Make samplers available in global namespace (#249).
- Implement ListTemperature (#248).
- Allow plotting the relative ESS (#245).
- Allow resampling of weighted particles (#244).
- Fix ABCSMC.load with rpy2 (#242).
- Add summary statistics callback plot function (#231).
- Add possibility to log employed norms in StochasticAcceptor (#231) and temperature proposals in Temperature (#232).
- Implement optional early stopping in the MulticoreEvalParallelSampler and the SingleCoreSampler, when a maximum simulation number is exceeded (default behavior untouched).
- Log stopping reason in ABCSMC.run (all #236).
- Implement Poisson (#237) and negative binomial (#239) stochastic kernels.
- Enable password protection for Redis sampler (#238).
- Fix scipy deprecations (#234, #241).
- In ABCSMC.run, allow a default infinite number of iterations, and log the ESS in each iteration.
- Reformulate exponential temperature decay, allowing for a fixed number of iterations or fixed ratios.
- Solve acceptance rate temperature match in log space for numeric stability.
- Perform temperation of likelihood ratio in log space for numeric stability (all #221).
- Fix wrong maximum density value in binomial kernel.
- Allow not fixing the final temperature to 1 (all #223).
- Allow passing id to history directly (#225).
- Pass additional arguments to Acceptor.update.
- Give optional min_rate argument to AcceptanceRateScheme (all #226).
- In plot functions, add parameter specifying the reference value color (#227).
- Fix extras_require directive.
- Fix error with histogram plot arguments.
- Extend test coverage for visualization (all #215).
- ABCSMC.{new,load,run} all return the history with set id for convenience.
- Document pickling paradigm of ABCSMC class (see doc/sampler.rst).
- Always use lazy evaluation in updates (all #216).
- Restructure run function of ABCSMC class (#216, #218).
- Run notebooks on travis only on pull requests (#217).
- Correct weighting in AcceptanceRateScheme (#219).
- Fix error that prevented using rpy2 based summary statistics with non rpy2 based models (#213).
- Introduce acceptor.StochasticAcceptor to encode the stochastic acceptance step generalizing the standard uniform criterion.
- Introduce distance.StochasticKernel to encode noise distributions, with several concrete implementations already.
- Introduce epsilon.Temperature to capture the temperature replacing the traditional epsilons. In addition, multiple concrete pyabc.epsilon.TemperatureSchemes have been implemented that handle the calculation of the next temperature value (all #197).
- Add high-level versions of the kde plotting routines (#204).
- Add unit tests for common epsilon schemes (#207).
- Move to cffi>=1.13.1 after that bug was surprisingly quickly fixed (#195).
- Create sub-module for epsilon (#189).
- Add plots for sample and acceptance rate trajectories (#193).
- Add create_sqlite_db_id convenience function to create database names.
- Temporarily require cffi=1.12.2 for rpy2 on travis (all #185).
- Introduce UniformAcceptor and SimpleFunctionAcceptor classes to streamline the traditional acceptance step.
- Add AcceptorResult and allow weights in the acceptance step (all #184).
- Use latest pypi rpy2 version on travis and rtd since now the relevant issues were addressed there (easier build, esp. for users).
- Update rtd build to version 2 (all #179).
- Render logo text for platform independence.
- Prevent stochastic transition test from failing that often.
- Remove deprecated pd.convert_objects call in web server.
- Allow pandas.Series as summary statistics, by conversion to pandas.DataFrame (all #180).
- Add AggregatedDistance function, and a basic self-tuned version AdaptiveAggregatedDistance.
- Add additional factors to PNormDistance and AggregatedDistance for flexibility. Minor API break: argument w renamed to weights.
- In the adaptive_distances and the aggregated_distances notebooks, add examples where some methods can fail.
- Add plot_total_sample_numbers plot (all #173).
- Some extensions of external simulators interface (#168).
- Add basic plots of summary statistics (#165).
- Document high-performance infrastructure usage (#159).
- Self-administrative: Add social preview (#158), and link to zenodo (#157).
- Fix external deprecations (#153).
- Re-add R related tests (#148).
- Update to rpy2 3.1.0 (major change) (#140).
- pandas data frames saved in database via pyarrow parquet, no longer msgpack (deprecated), with backward compatibility for old databases (#141).
- Redis workers no longer stop working when encountering model errors (#133).
- Minor edits, esp. color, size, axes options to plotting routines.
- Fix dependency updates (rpy2, sklearn) and travis build.
- Add option to limit number of particles for adaptive distance updates.
- Rename confidence -> credible intervals and plots (Bayesian context).
- Extract from database and plot reference parameter values.
- Allow to plot MAP value approximations in credible interval plots.
- Add a general interface to external scripts that allow using pyabc in a simple way in particular with other programing languages.
- Reorganize distance module (minor API change: distance_functions -> distance, and some classes shortened accordingly)
- Allow to pass parameters to Acceptor and Distance.
- Make time and parameter arguments to distance functions optional.
- Rewrite lazy evaluation for calibration sample in ABCSMC class.
- Give default values for ABCSMC.run arguments, which set no stopping criterion.
- Add function and plot for effective sample size.
- Run some notebooks as part of the tests.
- Automatize pypi upload via travis.
- Save number of samples taken in calibration step in database.
- Fix error with reported number of simulations in EpsMixin based samplers.
- Fix several warnings.
- Monitor code quality using codacy and codecov.
- Extend visualization routines: Add histogram, sample number, epsilon trajectory, model probability, and credible interval plots.
- Test visualization routines on travis.
- Fix problem with the History.get_weighted_distances function after update to sqlalchemy>=1.3.0.
- Add random walk based transition for discrete parameters.
- Tidy up returning of rejected samples in Sample (not only summary statistics).
- Recreate a population from file in History.get_population().
- Speed up loading from database by eager loading.
- Document the change of the contribution scheme to master+develop.
- Allow for the database to save no summary statistics for testing purposes.
- Tidy up some pyabc.History methods.
- pyabc.History.id set by default to the largest index (previously 0), corresponding to the latest inserted analysis.
- Fix several errors with the readthedocs (rtd) documentation.
- Speed-up rtd build by removing unnecessary conda and pip requirements.
- Clean-up requirements for travis and rtd.
- Change rtd design from alabaster to sphinx_rtd_theme since it implements better navigation.
- ABCSMC can pass observed summary statistics to distance functions (required for some scale functions, and to make the methods robust to volatile summary statistics).
- Implementation of more scale functions (distance_functions.scales), in particular some taking into account the bias to the observed data.
- AdaptivePNormDistance accepts a Callable as scaling scheme, allowing for more flexibility.
- Can specify kde and number of bins for all visualization routines.
- Can re-submit observed sum stats to ABCSMC.load() function in case it cannot be read correctly from the db.
- Fix serious memory problem resulting from pickling more than necessary for parallel sampling.
- Update logo, readme.
- Make tidying optional in abc-export (default behavior not changed).
- Minor error and warning fixes due to API changes in pandas, seaborn (not used any more), and change of the R installation on travis.
- Default visualizations like plot_kde_matrix() can plot reference values, useful for testing purposes.
- Acceptance transferred to an Acceptor object to allow for more flexibility (i.e. not only on a single comparison as per default).
- This acceptor is passed to the ABCSMC object.
- Update of distance and epsilon synchronized after each iteration and moved to update() methods.
- initialize() for DistanceFunction and Epsilon also called in load() method, given a time point to initialize for, and made optional via a require_initialize flag. This makes sure these objects are always correctly initialized.
- PNormDistance and AdaptivePNormDistance (prev. WeightedPNormDistance) improved to allow for more customization.
- ABCSMC.set_data() method removed.
- API breaks for DistanceFunction, Epsilon, Model.
- Implementation of adaptive distances feature. Distance functions can adapt via an update() method.
- In particular add WeightedPNormDistance (special case: WeightedEuclideanDistance). Also add non-weighted versions.
- Simplify Sampler.sample_until_n_accepted interface.
- Extend Sampler class to allow for customization, e.g. by the distance functions.
- Generalize MedianEpsilon to QuantileEpsilon.
- Make Viserver work with latest bokeh version.
- Add batch sampling now also to the REDIS evaluation parallel sampler (dynamic scheduling)
- Bug fix. Fix a race condition in the redis evaluation parallel sampler (dynamic scheduling). An error occured if a worker tried to start to work on a population after the other workers had already terminated the population.
- Minor bug fix. Ensure that the multicore samplers raise an Exception if an Exception occurs in the worker processes.
- Clarify that weighted distances are not normalized in case of having more than a single simulation per proposed parameter. Also add corresponding tests.
- Add n_worker method to the RedisEvalParallelSampler to enable querying of the number of connected workers.
- Add in-memory database support. Useful, e.g., for benchmarking on slow filesystems or with rather slow network connections.
Make git and gitpython an optional dependency.
- Add "abc-redis-manager reset-workers" command in case workers were unexpectedly killed.
- Adapt web server to changed bkcharts API.
- Bug fix. Rand seed initialization in case of starting multiple workers with --processes in redis server was not correct.
- Bug fix in MulticoreEvalParallelSampler. The multiprocessing.Queue could fill up and cause a deadlock on joining the workers. This is now fixed.
- Rename
population_specification
topopulation_size
. - Improve
plot_kde_matrix
plot ranger are now handled in a less confusing way
- Minor doc fixes
- Python 3.5 support dropped. It might still work for a while with Python 3.5 but this is not guaranteed anymore.
- Add kde matrix visualization function
- Add 2d tumor growth example
- Add Gillespie example
- Change license
- Minor bug fix. Visualization server produced error when JSON information was empty.
- Adapt to new bkcharts packge.
Ensure R source file is reloaded when unpickling R objects.
Add --id
option to abc-export to handle databases with multiple ABC runs.
Ensure that summary statistics have names.
Also add kwargs to plot_kde_2d
which are passed to pcolormesh.
Add --processes
option to abc-redis-worker to start a number of workers
in parallel.
Make rpy2 an optional dependency. If rpy2 is installed, then R can be used if not, the rest will still work.
minor bug fixes
- minor bug fix in plot_kde_2d if the axis is provided
- minor bug fix. The external.R interface did not display the source code correctly.
- minor doc updates
- support serialization of DataFrames used as summary statistics for storage in the database. This feature is still considered experimental.
- Add command line utility to export pyABC's database to different file formats such as csv, feather, html, json and more.
- Add (experimental) support for models defined in R.
- Add some visualization functions for convenience.
Bug fixes for web server.
Minor internal refactorings and minor documetation updates. Nothing a user should notice.
- Deprecate the "set_data" method of the ABCSMC class. Use the "new" method instead.
- Add a "load" method to the ABCSMC class for easier resuming stored ABCSMC runs.
- Add an example to the documentation how to resume stored ABC-SMC runs.
- Rename the acceptance_rate parameter form ABCSMC.run to min_acceptance_rate for clarity. Usage of acceptance_rate is deprecated.
- Various documentation improvements, correcting typos, clarifications, etc.
Easier early stopping models via the IntegratedModel class. Also has now examples.
- Minor refactoring for better Windows compatibility. But runs in serial on Windows
- ABCSMC.run gets a new parameter "acceptance_rate" to stop sampling if the acceptance rate drops too low.
- History.get_all_populations returns a DataFrame with columns "t", "population_end_time", "samples", "epsilon", "particles". That is "nr_samples" got renamed to "samples" and "particles" is new.
Performance improvement. Use MulticoreEvalParallelSampler as default. This should bring better performance for machines with many cores and comparatively small population sizes.
Bug fix. Ensure numpy.int64 can also be passed to History methods were an integer argument is expected.
Bug fix. Forgot to add the new Multicore base class.
MulticoreEvalParallelSampler gets an n_procs parameter.
Change the signature from History.get_distribution(t, m) to History.get_distribution(m, t) and make the time argument optional defaulting to the last time point
- Minor History API changes
- Remove History.get_results_distribution
- rename History.get_weighted_particles_dataframe to History.get_distribution
- Minor ABCSMC API changes
- Mark the de facto private methods as private by prepending an underscore. This should not cause trouble as usually noone would ever use these methods.
- Usability improvements and minor API canges
- ABCSMC accepts now an integer to be passed for constant population size
- The maximum number populations specification has moved from the PopulationStrategy classes to the ABCSMC.run method. The ABCSMC.run method will be where it is defined when to stop.
- Improvements to adaptive population size strategy
- Use same CV estimation algorithm for Transition and PopulationStrategy
- Bootstrapping on full joint space for model selection
- Fix edge case of models without parameters for population size adaptation
- Changes to the experimental adaptive population strategy.
- Smarter update for model selection
- Better CV estimation
- fix minor bug in RVs wrapper. args and keyword args were not passed to the wrapper random variable.
- Add local transition class which makes a local KDE fit.
- Fix corner cases of adaptive population size strategy
- Change the default: Do not stop if only a single model is alive.
- Also include population 0, i.e. a sample from the prior, in the websever visualization
- Minor bug fixes
- Fix inconsistency in ABC options if db_path given as sole string argument
- Add four evaluation parallel samplers
- Dask based implementation
- More communication overhead
- Future executor evaluation parallel sampler
- Very similar to the Dask implementation
- Redis based implementation
- Less communication overhad
- Performs also well for short running simulations
- Multicore evaluation parallel sampler
- In most common cases, where the population size is much bigger than the number of cores, this sampler is not going to be faster than the multicore particle parallel sampler.
- However, on machines with lots of cores and moderate sized populations this sampler might be faster
- Fix SGE regression. Forgot to update a module path on refactoring.
Comply with PEP8 with a few exceptions where it does not make sense. Flake8 runs now with the test. The tests do not pass if flake8 complains.
Remove legacy classes such as the MultivariateMultiTypeNormalDistributions and the legacy covariance calculation. Also remove devideas folder.
Refactor the ABCSMC.set_data and provide defaults.
Provide more default values for ABCSMC. This improves usability.
The new sampler relies on forking instead of pickling for the sample_one
,
simulate_one
and accept_one
functions.
This brings a huge performance improvement for single machine multicore settings
compared to multiprocessing.Pool.map
like execution which repeatedly pickles.
Initial release to the public.